OpenDataLab

High-quality datasets and tools for AI

OpenDataLab

1yr ago

MinerU: One-stop Data Extraction Tool - PDF Document Extraction; Web Page & E-book Extraction

MinerU is a one-stop, open-source, high-quality data extraction tool that supports PDF, web page, and e-book extraction. It converts complex multi-modal documents that mix pictures, tables, formulas, and other elements into clear, easy-to-analyze Markdown.