OpenDataLab

OpenDataLab

High-quality datasets and tools for AI

About

OpenDataLab Building an AI open data ecosystem to fully support large models with data elements. OpenDataLab, established by the Shanghai AI Lab's large model database team, is the chosen platform for Chinese Large Model Corpus Data Alliance's open data services. It offers comprehensive AI data support to developers, mitigating data processing risks and fostering AI research and applications

Badges

Tastemaker
Tastemaker
Gone streaking
Gone streaking

Forums

OpenDataLab

1yr ago

MinerU:One-stop Data Extraction Tool - PDF Document Extraction;Web Page & E-book Extraction

MinerU is a one-stop, open-source, high-quality data extraction tool, supports PDF/webpage/e-book extraction. Complex multi-modal documents mixed with pictures, tables, formulas, etc. into clear and easy-to-analyze Markdown format.
View more