Pduut

Pduut

From textbooks to structured knowledge — PDFs, untangled

3 followers

Pduut is an open-source PDF extractor built for students & researchers. It splits books page-by-page, capturing text, equations, and diagrams into structured JSON—perfect for RAG datasets. Join us, contribute, and make learning accessible!
Pduut  gallery image
Pduut  gallery image
Pduut  gallery image
Free
Launch tags:Open SourceEducationGitHub
Launch Team / Built With