Collins Munyao

Docunerve - PDF extraction API for scanned and digital documents

by•
Most parsers break on scanned PDFs. Docunerve triages every page atomatically - digital pages parse instantly, scanned pages route through OCR - and returns clean Markdown, JSON, Text or HTML. Auto tags type, entities and topics on every call. Free to start.

Add a comment

Replies

Best
Collins Munyao
Maker
šŸ“Œ
Hey Product Hunt šŸ‘‹ I'm the solo founder of Docunerve, building from Nairobi. I built this after hitting the same wall over and over: PDF parsers work beautifully in the demo, then you hand them a real scanned contract or invoice and they return an image placeholder and nothing usable. The fix was always the same - stitch together three tools and a fragile OCR step. So I built one API that handles it end to end. You send a PDF; Docunerve triages each page automatically. Digital pages parse instantly, scanned pages route through OCR, and you get back clean Markdown, JSON (with bounding boxes for RAG citations), or HTML. Every response auto-tags the document type, language, entities, and topics - no extra calls. A few things I'm proud of: • It's built on a parser ranked #1 in open benchmarks - 0.907 overall, 0.928 on tables • Bounding boxes on every element, which makes it genuinely useful for RAG pipelines • Flat per-page pricing instead of confusing credit multipliers It's free to start, no credit card. I'd love for you to throw your messiest PDF at it and tell me what breaks - that feedback is exactly what I need right now. I'll be here all day answering everything. Thank you for taking a look šŸ™