DocuFlow is a containerized, event-driven pipeline that automates ingestion, OCR, parsing, and analytics for invoices and financial documents.
Regex is dead. DocuFlow is an open-source, Dockerized pipeline that uses Gemini 2.5 Flash to intelligently extract vendor, date, and amount data from invoices. Just drop a file in a folder, and watch the database update in real-time.
- Shashank0701-byte/docuflow.