Camelot

PDF table extraction for humans

#5 Product of the Day · October 13, 2018

Camelot is a Python library for extracting tabular data from PDFs.

Discussion

Vinayak Mehta (Maker) @vortex_ape · I love to create art.
Hello Product Hunt! I'm Vinayak, creator of Camelot.
There are many open-source (Tabula, pdf-table-extract) and closed-source (smallpdf, pdftables) tools for extracting tables from PDFs. But they either give a nice output or fail miserably, with nothing in between. That is not helpful, since everything in the real world, including PDF table extraction, is fuzzy, and it pushes people to write ad-hoc extraction scripts for each type of PDF table.
We at SocialCops created Camelot to give users complete control over table extraction. It is a Python library for extracting tabular data from PDFs! You can install it using conda or pip. Check out the installation instructions in the README: https://www.github.com/socialcop...
Great documentation is available here: https://camelot-py.readthedocs.i...
We would be really grateful for any feedback that can help us improve it! You can follow the development on GitHub.
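For anyone wondering what "control over table extraction" might look like in practice, here is a minimal sketch, not an official example. It assumes Camelot is already installed and uses `camelot.read_pdf` with its `pages` and `flavor` arguments. The wrapper `extract_tables` and the `SUPPORTED_FLAVORS` tuple are hypothetical names made up for illustration, and the tuple lists only the two flavors this sketch handles.

```python
# Hypothetical helper for illustration; it is not part of Camelot itself.
# Only the two parsing flavors this sketch handles are listed here.
SUPPORTED_FLAVORS = ("lattice", "stream")


def extract_tables(pdf_path, pages="1", flavor="lattice"):
    """Return a list of (DataFrame, parsing_report) pairs, one per table found.

    "lattice" targets tables with ruled lines between cells; "stream" targets
    tables whose columns are separated by whitespace.
    """
    if flavor not in SUPPORTED_FLAVORS:
        raise ValueError(
            f"flavor must be one of {SUPPORTED_FLAVORS}, got {flavor!r}"
        )

    # Imported lazily so the argument check above works without Camelot installed.
    import camelot

    tables = camelot.read_pdf(pdf_path, pages=pages, flavor=flavor)

    # Each table exposes its contents as a pandas DataFrame (.df) and a dict of
    # quality metrics (.parsing_report) that helps judge a fuzzy extraction.
    return [(table.df, table.parsing_report) for table in tables]
```

A call such as `extract_tables("report.pdf", pages="1-3", flavor="stream")` (where `report.pdf` is a placeholder file name) would return one pair per detected table. Checking the parsing report before trusting a table is one way to deal with results that are neither perfect nor a complete failure.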
Ayush Chandra @ayush_chandra · Research Intern & Tech Evangelist
Great job!! 😊 Will check it out