All activity
LEI QIN left a comment
Here is a demo for an image-based PDF (including a table): https://pdf2markdown.io/results/req_034c1ed6-d0ce-4d7e-b980-e0e7f9e8d896?embed=true&token=4ec01a3c2f20900b99219c48415268100d0d006676c1b9fcb9c19e07b6b4d62d&expires=1776261732059

PDF2Markdown: Parse PDF and images to Markdown in seconds
LEI QIN left a comment
Demo: https://pdf2markdown.io/results/req_034c1ed6-d0ce-4d7e-b980-e0e7f9e8d896?embed=true&token=4ec01a3c2f20900b99219c48415268100d0d006676c1b9fcb9c19e07b6b4d62d&expires=1776261732059

PDF2Markdown: Parse PDF and images to Markdown in seconds
LEI QIN left a comment
I built PDF2Markdown because moving PDFs and scans into Markdown for RAG, docs pipelines, or AI agents was painful. Generic extraction tools often break on image-based PDFs, and maintaining custom parsers was costly. The approach: a single API that handles native PDFs, scanned images, and mixed documents, with layout-aware parsing and optional structured JSON. Then came the CLI and agent skills...

PDF2Markdown: Parse PDF and images to Markdown in seconds
PDF to Markdown — convert PDF and image documents to clean Markdown. Built for developers: REST API for docs, content migration, and pipelines. Handles text and image-based PDFs (scans, faxes) with layout-aware parsing. Returns Markdown and structured JSON. Sync and async endpoints. Official JS SDK and CLI. Skills for Cursor, Claude Code, Codex, Windsurf, and other AI agents so they can “convert this PDF” directly from chat.

PDF2Markdown: Parse PDF and images to Markdown in seconds
LEI QIN started a discussion
A demo for an image-based PDF
https://pdf2markdown.io/results/req_034c1ed6-d0ce-4d7e-b980-e0e7f9e8d896?embed=true&token=4ec01a3c2f20900b99219c48415268100d0d006676c1b9fcb9c19e07b6b4d62d&expires=1776261732059
LEI QIN left a comment
✨ What makes us unique
LLM-First Design: Unlike traditional scrapers, AnyCrawl outputs clean, structured data optimized for Large Language Models - with native JSON extraction and Markdown formatting
Multi-Engine Architecture: Choose between Cheerio (fastest), Playwright (full JS), or Puppeteer (Chrome) - all in one tool
True High Performance: Native multi-threading and multi-process support...

AnyCrawl: AnyCrawl is a high-performance alternative to Firecrawl
