crapePro is a Python CLI tool for web scraping and data extraction. It handles both static HTML sites (blogs, news, e-commerce) and JavaScript-heavy pages (Twitter, YouTube, SPAs) using Playwright's headless browser.
Key features:
- CSS selectors, XPath, table extraction
- Article text extraction
- Metadata extraction (OpenGraph, JSON-LD)
- Site crawling
- Change detection
- Export to JSON, CSV, XLSX, Markdown, SQLite