Context.dev - One API to scrape, enrich, and extract the internet

by•
Context.dev is the web context API for AI products and agents. Scrape any URL, crawl sites, turn pages into LLM-ready Markdown, extract structured data into your own schema, capture screenshots, and retrieve logos, colors, fonts, styleguides, company data, and transaction enrichment through one API. YC-backed, no card required, and built so developers or coding agents can integrate in minutes.

Add a comment

Replies

Best

Been using this to replace a scraper that kept breaking. One API call, clean markdown back. The brand data extraction (logos, colors, fonts) is surprisingly useful for onboarding flows. Handles JS-heavy sites better than I expected. 5000+ customers is a decent trust signal. Some niche sites still struggle, but overall solid.

 amazing! Thank you so much.

the markdown output came back clean enough that i barely had to clean it up before feeding it into my agent. brand extraction on a few random sites was surprisingly on point too, definitely beats maintaining my own scrapers.

 incredible to hear!

The "turn any page into LLM-ready Markdown" part is the piece I keep wishing existed — I do a lot of research-heavy work and getting clean, structured text out of messy pages is always the bottleneck before anything downstream is useful. Two genuine questions: how does it hold up on JS-heavy or auth-gated pages that don't render server-side, and is scraped content cached/versioned so I can tell whether I'm reading a fresh pull or a stale one? Typed SDKs across TS/Python/Ruby is a smart touch.

 this reads a bit like AI but yes!

How is it different from Exa?

Really interesting approach Yahia. I'm curious—what was the hardest part to get right while building a single API that handles scraping, extraction, and enrichment reliably at scale?

Handling scrapers, proxies, and sitemap parsing over and over is definitely a major headache when building AI products. The agent-native aspect—where a coding agent can literally sign up and wire the API itself—sounds incredibly powerful and futuristic. How does Context.dev handle websites that have heavy anti-bot protections or complex CAPTCHAs when an agent triggers a crawl?

Sounds great. How do you handle onclick events, if data is hidden and you have to click on an element to actually see it?

web data + enrichment in one api is the real bottlneck for agents rn 👏 well deserved #1

Would love to hear the difference between this and Apify?

 Apify is a marketplace of APIs

We own everything end to end, so we can offer you killer rates at a much higher quality, and that's if you're only talking about scraping. The rest of the APIs can be mixed/matched to build something much more interesting.

Great product! Congrats on the launch đź’Ż

First
Previous
•••
567
Next
Last