Context.dev - One API to scrape, enrich, and extract the internet

Context.dev is the web context API for AI products and agents. Scrape any URL, crawl sites, turn pages into LLM-ready Markdown, extract structured data into your own schema, capture screenshots, and retrieve logos, colors, fonts, styleguides, company data, and transaction enrichment through one API. YC-backed, no card required, and built so developers or coding agents can integrate in minutes.

Hey Product Hunt 👋 I’m Yahia, founder of Context.dev.

I built Context.dev because every AI product eventually runs into the same problem: models are powerful, but they don’t know what’s happening on the live web. So teams end up building the same annoying infrastructure over and over again: scrapers, crawlers, browser rendering, proxy handling, sitemap parsing, Markdown cleanup, screenshots, logo extraction, brand enrichment, company data pipelines, and more.

Context.dev turns all of that into one API. You can scrape any URL, crawl a site, extract clean LLM-ready Markdown, pull structured data into your own schema, capture screenshots, retrieve logos/colors/fonts/styleguides, enrich companies, and give your agents fresh web context in seconds.

The part I’m most excited about: Context.dev is agent-native. You can integrate it yourself, or paste one line into your coding agent and let it sign up, grab an API key, and wire the API into your codebase.

We’re YC-backed, have a free tier with no card required, and are already powering products at teams like Mintlify, daily.dev, DocsBot, Chatwoot, and more.

Would genuinely love feedback from the PH community, especially from anyone building AI agents, RAG pipelines, onboarding flows, enrichment workflows, or anything that needs live web data. Happy to answer questions all day!

 Hey Yahia!

Awesome product, we plan to integrate it with the product we're building. Hopefully it improves our agents' web capabilities :)

Cheers!

 Amazing! Happy to help with the integration, my email is if you have any questions.

 let's goooo Yahia!

 LETS GOOOO

 This product looks really cool. How does it get around user permission restrictions? Often, we don't want to grant access to overly sensitive

 hey! what type of permission restrictions?

 bro, just changed our logo and you got the new one already! This is almost realtime.

 hahaha we work fast.

 This looks great!!

 thank you!!

Congrats on launch #2 — "extract structured data into your own schema" caught my eye. I spend a lot of my time building exactly this kind of pipeline for e-commerce catalog data, so two questions from that trench:

How do you handle sites behind serious bot protection (Akamai/Kasada-tier)? Is the escalation abstracted away, or do those URLs fail with an error I can act on?

And is JS rendering on by default, or a per-request flag with its own latency + pricing cost? Recently, when I was looking for a solution to this, I came across a nice model from Jina AI; it's really good.

 Excellent question

For context (pun): I'm the founder, I also wrote 90% of the code myself, probably half of that by hand.

I built the API I always wished I had, which means

  • per minute rate limit, no concurrency bs

  • relentless focus on quality + cost efficiency

  • permanent backwards compatibility

Every single request is JS rendered.

1 credit = 1 successful scrape, even if we had to go to the moon to get the data.

Stealth is built in and automatic.

 That's wonderful to hear that every request is JS rendered, will definitely give it a serious try :)

 please do! reach out to me at if you have any questions!

Pricing tied to successful scrapes only looks useful! Paying for failed fetches can be painful, I know this from my own experience. Curious about the stealth layer, as plenty of sites serve completely different content by visitor country (price, availability, language)... Can I pin the exit region per request or does the geo just fall out of whatever proxy the pool grabs that day?

 yep the country parameter is in there!!

Interesting. Congrats on the launch. How does it handle sites with aggressive bot protection or frequent layout changes?

 we handle both. you never have to worry about a thing :)

 Covered by default at no additional cost, i recommend you check it out on our free tier!

We have been using it at Notra for a bit now and it's great, much cheaper than Firecrawl which we used before and with no concurrent browser limits. The team also provides top tier support and listens to our crazy ideas! 10/10 left no crumbs

 man, i really do appreciate you. Absolute pleasure working with you and really happy you're enjoying the platform :)

Right, I wasn't doubting the render fidelity, I meant determinism across fetches. Same URL scraped today vs next week: if the live DOM reorders a section, the markdown shape moves with it and an agent that indexed against the first shape drifts. Do you expose a content hash or a diff between fetches, so a pipeline can tell 'page actually changed' from 'page just reordered'? That's the bit that decides whether I wire it into an agent loop or keep it a one-off pull.

 good point and a great idea, i will build this shortly :)

   I'd be into this.

   now i have to build it twice as fast

 100000% this is an excellent point, we're working on a custom "hash" that takes into account whether a page materially changed rather than shipped a new design or animation. Excellent question!
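To make the "actually changed" vs "just reordered" distinction from this thread concrete, here is a minimal sketch in Python. It is not Context.dev's hash (that is still being built, per the replies above) and it does not call any Context.dev API. It assumes you already have two Markdown strings from two fetches of the same URL. The function names (`classify_change`, `_blocks`, `_digest`) are hypothetical, and treating blank-line-separated blocks as the unit of comparison is an assumption.

```python
import hashlib
import re


def _blocks(markdown: str) -> list[str]:
    """Split Markdown into blank-line-separated blocks, collapsing whitespace."""
    blocks = []
    for raw in re.split(r"\n\s*\n", markdown):
        text = " ".join(raw.split())
        if text:
            blocks.append(text)
    return blocks


def _digest(parts: list[str]) -> str:
    """SHA-256 over the parts, with a separator so block boundaries matter."""
    h = hashlib.sha256()
    for part in parts:
        h.update(part.encode("utf-8"))
        h.update(b"\x00")
    return h.hexdigest()


def classify_change(old_md: str, new_md: str) -> str:
    """Return 'unchanged', 'reordered', or 'changed' for two fetches of a page."""
    old, new = _blocks(old_md), _blocks(new_md)
    # Order-sensitive hash: identical blocks in identical order.
    if _digest(old) == _digest(new):
        return "unchanged"
    # Order-insensitive hash: same set of blocks, different order.
    if _digest(sorted(old)) == _digest(sorted(new)):
        return "reordered"
    return "changed"


if __name__ == "__main__":
    first = "# Title\n\nPrice: $10\n\nIn stock"
    second = "# Title\n\nIn stock\n\nPrice: $10"
    print(classify_change(first, second))
```

A pipeline could store both digests per URL and only re-index when the order-insensitive one moves. This sketch only catches exact block reorders; it would still report a redesign that rewrites wording as "changed".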

Awesome stuff! 🚀

Turning the swathes of data available on the web into useful context is essential to the AI era, and we have only had great experiences with Context!

Where do you see the puck heading in a year or so?

 Truly appreciated, it's been an honor to have you as a customer for so long.

The goal is simple, turn the internet into an API.

 wishing you all the best on that journey 😁 congrats on the launch!

I have been using Context for two of my products and it's super reliable. Thank you for the amazing work and congrats to the team :)

 Thank you so much Tarun!

We have a lot more coming soon :)

I'm so glad I discovered Context.dev. My first task was collecting SVG logos for hundreds of brands. I tried many tools, paid and open source, and only Context.dev solved it.

Getting started with the API was easy. I especially liked the ready-made prompt for Claude Code. It took just a few minutes from registration to seeing results in my project.

Now I use Context.dev in my personal projects and recommend it to my clients when I do AI transformations of their websites. The use cases keep multiplying for me. Custom reports, logos, branded Open Graph images.

thank you for this service! Happy to support it 🙌

 SUPERB to hear, thank you!!

Hey, I've been following you and Context.dev for some time. Congrats on the launch!

I'm curious to ask, what's the main differentiator between a product like this and Tavily?

I personally had concerns about the accuracy/quality of info coming from Tavily - curious to hear your thoughts.

 Tavily is search: they built their own index of the web, which is why so many people have issues with the quality.

We have world class extraction infrastructure, so it's far more accurate since we always go and grab data in whatever format from the source directly!
