One API to scrape, enrich, and understand the web.

Start new thread

Context.dev - One API to scrape, enrich, and extract the internet

Y Combinator

•1d ago

Context.dev is the web context API for AI products and agents. Scrape any URL, crawl sites, turn pages into LLM-ready Markdown, extract structured data into your own schema, capture screenshots, and retrieve logos, colors, fonts, styleguides, company data, and transaction enrichment through one API. YC-backed, no card required, and built so developers or coding agents can integrate in minutes.

Replies

Best

Context.dev

Maker

📌

Hey Product Hunt 👋 I’m Yahia, founder of Context.dev. I built Context.dev because every AI product eventually runs into the same problem: models are powerful, but they don’t know what’s happening on the live web. So teams end up building the same annoying infrastructure over and over again: scrapers, crawlers, browser rendering, proxy handling, sitemap parsing, Markdown cleanup, screenshots, logo extraction, brand enrichment, company data pipelines, and more. Context.dev turns all of that into one API. You can scrape any URL, crawl a site, extract clean LLM-ready Markdown, pull structured data into your own schema, capture screenshots, retrieve logos/colors/fonts/styleguides, enrich companies, and give your agents fresh web context in seconds. The part I’m most excited about: Context.dev is agent-native. You can integrate it yourself, or paste one line into your coding agent and let it sign up, grab an API key, and wire the API into your codebase. We’re YC-backed, have a free tier with no card required, and are already powering products at teams like Mintlify, daily.dev, DocsBot, Chatwoot, and more. Would genuinely love feedback from the PH community, especially from anyone building AI agents, RAG pipelines, onboarding flows, enrichment workflows, or anything that needs live web data. Happy to answer questions all day!

Report

2d ago

@yahia_bakour3 Hey Yahia!

Awesome product, we plan to integrate it with the product we're building. Hopefully it improves our agents web capabilities :)

Cheers!

Report

1d ago

Context.dev

Maker

@ankushkun Amazing! Happy to help with the integration, my email is yahia@context.dev if you have any questions.

Report

1d ago

Content Writing Jobs

@yahia_bakour3 let's goooo Yahia!

Report

1d ago

Context.dev

Maker

@tomaslau LETS GOOOO

Report

12h ago

@yahia_bakour3 This product looks really cool. How does it get around user permission restrictions? Often, we don't want to grant access to overly sensitive

Report

22h ago

Context.dev

Maker

@auula_ hey! what type of permission restrictions?

Report

20h ago

Migma AI

@yahia_bakour3 bro, just changed our logo and you got the new one already! This is almost realtime.

Report

14h ago

Context.dev

Maker

@adam_lab hahaha we work fast.

Report

11h ago

Co.Lab Learning

@yahia_bakour3 This looks great!!

Report

12h ago

Context.dev

Maker

@herenhuang thank you!!

Report

11h ago

Congrats on launch #2 @yahia_bakour3 — "extract structured data into your own schema" caught my eye. I spend a lot of my time building exactly this kind of pipeline for e-commerce catalog data, so two questions from that trench:

How do you handle sites behind serious bot protection (Akamai/Kasada-tier)? Is the escalation abstracted away, or do those URLs fail with an error I can act on?

And is JS rendering on by default, or a per-request flag with its own latency + pricing cost? Recently when I looked for a solution related to it, came across this nice model from https://huggingface.co/jinaai/ReaderLM-v2 Jina AI, its really good.

Report

1d ago

Context.dev

Maker

@akbar_b Excellent question

For context (pun): I'm the founder, i also wrote 90% of the code myself, probably half of that by hand.

I built the API i always wish i had, which means

per minute rate limit, no concurrency bs
relentless focus on quality + cost efficiency
permanent backwards compatibility

Every single request is JS rendered.

1 credit = 1 successful scrape, even if we had to go to the moon to get the data.

Stealth is built in and automatic.

Report

1d ago

@yahia_bakour3 thats wonderful hear every request is JS rendered, will definetly give a strong try :)

Report

1d ago

Context.dev

Maker

@akbar_b please do! reach out to me at yahia@context.dev if you have any questions!

Report

8h ago

Pricing tied to successful scrapes only looks useful! Paying for failed fetches could be painful, I know this from own experience. Curious on the stealth layer as plenty of sites serve completely different content by visitor country (price, availability, language)... Can I pin the exit region per request or does the geo just fall out of whatever proxy the pool grabs that day?

Report

1d ago

Context.dev

Maker

@artstavenka1 yep the country parameter is in there!!

Report

20h ago

Voquill

Interesting. Congrats on the launch. How does it handle sites with aggressive bot protection or frequent layout changes?

Report

23h ago

Context.dev

Maker

@henry_habib we handle both. you never have to worry about a thing :)

Report

20h ago

Context.dev

Maker

@henry_habib Covered by default at no additional cost, i recommend you check it out on our free tier!

Report

17h ago

Notra

We have been using Context.dev at Notra for a bit now and its great, much cheaper than Firecrawl which we used before and with no concurrent browser limits. The team also provides top tier support and listens to our crazy ideas! 10/10 left no crumbs

Report

1d ago

Context.dev

Maker

@dominikkoch man, i really do appreciate you. Absolute pleasure working with you and really happy you're enjoying the platform :)

Report

1d ago

Right, I wasn't doubting the render fidelity, I meant determinism across fetches. Same URL scraped today vs next week: if the live DOM reorders a section, the markdown shape moves with it and an agent that indexed against the first shape drifts. Do you expose a content hash or a diff between fetches, so a pipeline can tell 'page actually changed' from 'page just reordered'? That's the bit that decides whether I wire it into an agent loop or keep it a one-off pull.

Report

22h ago

Context.dev

Maker

@dipankar_sarkar good point and a great idea, i will build this shortly :)

Report

20h ago

WUPHF by Nex.ai

@dipankar_sarkar @yahia_bakour3 I'd be into this.

Report

14h ago

Context.dev

Maker

@dipankar_sarkar @frandias now i have to build it twice as fast

Report

13h ago

Context.dev

Maker

@dipankar_sarkar 100000% this is an excellent point, we're working on a custom "hash" that takes into account whether a page materially changed rather than shipped a new design or animation. Excellent question!

Report

17h ago

Propane

Awesome stuff @yahia_bakour3 !🚀

Making the swathes of data available on the web useful context is essential to the AI era, we have only had great experiences with Context!

Where do you see the puck heading in a year or so?

Report

1d ago

Context.dev

Maker

@benfleming Truly appreciated, it's been an honor to have you as a customer for so long.

The goal is simple, turn the internet into an API.

Report

1d ago

Propane

@yahia_bakour3 wishing you all the best on that journey😁 congrats on the launch!

Report

1d ago

Trace

I have been using context for two of my products and its super reliable. Thank you for amazing work and congrats to the team :)

Report

1d ago

Context.dev

Maker

@tarat Thank you so much Tarun!

We have a-lot more coming soon :)

Report

1d ago

I'm so glad I discovered Context.dev. My first task was collecting SVG logos for hundreds of brands. I tried many tools, paid and open source, and only Context.dev solved it.

Getting started with the API was easy. I especially liked the ready-made prompt for Claude Code. It took just a few minutes from registration to seeing results in my project.

Now I use Context.dev in my personal projects and recommend it to my clients when I do AI transformations of their websites. The use cases keep multiplying for me. Custom reports, logos, branded Open Graph images.

@yahia_bakour3 thank you for this service! Happy to support it 🙌

Report

1d ago

Context.dev

Maker

@zmoki SUPERB to hear, thank you!!

Report

20h ago

Hey @yahia_bakour3, I've been following you and context.dev for some time. Congrats on the launch!

I'm curious to ask, what's the main differentiator between a product like this and Tavily?

I personally had concerns about the accuracy/quality of info coming from Tavily - curious to hear your thoughts.

Report

1d ago

Context.dev

Maker

@ziyad_elhamzawi Tavily is search, they built their own index of the web, hence why so many people have issues about the quality

We have world class extraction infrastructure, so it's far more accurate since we always go and grab data in whatever format from the source directly!

Report

20h ago

1 2 3

•••