Launched this week

CatchAll by NewsCatcher
Build any dataset from the web. Filtered to your criteria.
144 followers
Build any dataset from the web. Filtered to your criteria.
144 followers
CatchAll is a web search API that builds structured datasets from the open web. Submit a query, and it scans thousands of web pages, validates every result, and returns clean, deduplicated records — not a ranked list of links, but a dataset of real-world events, ready for workflows and pipelines.




Free Options
Launch Team / Built With


CatchAll by NewsCatcher
Hey Product Hunt! Artem here, co-founder of NewsCatcher.
Back in 2020, Maksym and I were data engineers who couldn't find a reliable way to get clean, structured news data — so we built our own infrastructure. Five years later, it powers intelligence workflows at banks, hedge funds, and risk platforms, continuously indexing billions of web pages.
Today we're launching CatchAll — a web search API that builds structured datasets from the open web.
The web is full of real-world events that never get assembled into usable data: which fintechs raised Series A rounds last quarter, which crypto exchanges faced regulatory action this month, which AI companies were acquired this week. CatchAll finds them all, validates every result, and returns a clean deduplicated dataset — not a list of links.
Submit a natural language query and CatchAll retrieves a massive candidate set, filters out noise, and returns structured records ready to pipe into an AI agent, a monitoring workflow, or an analytics pipeline. You can also set up a monitor to re-run any query on a schedule and push fresh results to a webhook automatically.
We're in early days and genuinely here for feedback. Sign up and you'll get 2,000 free credits to start. Share your use case in the comments and we'll 5x them.
This is interesting but how do we make sure that extracted is legit?
CatchAll by NewsCatcher
@ashishkingdom few layers to this:
Every result comes with source citations — you can always trace back to the original publication
Before extraction, CatchAll clusters related pages about the same event and applies validators to filter out irrelevant results
You can define your own validation rules to tighten precision for your use case
It's not a black box — the sources are always there.
@kotartemiy well thats impressive
支持自然语言查询,不需要复杂的语法。还能设置自定义参数(时间范围、语言、地区、域名过滤),我用它专门追踪日本政府官网的新能源政策,精准获取一手信息,排除第三方解读干扰
CatchAll by NewsCatcher
@summer_dev thanks for sharing that great use-case! If you need the extra credits, just reach out.
Spend with Ukraine
Such a beautiful website you have, guys! 😍
CatchAll by NewsCatcher
@illya_krupenikov thanks to the most wonderful team! ;)
Documentation.AI
Whoa, I was browsing through some of your datasets. Fantastic!
Inbox Zero
Love it!