CatchAll by NewsCatcher - Build any dataset from the web. Filtered to your criteria.

CatchAll is a web search API that builds structured datasets from the open web. Submit a query, and it scans thousands of web pages, validates every result, and returns clean, deduplicated records — not a ranked list of links, but a dataset of real-world events, ready for workflows and pipelines.

Add a comment

Replies

Best

Hey Product Hunt! Artem here, co-founder of NewsCatcher.

Back in 2020, Maksym and I were data engineers who couldn't find a reliable way to get clean, structured news data — so we built our own infrastructure. Five years later, it powers intelligence workflows at banks, hedge funds, and risk platforms, continuously indexing billions of web pages.

Today we're launching CatchAll — a web search API that builds structured datasets from the open web.

The web is full of real-world events that never get assembled into usable data: which fintechs raised Series A rounds last quarter, which crypto exchanges faced regulatory action this month, which AI companies were acquired this week. CatchAll finds them all, validates every result, and returns a clean deduplicated dataset — not a list of links.

Submit a natural language query and CatchAll retrieves a massive candidate set, filters out noise, and returns structured records ready to pipe into an AI agent, a monitoring workflow, or an analytics pipeline. You can also set up a monitor to re-run any query on a schedule and push fresh results to a webhook automatically.

We're in early days and genuinely here for feedback. Sign up and you'll get 2,000 free credits to start. Share your use case in the comments and we'll 5x them.

Such a beautiful website you have, guys! 😍

 thanks to the most wonderful team! ;)

支持自然语言查询,不需要复杂的语法。还能设置自定义参数(时间范围、语言、地区、域名过滤),我用它专门追踪日本政府官网的新能源政策,精准获取一手信息,排除第三方解读干扰

 thanks for sharing that great use-case! If you need the extra credits, just reach out.

This is interesting but how do we make sure that extracted is legit?

 few layers to this:

  1. Every result comes with source citations — you can always trace back to the original publication

  2. Before extraction, CatchAll clusters related pages about the same event and applies validators to filter out irrelevant results

  3. You can define your own validation rules to tighten precision for your use case

It's not a black box — the sources are always there.

 well thats impressive

Whoa, I was browsing through some of your datasets. Fantastic!

Love it!

I'm a designer shifting focus to startups. I needed to collect actual information in this field. So I tested CatchAll for this, allying different approaches to search, until I found combination that works really great for me - collect market signals with CatchAll and then fed it to Claude for analysis and got an actual picture of where the market is moving right now. This is a research workflow looks really useful.
And I love that interface is really intuitive - so it was easy for me to try and make my first search.

 the CatchAll + Claude combo is one of our favourite workflows. Glad the interface made it easy to get started, means a lot coming from a designer 😄

Thank you, Artem! I also didn't mention that I was really impressed to receive a personalised email after my first search - so I made a short presentation with my search case and sent it back :)