Build any dataset from the web. Filtered to your criteria.

Start new thread

CatchAll by NewsCatcher - Build any dataset from the web. Filtered to your criteria.

Y Combinator

•2mo ago

CatchAll is a web search API that builds structured datasets from the open web. Submit a query, and it scans thousands of web pages, validates every result, and returns clean, deduplicated records — not a ranked list of links, but a dataset of real-world events, ready for workflows and pipelines.

Replies

Best

CatchAll by NewsCatcher

Maker

Hey Product Hunt! Artem here, co-founder of NewsCatcher.

Back in 2020, Maksym and I were data engineers who couldn't find a reliable way to get clean, structured news data — so we built our own infrastructure. Five years later, it powers intelligence workflows at banks, hedge funds, and risk platforms, continuously indexing billions of web pages.

Today we're launching CatchAll — a web search API that builds structured datasets from the open web.

The web is full of real-world events that never get assembled into usable data: which fintechs raised Series A rounds last quarter, which crypto exchanges faced regulatory action this month, which AI companies were acquired this week. CatchAll finds them all, validates every result, and returns a clean deduplicated dataset — not a list of links.

Submit a natural language query and CatchAll retrieves a massive candidate set, filters out noise, and returns structured records ready to pipe into an AI agent, a monitoring workflow, or an analytics pipeline. You can also set up a monitor to re-run any query on a schedule and push fresh results to a webhook automatically.

We're in early days and genuinely here for feedback. Sign up and you'll get 2,000 free credits to start. Share your use case in the comments and we'll 5x them.

Report

2mo ago

Spend with Ukraine

Such a beautiful website you have, guys! 😍

Report

2mo ago

CatchAll by NewsCatcher

Maker

@illya_krupenikov thanks to the most wonderful team! ;)

Report

2mo ago

支持自然语言查询，不需要复杂的语法。还能设置自定义参数（时间范围、语言、地区、域名过滤），我用它专门追踪日本政府官网的新能源政策，精准获取一手信息，排除第三方解读干扰

Report

2mo ago

CatchAll by NewsCatcher

Maker

@summer_dev thanks for sharing that great use-case! If you need the extra credits, just reach out.

Report

2mo ago

This is interesting but how do we make sure that extracted is legit?

Report

2mo ago

CatchAll by NewsCatcher

Maker

@ashishkingdom few layers to this:

Every result comes with source citations — you can always trace back to the original publication
Before extraction, CatchAll clusters related pages about the same event and applies validators to filter out irrelevant results
You can define your own validation rules to tighten precision for your use case

It's not a black box — the sources are always there.

Report

2mo ago

@kotartemiy well thats impressive

Report

2mo ago

Documentation.AI

Whoa, I was browsing through some of your datasets. Fantastic!

Report

2mo ago

Inbox Zero

Love it!

Report

1mo ago

I'm a designer shifting focus to startups. I needed to collect actual information in this field. So I tested CatchAll for this, allying different approaches to search, until I found combination that works really great for me - collect market signals with CatchAll and then fed it to Claude for analysis and got an actual picture of where the market is moving right now. This is a research workflow looks really useful.
And I love that interface is really intuitive - so it was easy for me to try and make my first search.

Report

1mo ago

CatchAll by NewsCatcher

Maker

@julia_shtogren the CatchAll + Claude combo is one of our favourite workflows. Glad the interface made it easy to get started, means a lot coming from a designer 😄

Report

1mo ago

@kotartemiy Thank you, Artem! I also didn't mention that I was really impressed to receive a personalised email after my first search - so I made a short presentation with my search case and sent it back :)

Report

1mo ago