Crustdata

People and company search APIs built for AI agents


Live company and people data via APIs. Search, enrich, or watch a person or company in real time. Get instant updates when an event happens, for example when a person (a) changes jobs, (b) gets a promotion, (c) adds a new skill, or (d) makes a new post.
This is the 3rd launch from Crustdata.

Web search API by Crustdata

Launched this week
The fastest and most accurate web search API for AI agents
Let your AI agent search across the whole web with the fast, accurate web search API by Crustdata:
  • Control your search results by filtering on website domain, date posted, source, and geolocation.
  • Search across news articles and social posts published as recently as an hour ago.
  • Search for open source creators and authors of research papers.
  • Get exact answers to your query via deep research mode.
  • Combine with our fetch API to extract all content from a web page.
Free Options
Launch tags: Sales, API, Developer Tools

Garry Tan

Hi Product Hunt, Garry here. I’m excited to help launch Crustdata (YC F24)'s newest product: The WebSearch API.

It lets you search the whole web via a simple API, making it accessible for AI agents, products and tools. 

Think of this API as the fundamental layer on which you can build any tool that needs public web info (people, companies, posts, pricing pages, research articles, docs, blogs, etc). 

Why it matters:

The web wasn’t built for AI agents. It was built for the human eye. Inconsistent HTML, CAPTCHAs, and brittle scrapers make the web hard to use as input for agents.

As a result, agents miss vital qualitative information and cannot complete even simple tasks that depend on information from the web.

The WebSearch API returns search results and page content as clean, predictable JSON you can use as input for AI agents, workflows, and apps.

What you can build with it:

  1. AI recruiting tools: Find candidates such as researchers and engineers from public work (papers, GitHub profiles).

  2. AI SDRs and GTM agents: Find more information about prospects from podcasts, blogs, forums, and feed this into personalized outreach workflows.

  3. Competitive Research Tools: Track competitor pricing, product launches, and market positioning so you can detect opportunities and risks.

  4. SEO Tools: Pull metadata, citations, and ranking data without building fragile scrapers.

  5. Investment agents: Aggregate company information and market sentiment from news, blogs, and product pages to spot investment opportunities and risks.

  6. AI coding agents: Fetch and parse the latest documentation and recent library updates so they can generate accurate code.

How to integrate into your AI agent or tool:

Get started in under 60 seconds. Add these lines of code with your API key to access the entire web as structured JSON:


curl 'https://api.crustdata.com/screener/web-search?fetch_content=true' \
  --request POST \
  --header 'Content-Type: application/json' \
  --data '{
    "query": "Open AI recent product launches",
    "geolocation": "US"
  }'


That’s it. It’s as simple as that. 
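For agents written in Python, the same request could be assembled roughly as follows. This is a minimal sketch using only the standard library: the endpoint, the `fetch_content` query parameter, and the `query` and `geolocation` fields are taken from the curl example above, while the function names and the `extra_headers` parameter are hypothetical. The curl example does not show how the API key is passed, so the sketch leaves authentication to the caller, who would supply whatever header the API documentation specifies.

```python
import json
import urllib.request

# Endpoint taken from the curl example in the post.
API_URL = "https://api.crustdata.com/screener/web-search"


def build_web_search_request(query, geolocation="US", fetch_content=True,
                             extra_headers=None):
    """Build (but do not send) a POST request mirroring the curl example.

    extra_headers is a hypothetical hook for the authentication header;
    the post does not show its name or format, so none is assumed here.
    """
    url = f"{API_URL}?fetch_content={'true' if fetch_content else 'false'}"
    body = json.dumps({"query": query, "geolocation": geolocation}).encode("utf-8")
    headers = {"Content-Type": "application/json"}
    headers.update(extra_headers or {})
    return urllib.request.Request(url, data=body, headers=headers, method="POST")


def send(request, timeout=10):
    """Send the request and return the decoded JSON body.

    The response schema is not described in the post, so the parsed
    JSON is returned as-is without assuming any field names.
    """
    with urllib.request.urlopen(request, timeout=timeout) as response:
        return json.load(response)
```

Building the request separately from sending it makes it easy to inspect or log the exact payload an agent is about to submit, for example `send(build_web_search_request("Open AI recent product launches", extra_headers=auth_headers))`, where `auth_headers` holds the header described in the API documentation.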


For non-technical users:

If you aren’t comfortable with coding, don’t worry. Our documentation is written to work with AI coding assistants like Claude or ChatGPT.

Simply:

  1. Copy our API documentation

  2. Paste it into Claude Code or Cursor

  3. Describe what data you need in plain English (e.g., "Find all TypeScript developers who contributed to Next.js projects")

  4. Let the AI generate the API call for you

  5. Get working code you can use immediately, without writing a single line yourself

Example data you can find with this API:

For Sales: 

  • To find specific pain points or angles to personalize messaging through information from a funding article: 

Search query: {founder name}  site:techcrunch.com

  • To find specific pain points or angles to personalize messaging based on information from blogs or podcasts the prospect appeared on: 

Search query: {founder name} “podcasts” “blogs”


For Recruiting:

  • To find engineers with a specific skillset from GitHub: 

Search query: "i'm a" "frontend developers" "react" site:github.com

  • To find researchers by publication topic: 

Search query: "machine learning" "interpretability" site:arxiv.org


For Market Research and Investment:

  • To analyze the recent market sentiment of a company: 

Search query: Salesforce AND (sentiment OR review OR "market perception" OR "analyst sentiment")

Date range: startDate: July 13, 2025 | endDate: Jan 13, 2026

  • To surface product launches or interesting events about a company:

Search query: lovable AND ("product launch" OR "new feature" OR "beta" OR "partnership" OR "introducing") site:lovable.dev


For AI coding agents: 

  • To track library updates and changes:

Search query: "React 19" AND ("breaking changes" OR "migration guide" OR "what's new")
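The example queries above share a few building blocks: quoted phrases, OR groups, AND joins, and a `site:` restriction. A small helper can assemble them so an agent does not have to concatenate strings by hand. This is only a sketch: the helper names (`quote`, `any_of`, `build_query`) are hypothetical, and it assumes the API interprets these operators the way the examples in this post suggest.

```python
def quote(term):
    """Wrap a term in double quotes for exact-phrase matching."""
    return '"' + term.replace('"', "") + '"'


def any_of(*phrases):
    """Build a parenthesized OR group of quoted phrases."""
    return "(" + " OR ".join(quote(p) for p in phrases) + ")"


def build_query(*parts, site=None, joiner=" AND "):
    """Join query parts and optionally restrict results to one site.

    Use joiner=" " for the space-separated style seen in the
    recruiting examples.
    """
    query = joiner.join(p for p in parts if p)
    if site:
        query += f" site:{site}"
    return query


# Reproduces the "React 19" example from the post.
react_query = build_query(
    quote("React 19"),
    any_of("breaking changes", "migration guide", "what's new"),
)

# Reproduces the arXiv example, which uses space-separated phrases.
arxiv_query = build_query(
    quote("machine learning"), quote("interpretability"),
    site="arxiv.org", joiner=" ",
)
```

The resulting string would then be sent as the `query` field of the request body.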

Key Features and Differentiators:

  • Fastest WebSearch API, with subsecond latency

  • Most accurate WebSearch API compared to other Web APIs

  • Access to the latest web pages, which are indexed and available in search results within minutes of being updated

  • Results from hundreds of sources from the web for one query

  • Filters to control results (language, location, site/source, date ranges)

  • APIs built for production with high throughput

Thanks for your support!

S.S. Rahman

@garrytan Congrats to Garry and the Crustdata team on the launch. This feels like a genuinely foundational layer for AI agents, especially given how brittle traditional scraping is.

Nithish A

@syed_shayanur_rahman Thanks for the support Syed!

Boris Gostroverhov

@garrytan Garry, just go ahead! This product really looks impressive 🦄

Nithish A

@gostroverhov Thank you for the support! Appreciate it!

Bhavin Sheth

@garrytan This is a big unlock for agent-based products. Clean, reliable web data is still one of the biggest bottlenecks for real AI workflows, and brittle scrapers just don’t scale.

Returning structured JSON with filters + near-real-time freshness feels especially valuable for GTM, research, and coding agents. Curious how teams are thinking about evaluation of result quality over time as the web keeps changing.

shreya chaurasia

Really interesting approach. Curious how teams think about keeping signals fresh as web content changes quickly.

Nithish A

@shreya_chaurasia19 Thanks Shreya. This would be helpful for people that are building AI agents and need live signals. For example AI sales agents could monitor new features on prospect websites and use that as a signal or personalization angle for outreach.

Zeiki Yu

Congrats on the launch! Crustdata looks perfect for powering serious, real-time GTM AI agents.


Nithish A

@zeiki_yu Thanks Zeiki! That's the goal. We aim to be the only place all AI agents go to for people, company and event data. By combining our WebSearch API and our B2B data APIs, you get the most powerful and extensive people and company dataset!

Daniel Ahmadizadeh

@zeiki_yu yes!

Daniel Ahmadizadeh

Excited for this launch!

Abhilash Chowdhary

@daniel_ahmadizadeh1 thanks for the support

Austin Heaton

@daniel_ahmadizadeh1 congrats on the launch! A friend of mine is testing your API and depending on his results I might contact you.

Daniel Ahmadizadeh

@austin_heaton here to help / jam, @austin_heaton ! thanks for the kind words

Yuanyuan Zhang

Love the simplicity here! Building and maintaining internal scrapers to bypass CAPTCHAs is a nightmare for small teams. I’m curious: How do you handle the freshness of the data? Is it crawling in real-time when the API is called, or are you serving results from a pre-indexed cache? For a high-frequency tool like a price tracker or a daily research agent, what’s the typical latency we should expect?

Nithish A

@yuanyuan_zhang0104 All our data is fresh. We show results that include webpages published just minutes ago. Our API is built for providing realtime search results, so it will definitely work for integration into high frequency tools such as the ones you've mentioned.

Nuseir Yassin

Is an hour the buffer time or can it even provide real-time updates? Example, some news broke a minute ago?

Nithish A

@nuseir_yassin1 Hey Nuseir, the API can provide real-time updates. It can show you webpages that were published less than 2 minutes ago.

Curious Kitty
Search quality is getting worse due to SEO spam and duplicated content across providers. What techniques do you use (or plan to use) to improve trust and usefulness of results for agents—beyond returning a SERP—and how do you measure success (e.g., downstream task success rate vs human relevance labels)?
Nithish A

@curiouskitty We have a proprietary method of indexing and ranking the web, ensuring low quality sources are not shown to AI agents. We're constantly working on it to make it better.

Roop Reddy

@curiouskitty @nithish_a1 Could you share a little about how you check what is a low quality source?
