Great tool for structured data extraction! I use Tabstack to pull data from various websites, and then feed the JSON output into tools.pixiaoli.cn for formatting, validation, and conversion. The combination of Tabstack's extraction API + a client-side JSON formatter/validator is a really efficient workflow for data pipelines. The Mozilla backing for privacy is a huge plus — knowing my extraction data isn't being sold or used for training is reassuring. Love that it enforces schema on every call, even when pages change.
Tabstack by Mozilla
This team delivers.
@Tabstack by Mozilla is a powerful web content extraction and transformation toolkit designed specifically for AI agent builders. It's schema-based. You define a JSON schema for the fields you want, and every call reads the page and maps it to that schema. So you maintain the schema, not the scraping logic.
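To make the schema-first idea concrete, here is a minimal sketch in Python. The field names (`title`, `price`, `in_stock`), the `conforms` helper, and the sample record are all made up for illustration. This is not Tabstack's API, just a stdlib-only check of what "maps to that schema" could look like for a flat object.

```python
import json

# Hypothetical JSON Schema for a product page; field names are illustrative only.
PRODUCT_SCHEMA = {
    "type": "object",
    "properties": {
        "title": {"type": "string"},
        "price": {"type": "number"},
        "in_stock": {"type": "boolean"},
    },
    "required": ["title", "price"],
}

# Map JSON Schema primitive type names to Python types.
_TYPES = {"string": str, "number": (int, float), "boolean": bool}


def conforms(record, schema):
    """Return True if a flat record has all required keys and matching primitive types.

    Only a tiny subset of JSON Schema (flat object, three primitive types) is handled.
    """
    if not isinstance(record, dict):
        return False
    for key in schema.get("required", []):
        if key not in record:
            return False
    for key, value in record.items():
        spec = schema["properties"].get(key)
        if spec is None:
            continue  # extra keys are tolerated in this sketch
        # bool is a subclass of int in Python, so reject it for "number" explicitly
        if spec["type"] == "number" and isinstance(value, bool):
            return False
        if not isinstance(value, _TYPES[spec["type"]]):
            return False
    return True


# A made-up record standing in for what an extraction call might return.
sample = json.loads('{"title": "Desk lamp", "price": 24.5, "in_stock": true}')
print(conforms(sample, PRODUCT_SCHEMA))
```

The point of the sketch is that the only thing you edit when requirements change is `PRODUCT_SCHEMA`; nothing in it depends on how the page is laid out.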
When they previously launched, the community had many questions about this schema-first approach. Now they're introducing Schema Source, a free utility to help you quickly build your first schema and get started. Enter any URL, get a structured JSON schema. That's it!
You have the API. You have the tools. You have the schema. What will you scrape today?
👋 Product Hunt!
We're launching our free tool, Schema Source. Paste any URL and it generates a ready-to-use schema for that page's data, which you can export as JSON Schema, Zod, or Pydantic.
https://schema.tabstack.ai/
Generating your own is great when the page is unusual or specific. But a lot of extraction work is the same handful of shapes over and over: a job posting, a product listing, a company profile, a real estate listing. So here's another way to do it.
We also open-sourced 46 pre-defined schemas on GitHub, across 10 industry categories: real estate, jobs, e-commerce, finance, healthcare, dev tooling, gov records, travel, b2b intel, and social. Grab the file, point Tabstack at a URL, get data back. https://github.com/Mozilla-Ocho/tabstack-schemas
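For the "grab the file, point Tabstack at a URL" step, a rough Python sketch of the local half might look like this. Everything specific here is an assumption: the file name `job_posting.json`, its contents, the `load_schema` and `build_request` helpers, and the `url`/`schema` keys in the payload are hypothetical and not taken from the Tabstack docs or the repo. No network call is made; check the official API reference for the real request shape.

```python
import json
import tempfile
from pathlib import Path


def load_schema(path):
    """Read a JSON Schema file from disk and return it as a dict."""
    schema = json.loads(Path(path).read_text(encoding="utf-8"))
    if not isinstance(schema, dict):
        raise ValueError("expected a JSON object at the top level of the schema file")
    return schema


def build_request(url, schema):
    """Pair a target URL with a schema. The 'url' and 'schema' keys are assumed names."""
    return {"url": url, "schema": schema}


# Stand-in for a file grabbed from the repo; the name and contents are made up.
with tempfile.TemporaryDirectory() as tmp:
    schema_file = Path(tmp) / "job_posting.json"
    schema_file.write_text(
        json.dumps({
            "type": "object",
            "properties": {
                "title": {"type": "string"},
                "company": {"type": "string"},
            },
            "required": ["title"],
        }),
        encoding="utf-8",
    )
    payload = build_request("https://example.com/jobs/123", load_schema(schema_file))

# Show the request body that would be handed to an HTTP client.
print(json.dumps(payload, indent=2))
```

Keeping the schema in its own file means the same file can be reused across many URLs of the same shape.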
Grab one or generate your own, whichever fits.
@tessak22 OSS ftw!!