Schema Source is live, plus 46 pre-built schemas for common data needs
Schema Source—paste any URL and it generates a ready-to-use JSON Schema for that page's data, as JSON Schema, Zod, or Pydantic.
Generating your own is great when the page is unusual or specific. But a lot of extraction work is the same handful of shapes over and over: a job posting, a product listing, a company profile, a real estate listing. So here's another way to do it.
We also open-sourced 46 pre-defined schemas on GitHub, across 10 industry categories: real estate, jobs, e-commerce, finance, healthcare, dev tooling, gov records, travel, b2b intel, and social. Grab the file, point Tabstack at a URL, get data back. https://github.com/Mozilla-Ocho/tabstack-schemas
Grab one or generate your own, whichever fits.
29 views


Replies
WebCurate.co
This is actually pretty useful. Open-sourcing the schemas is a nice touch too, great job!
Tabstack by Mozilla
@hosseinyazdi thank you for your kind words. I lead Tabstack GTM, but I'm a developer, so I try to do as many technical marketing campaigns as possible. Gotta keep finding reasons to code. 😉