Scraping public data from the web, transforming it, and using it for a new product can become a very successful business.
What kind of web scraping projects have you worked on and which tools did you use?
122 views
Replies
Best
I worked with Nodejs and puppeteer to scrape many complex sites for clients but now want to make software/tools as a side business. Any idea for me guys?
Report
@naimur103 If you are so experienced with scraping stuff, maybe you could develop a no-code tool for creating custom scrapers :) Through a SaaS
I never finished it - but I started a Strava scraping project. I think there's a ton of suuuuper interesting data in there, although I did it for interests sake, rather than to monetise it.
And yep, like @berthakgokong says - Python, Beautiful Soup, etc.
Report
@berthakgokong@nik_hazell Also pretty cool. I think collecting data for a while and then figuring out what do to with it later is also not a bad idea. The value of data in general will be rising in the future. Have you tried puppeteer?
@berthakgokong@nik_hazell You should check it out. The usability is pretty good, especially if you use it with Typescript. It is based on Chromium.
All in all it has some quirks when controlling a headless browser engine, but I think that's not the fault of Puppeteer itself.
Report
Funny thing, I scraped the "Top Most Upvoted Products" using Bardeen.ai (our tool). It worked really nicely.
BUT I wanted to figure out which month is the best to launch, and turns out they haven't updated that page, so now I gotta scrape the all products.
https://www.producthunt.com/e/50...
Let's see where this takes me.
Report
@renat_gabitov Haha I also thought about it once. Can't you use the graphql api of producthunt? I think it is not public...
I remember my rookie days at coding. I was usually doing a lot of parsing, mostly bots fetching videos from various web sources. Everything done with preg_match function in PHP 🥲
@jawerty Looks awesome! Does it store all the scraped data on a custom db? Or is there something happening on the fly, when doing a search?
Report
We at ejobsitesoftware used to receive many queries for the jobs database. So we have built a custom job scrapper in Laravel using Goutte.
Check screenshot - http://cricketu.com/web-scrap/
Report
@jobboardsoftware That looks pretty cool James! Did you think about publishing it? (Paid or open source)
Replies
KaraboAI
Zappi Ad Predictor
Zappi Ad Predictor
Product Hunt
AnnounceKit
AnnounceKit
Is My CEO A Fraud?
Warmup Inbox