Web Scraping 🔍🔥

byFeatured•4yr ago

Scraping public data from the web, transforming it, and using it for a new product can become a very successful business. What kind of web scraping projects have you worked on and which tools did you use?

128 views

Replies

Best

KaraboAI

(1) Scrapping job listing websites and creating your own product, mailing list etc for job hunters tools - python, selenium, Beautiful Soup

Report

4yr ago

@berthakgokong Sounds interesting!

Report

4yr ago

Zappi Ad Predictor

I never finished it - but I started a Strava scraping project. I think there's a ton of suuuuper interesting data in there, although I did it for interests sake, rather than to monetise it. And yep, like @berthakgokong says - Python, Beautiful Soup, etc.

Report

4yr ago

@berthakgokong @nik_hazell Also pretty cool. I think collecting data for a while and then figuring out what do to with it later is also not a bad idea. The value of data in general will be rising in the future. Have you tried puppeteer?

Report

4yr ago

Zappi Ad Predictor

@berthakgokong @david_gregorian I haven't - would you recommend?

Report

4yr ago

@berthakgokong @nik_hazell You should check it out. The usability is pretty good, especially if you use it with Typescript. It is based on Chromium. All in all it has some quirks when controlling a headless browser engine, but I think that's not the fault of Puppeteer itself.

Report

4yr ago

Funny thing, I scraped the "Top Most Upvoted Products" using Bardeen.ai (our tool). It worked really nicely. BUT I wanted to figure out which month is the best to launch, and turns out they haven't updated that page, so now I gotta scrape the all products. https://www.producthunt.com/e/50... Let's see where this takes me.

Report

4yr ago

@renat_gabitov Haha I also thought about it once. Can't you use the graphql api of producthunt? I think it is not public...

Report

4yr ago

Product Hunt

@renat_gabitov @david_gregorian You can for sure use our public API for projects https://api.producthunt.com/v2/docs

Report

4yr ago

@renat_gabitov @product_at_producthunt ah nice, thanks for the hint Michael :)

Report

4yr ago

AnnounceKit

I remember my rookie days at coding. I was usually doing a lot of parsing, mostly bots fetching videos from various web sources. Everything done with preg_match function in PHP 🥲

Report

4yr ago

@amirali_nurmagomedov Damn that's old school :P How long ago was that?

Report

4yr ago

AnnounceKit

@david_gregorian it was 2006-2007, damn 16 years ago :(

Report

4yr ago

Job websites, company databases, google serp, booking sites, etc. Mostly using google scrapy.

Report

4yr ago

@victorbjorklund What do you mean by google scrapy?

Report

4yr ago

@victorbjorklund sounds awesome! Just wondering if any legal risks

Report

8mo ago

I'm building a no-code web scraping tool called https://datagrab.io.

Report

4yr ago

@balazsi_robert Looks pretty dope! Did you create a chrome add-on?

Report

4yr ago

@david_gregorian Thanks, David! Yes, I did! :)

Report

4yr ago

Is My CEO A Fraud?

https://Metaheads.xyz - search engine for fb comments. nodejs + selenium :)

Report

4yr ago

@jawerty Looks awesome! Does it store all the scraped data on a custom db? Or is there something happening on the fly, when doing a search?

Report

4yr ago

I worked with Nodejs and puppeteer to scrape many complex sites for clients but now want to make software/tools as a side business. Any idea for me guys?

Report

4yr ago

@naimur103 If you are so experienced with scraping stuff, maybe you could develop a no-code tool for creating custom scrapers :) Through a SaaS

Report

4yr ago

We at ejobsitesoftware used to receive many queries for the jobs database. So we have built a custom job scrapper in Laravel using Goutte. Check screenshot - http://cricketu.com/web-scrap/

Report

4yr ago

@jobboardsoftware That looks pretty cool James! Did you think about publishing it? (Paid or open source)

Report

4yr ago

@david_gregorian We plan to use it along with Job Board Software - https://www.ejobsitesoftware.com and provide job database to job board owners

Report

4yr ago

Warmup Inbox

QApop is build using NodeJS Puppetter and AWS lambda. I also have some side income from consulting around Phantombuster

Report

4yr ago

@fabian_maume QApop looks really good! Thanks for sharing :) Did you already think about applying the same to other (famous) forums?

Report

4yr ago

1 2

Web Scraping 🔍🔥

Replies

Engineering & Development

LLMs

Productivity

Marketing & Sales

Design & Creative

Social & Community

Finance

AI Agents

Trending categories

Top reviewed

Trending products

Top forum threads

Engineering & Development

LLMs

Productivity

Marketing & Sales

Design & Creative

Social & Community

Finance

AI Agents

Trending categories

Top reviewed

Trending products

Top forum threads