Free Automated Web Scraping Tool for Windows

Would you recommend Octoparse to a friend?
DiscussionYou need to become a Contributor to join the discussion - Find out how.
Kamiyab Mukhi@kamiyab_mukhi · Software User
Actually I am a web developer and I had a project where client also needed the data to be filled up from sites like alibaba, justdial, olx etc. I have come to octoparse searching for those data extractors so that I can get the data as I wanted and then upload it to my database. Well believe me, it didnt go well in the first attempt. it was when i gone thru all the sample cases and the videos that I have understood how to properly use the software. For , justdial it was fine but had a lot of problem with . I had to get the phone number which requires a login and it took one complete day to get succeed at the end, and that too with local extraction, but still it suffices me. So in this way I was able to collect fair amount of data from here and could use it on my client's site. The Xpath tool helped me a lot in justdial site and the cookie tool in specific, for alibaba. I think speed is also good but could even be better. Hope my case study would guide new software users a direction of use / faith in the software.
Fizah Zainudin@fizah_zainudin · EngD Student at UCL
I am currently using Octoparse to obtain twitter data for my EngD research. I use the functionality to automatically open and log into my twitter account and perform a specific search on twitter to get the data that I require. I created a task using advanced mode to initially open the twitter page and then login using my credentials. The task will then search for a specific set of lexicons that I have defined and will then show the results of the search. The task will then scroll down the twitter search results for predefined number of times in order to get as many tweets as possible and then the task will iterate through tweets to extract the username of the person tweeting and also the actually tweets and the timestamp. I then used the extracted data for further sentiment analysis using a separate tool. I find that it is very intuitive to use Octoparse. There are also many resources available on its website, and a video specifically showing how to extract tweets which is very useful. Also, there appear to be an increasing number of users which means that I expect more discussions and tips to be posted online moving forward. I also find that for a resource hungry processes, it is very useful to have the ability to run those processes on cloud so that it will finish quicker. I have only scratched the surface in terms of the functionality available on Octoparse and look forward to experiment with them and potentially use this tool as the strategic platform for data extraction for my research. I find other tools out there to be have very limited capabilities and are not as intuitive to use as Octoparse. Addtionally, I also look forward to introduce Octoparse as the strategic platform for social media data extraction to my department at the university. As it stands, I also feel that Octoparse has the right price point which is not too prohibitive for user in a small organisation as myself.