i need a small tool to scrape data from HTML files. i have millions of HTML files which contain peoples various pieces of information. and over a Million HTML files with 10 Million Public Profile. so i need to get that data in CSV file which will contain all the information from the HTML files. the scraper needs to be Multi threaded so it can scrape thousand of profile per minute using the computer hardware's performance. it needs to be windows program. or this can work in another way, you can just scrape those profiles to me within next 5 days. whichever works i will with it.
i have also attached a sample file of how the HTML file will be. (All those files are save to my local HDD)
48 freelancers are bidding on average $305 for this job
Hi, can extract those within next 3 days, no problem. probably even less. you upload those files somewhere and I run extraction on my server, sending you back CSV or xls or JSON or whatever else you want :)
Greetings, I am an experienced professional scrapper and have done similar projects in the past. Same can be verified from my profile. Let me allow to assist you with your requirements. Thanks
Hello, if the files have the same structure I can write you simple python script for this. You can have this done in 1-2 days with testing. I can start working right away. Josef