I'm looking into using the Gigablast open source search engine to find every car dealership in the United States. I need someone that's familiar with this search engine so you can create a profile that I can copy over to my local version of Gigablast and I can pull in a list of all car dealers in the country on a monthly basis.
I need a web scraping expert that can extract the car dealership name and contact information. As well as generate a listing of any staff, such as the sales managers.
I'm using Gigablast because it's free and open source. I'm not interested in paying for any lists you may have. I want a long term solution that works over the next couple years.
### I need to clarify what I'm looking for in this project ###
Great responses from many and I'm excited about working with one of you!
This is a two phase project with a human readable summary of all the results
Phase 1: Scrape the world wide web here in the United States and catalog the "URL" for every car dealership website.
Phase 2: Scrape each URL collected in Phase 1 to retrieve all the contact information for each dealership.
Just to be clear, I do NOT have any website URLs for any car dealership. You must build the entire list of car dealers here in the United States.
After which, you must then scrape each dealer for all the necessary contact information. Ideally, the each car dealer would have the following:
Car Dealership Name
City, State, Postal Code
Emails (any and all)
Phone (any and all)
Staff (Names of personnel)
### Your proposal ###
Please tell me if you can use the Gigablast search engine to accomplish this task. If not, then I need a solution that I can use in the future. For example, tell me if you're using a Python script, or Java, or an open source program to scrape for this information.
I'm happy to see all the responses, but I'm not getting anything more than "I can do it." Please tell me if you have ever worked with Gigablast. If not, then tell me what program you use.
I'm a Linux SysAdmin and I will be recreating your solution locally on a CentOS KVM. I will be running this web scraping program in the future, so I need a solution I can transfer to my local KVM.
Gigablast is already up and running locally here, and if you have not worked with it, please clone it from Github and see if this program will do the job. If not, please tell me what you plan to use so I can go about loading the new program locally.
Thank you for all your responses.
26 freelancers are bidding on average $117 for this job
Can we discuss. I have done similar many projects. I can give you car dealers contact data. Relevant Skills and Experience leads Proposed Milestones $250 USD - d