Create a Web Scrapping Program

This project received 26 bids from talented freelancers with an average bid price of $281 USD.

Get free quotes for a project like this
Employer working
Project Budget
$30 - $250 USD
Total Bids
Project Description

We want a program that scrapes/extract data from various websites. The program will go to [url removed, login to view] and then put various search inquires (Accounting textbooks, Law textbooks, Chemistry textbooks, Biology textbooks, ETC) and search. Next it will look at the used price of the book. If the price of the book is greater than $40 then it should click on the link and extract the ISBN number. If it is less than $40 then the program should move on to the next book. It will continue this process until it reaches the end of the books/searches and put the information in an excel file. The program will then use the ISBN numbers (in the excel file) to go to [url removed, login to view] and use the ISBN numbers one by one to search the prices for the books. Also, when the program is searching the ISBN numbers on [url removed, login to view] it should uncheck all of the options on the left side except New and Used. Next, the program will look at the lowest price and check the merchant/website. If the website is "Amazon" or "[url removed, login to view]" then it will cancel the search and move on to the next ISBN number in the excel file. If it is not "Amazon" or "[url removed, login to view]" then it will check the condition of the lowest price book and if the condition is "brand new, new, like new, mint, very good, or good" then the program should move on to the seller's comments available below the condition in the same section (If the condition does not match then it should move on to the next result). However when the program is extracting the comments it should look for specific words in the description and if they are present then the program should move on to the next search result and repeat the process. If the next one is good (does not contain "blacklisted words") then the program should extract the information (price, website/bookstore, condition, and the seller's comments) and then put the information in the same excel file as the ISBN numbers. This is the whole process but the program interface should be made as follows: The program should be extremely simple. It should consist of 2 options. One option should be to search all subjects (biology, accounting, etc). We will provide you with a list of subjects for the program to search and also a text file with the "blacklisted words". The program will put the information from each of the subjects into the previously mentioned excel file. The second option should be to search specific subjects. If we select the 2nd option then a dialog box will open on the program which will allow us to input the subjects that we would like to search. The program will then put the extracted information in individual tabs of the excel file. For each of the subjects the program will extract all of the ISBN numbers from Amazon and the information for each of the ISBN numbers. The program will also consist of a button to run the program. I understand that the description seems complex but it is much simpler than it sounds. Please feel free to ask any questions you may have.

Looking to make some money?

  • Set your budget and the timeframe
  • Outline your proposal
  • Get paid for your work

Hire Freelancers who also bid on this project

    • Forbes
    • The New York Times
    • Time
    • Wall Street Journal
    • Times Online