Closed

Web scrapping and web automation bot or customization of irobotsoft

This project received 8 bids from talented freelancers with an average bid price of $291 USD.

Get free quotes for a project like this
Employer working
Project Budget
$30 - $250 USD
Total Bids
8
Project Description

I needed a Data extraction bot. A good part of the functionalities I need are already in the open source software irobotsoft. A customization of this, more appropriate script or one build from scratch will be welcomed. The bot should be able to do the following:

1. Extract all applicable fields and entries given a list of search keywords
2. Extract all entries plus sub entries given a list of links
3. save content in a local MySQL database and CSV
4. Remove duplicates from the final results
5. Sort out items missing with extraction using keywords search vs extracting with respect to given links

Note
Fields to be scraped can be selected by auto-record and can be adjusted manually to improve accuracy. The fields to be scrapped will differ per website. During the selection of the field the user will indicated to which column in the dataset the fields belong. The keywords to be used for the search will be provided as a list.

A good part of what I need exist in irobotsoft or other extraction tools. I need someone who knows the tools or can build one to customize it for my use. Please let me know if you have any questions, suggestions or need further information.

Regards
Timon

Looking to make some money?

  • Set your budget and the timeframe
  • Outline your proposal
  • Get paid for your work

Hire Freelancers who also bid on this project

    • Forbes
    • The New York Times
    • Time
    • Wall Street Journal
    • Times Online