Google Country and Google Translate Scraping

This project received 11 bids from talented freelancers with an average bid price of $455 USD.

Employer working
Project Budget
$250 - $750 USD
Project Description

I need a script that will run on my own servers, take names/entries from two different sites, and automatically research the data on those entries. I will give you the two sites via PM if you want to take a look at them, as they require private logins. These are basic HTML sites with text entries, so I don't foresee much trouble retrieving the entries from them.
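To illustrate the retrieval step, here is a minimal sketch in Python. It assumes the entries appear as plain list items (<li>) in the page markup, which is only a guess since the source sites are private; the names EntryParser and extract_entries are made up for this example, and logging in and downloading the page are left out.

```python
from html.parser import HTMLParser


class EntryParser(HTMLParser):
    """Collect the text of every <li> element (assumed, hypothetical markup)."""

    def __init__(self):
        super().__init__()
        self.entries = []
        self._in_item = False
        self._buffer = []

    def handle_starttag(self, tag, attrs):
        # Start buffering text when a list item opens.
        if tag == "li":
            self._in_item = True
            self._buffer = []

    def handle_data(self, data):
        # Keep text only while inside a list item.
        if self._in_item:
            self._buffer.append(data)

    def handle_endtag(self, tag):
        # When the list item closes, store its text if it is not empty.
        if tag == "li" and self._in_item:
            text = "".join(self._buffer).strip()
            if text:
                self.entries.append(text)
            self._in_item = False


def extract_entries(html):
    """Return the list of text entries found in an HTML string."""
    parser = EntryParser()
    parser.feed(html)
    parser.close()
    return parser.entries


if __name__ == "__main__":
    page = "<ul><li>alpha.com</li><li> beta.net </li></ul>"
    print(extract_entries(page))
```

If the real pages use table cells or some other tag, only the tag name checked in the parser would need to change.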

Once you have the entries from the source sites, you will need to scrape the corresponding results from the destination sites (e.g., Turkish Dictionary list names will need to have results retrieved from Pinyin results will need to be retrieved from [url removed, login to view] and [url removed, login to view], etc.).

The destination sites from which you will need to get results include: various Google country sites ([url removed, login to view], [url removed, login to view], etc.), [url removed, login to view], [url removed, login to view], [url removed, login to view], Google Translate, [url removed, login to view], etc.

One important thing the script has to account for is that the entries you get from the two source sites must have their extensions (.net, .com, or .org) stripped in order to get accurate results from the destination sites.
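As a rough sketch of the stripping step (in Python, though any language is fine), something like the following would do. The function name strip_extension is made up for this example, and it assumes only a trailing .net, .com, or .org should be removed, in any letter case.

```python
import re

# Only the three extensions named in the description are stripped,
# and only when they appear at the very end of the entry.
_EXTENSION = re.compile(r"\.(?:net|com|org)$", re.IGNORECASE)


def strip_extension(entry):
    """Return the entry without a trailing .net, .com, or .org."""
    return _EXTENSION.sub("", entry.strip())


if __name__ == "__main__":
    for raw in ["example.com", "Sample.NET", "plainname", "my.site.org"]:
        print(raw, "->", strip_extension(raw))
```

Entries with any other ending are left untouched, so a name without an extension passes through unchanged.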

The scraped results will need to be put into an Excel sheet similar to the one attached and automatically emailed to me daily.
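A minimal sketch of the reporting step, using only the Python standard library. Note the swap: it writes a CSV file (which Excel opens) rather than a native Excel workbook, since a true .xlsx would need a third-party library. The column names, the function name build_report_email, and the addresses are all assumptions for illustration; the real layout should follow the attached sheet.

```python
import csv
import io
from email.message import EmailMessage


def build_report_email(rows, sender, recipient):
    """Build an email with the scraped rows attached as results.csv."""
    # Write the rows to an in-memory CSV; the header names are assumed.
    buffer = io.StringIO()
    writer = csv.writer(buffer)
    writer.writerow(["entry", "destination", "result"])
    writer.writerows(rows)

    message = EmailMessage()
    message["Subject"] = "Daily scrape results"
    message["From"] = sender
    message["To"] = recipient
    message.set_content("The latest scraped results are attached.")
    # Attach the CSV text as a text/csv file.
    message.add_attachment(buffer.getvalue(), subtype="csv", filename="results.csv")
    return message


if __name__ == "__main__":
    rows = [("example", "Google Translate", "sample result")]
    msg = build_report_email(rows, "script@example.com", "me@example.com")
    print(msg["Subject"], [part.get_filename() for part in msg.iter_attachments()])
```

The message could then be sent with smtplib.SMTP(...).send_message(msg) against whatever mail server is available, and the whole script run once a day from a scheduler such as cron; both of those choices are assumptions about the server setup.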

I don't care what language the script is done in as long as it is efficient.

Thank you for making a bid. I look forward to getting started very soon.
