Google Country and Google Translate Scraping

AWARDED
Bids: 11
Avg Bid (USD): $455
Project Budget (USD): $250 - $750

Project Description:
I need a script that will run on my own servers, take names/entries from two different sites, and automatically research those entries. Both sites require private logins, so I will send them via PM if you want to take a look. They are basic HTML sites with text entries, so I don't foresee much trouble retrieving the entries from them.
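A minimal sketch of the entry-retrieval step, using only the Python standard library. The page layout of the source sites is not described here, so the assumption that each entry sits in its own `<td>` cell is purely illustrative, and `CellTextParser` and `extract_entries` are hypothetical names. Logging in and downloading the page are left out.

```python
from html.parser import HTMLParser


class CellTextParser(HTMLParser):
    """Collects the text found inside <td> cells (assumed layout)."""

    def __init__(self):
        super().__init__()
        self._in_cell = False
        self.entries = []

    def handle_starttag(self, tag, attrs):
        if tag == "td":
            self._in_cell = True

    def handle_endtag(self, tag):
        if tag == "td":
            self._in_cell = False

    def handle_data(self, data):
        text = data.strip()
        # Keep only non-empty text that appears inside a table cell.
        if self._in_cell and text:
            self.entries.append(text)


def extract_entries(html):
    """Return the list of cell texts found in an HTML page."""
    parser = CellTextParser()
    parser.feed(html)
    parser.close()
    return parser.entries
```

If the real pages use list items or plain paragraphs instead of table cells, only the tag name checked in the parser would need to change.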

Once you have the entries from the source sites, you will need to scrape the corresponding results from the destination sites (e.g., names from the Turkish Dictionary list will need results retrieved from google.tr; Pinyin names will need results retrieved from google.cn and baidu.com, etc.).
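The list-to-site mapping described above could be sketched as a small routing table. The two mappings shown are the ones given in this description; the list names as dictionary keys and the names `DESTINATIONS` and `plan_lookups` are assumptions made for illustration.

```python
# Hypothetical routing table: source list name -> destination sites.
# Only the two examples from the project description are filled in.
DESTINATIONS = {
    "Turkish Dictionary": ["google.tr"],
    "Pinyin": ["google.cn", "baidu.com"],
}


def plan_lookups(list_name, entries):
    """Pair every entry with each destination site for its list.

    Returns a list of (site, entry) tuples; unknown lists yield [].
    """
    sites = DESTINATIONS.get(list_name, [])
    return [(site, entry) for entry in entries for site in sites]
```

For example, `plan_lookups("Pinyin", ["nihao"])` pairs the entry with both google.cn and baidu.com, so each entry is looked up once per destination site.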

The destination sites include various Google country sites (google.tr, google.it, etc.), babelfish.com, langtolang.com, baidu.com, Google Translate, Yahoo.com, etc.

One important thing the script has to account for: the entries from the two source sites must have their extensions (.net, .com, or .org) stripped to get accurate results from the destination sites.
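A minimal sketch of the extension-stripping step. It assumes each entry is a single name with at most one trailing extension and that matching should be case-insensitive; `strip_extension` is a hypothetical helper name.

```python
import re

# Matches a trailing .net, .com, or .org (the three extensions named above).
_EXTENSION = re.compile(r"\.(?:net|com|org)$", re.IGNORECASE)


def strip_extension(entry):
    """Remove one trailing .net/.com/.org from an entry, if present."""
    return _EXTENSION.sub("", entry.strip())
```

Only a trailing extension is removed, so an entry such as `comet` or one ending in a different extension is left unchanged.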



The scraped results will need to be put into an Excel sheet similar to the one attached and automatically emailed to me daily.
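A rough sketch of the reporting step, using only the Python standard library. It writes CSV, which Excel can open, rather than a native .xls file; matching the attached sheet would need a spreadsheet library. The column names, addresses, and function names below are placeholders, since the layout of the attached sheet is not shown here. Sending the message (for example with `smtplib`) and the daily schedule (for example a cron job) are left to the server setup.

```python
import csv
import io
from email.message import EmailMessage


def results_to_csv(rows, columns):
    """Serialize a list of dict rows to CSV text with a header row."""
    buf = io.StringIO(newline="")
    writer = csv.DictWriter(buf, fieldnames=columns)
    writer.writeheader()
    writer.writerows(rows)
    return buf.getvalue()


def build_report_email(csv_text, sender, recipient, day):
    """Build (but do not send) an email with the CSV attached."""
    msg = EmailMessage()
    msg["Subject"] = f"Scrape results for {day}"
    msg["From"] = sender
    msg["To"] = recipient
    msg.set_content("Daily results attached.")
    # utf-8-sig adds a byte-order mark so Excel detects UTF-8,
    # which matters for Turkish and Chinese text.
    msg.add_attachment(
        csv_text.encode("utf-8-sig"),
        maintype="text",
        subtype="csv",
        filename=f"results_{day}.csv",
    )
    return msg
```

Keeping message construction separate from sending makes it possible to check the attachment without a mail server.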

I don't care what language the script is written in as long as it is efficient.


Thank you for bidding; I look forward to getting started very soon.

Additional Project Description:
03/19/2010 at 1:48 EDT
Yes

Skills required:
Electronic Forms, Javascript, Perl, Script Install, Web Scraping
Additional Files: DictionaryList_03_12_2010.xls
About the employer:
Verified


Hire creatorul
$750 in 6 days
$300 in 4 days
Hire MAnkita
$350 in 5 days
Hire aruhat
$600 in 7 days
$350 in 5 days
Hire gavinlee86
$250 in 5 days
$750 in 45 days
Hire mahi86
$300 in 10 days
Hire ez2010
$350 in 5 days
Hire mikkelslayer
$750 in 7 days