Files Search Engine - repost

Project Budget (USD): $1500 - $3000

Project Description:

We are looking for someone who can build a public search engine using either Elasticsearch with Nutch for crawling, or the Constellio system.

What we need:
1. Crawl [url removed, login to view], [url removed, login to view], [url removed, login to view], [url removed, login to view] and [url removed, login to view]
2. Use the best technique to crawl up to 1-2 million pages per day.
3. Extract all file names and download links.
4. Store them in our database.
5. Make them searchable inside our search engine.
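
To make the extraction requirement and the crawl target above more concrete, here is a minimal sketch in Python using only the standard library. It is an illustration under stated assumptions, not a proposed implementation: the extension list `FILE_EXTENSIONS`, the class `FileLinkParser`, and the helpers `extract_files` and `pages_per_second` are hypothetical names, and it assumes a "file" is any link whose path ends in one of the listed extensions. In a real deployment this logic would more likely live in a Nutch parse plugin or a similar crawler hook.

```python
from html.parser import HTMLParser
from urllib.parse import urljoin, urlparse, unquote
import posixpath

# Hypothetical set of extensions treated as downloadable files (an assumption;
# the posting does not say which file types matter).
FILE_EXTENSIONS = {".zip", ".rar", ".pdf", ".mp3", ".avi", ".iso"}


class FileLinkParser(HTMLParser):
    """Collects (file_name, download_url) pairs from <a href="..."> tags."""

    def __init__(self, base_url):
        super().__init__()
        self.base_url = base_url
        self.files = []

    def handle_starttag(self, tag, attrs):
        if tag != "a":
            return
        href = dict(attrs).get("href")
        if not href:
            return
        # Resolve relative links against the URL of the crawled page.
        url = urljoin(self.base_url, href)
        # Take the file name from the decoded URL path, ignoring the query string.
        name = posixpath.basename(unquote(urlparse(url).path))
        ext = posixpath.splitext(name)[1].lower()
        if ext in FILE_EXTENSIONS:
            self.files.append((name, url))


def extract_files(html, base_url):
    """Return a list of (file_name, download_url) found in one HTML page."""
    parser = FileLinkParser(base_url)
    parser.feed(html)
    return parser.files


def pages_per_second(pages_per_day):
    """Sustained fetch rate needed to reach a daily page target."""
    return pages_per_day / 86_400  # seconds in a day


if __name__ == "__main__":
    page = '<a href="/dl/My%20File.zip">get</a> <a href="about.html">about</a>'
    print(extract_files(page, "http://example.com/page/"))
    print(round(pages_per_second(1_000_000), 1), "to",
          round(pages_per_second(2_000_000), 1), "pages/second")
```

The rate helper shows what the daily target implies: 1 to 2 million pages per day is roughly 12 to 23 pages per second sustained around the clock, which would have to be spread across the target sites with per-host politeness delays.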

We have the overall idea, but we are looking for someone who can advise us, as a consultant, on how to carry out this project and then provide the technology to get it started.

Skills required:
Human Resources, Internet Marketing, Marketing

$6185 in 3 days
$3092 in 20 days
$2500 in 3 days
$2894 in 45 days
$2666 in 45 days
$2222 in 10 days