Files Search Engine - repost
- Status Closed
- Budget N/A
- Total Bids 6
We are looking for someone able to create a public search engine using elastic search and nutch for crawling or the constellio system.
What we need:
1. Crawl [url removed, login to view], [url removed, login to view], [url removed, login to view], [url removed, login to view] and [url removed, login to view]
2. Use the best technique to crawl up to 1 - 2 million pages per day.
3. extract all the files name + download links
4 stock it in our database.
4. Make it "searchable" inside our search engine.
We have the global idea but looking for someone able to advice us how to realize this project like a consultant and then provide the technology to start the project.Get free quotes for a project like this
Looking to make some money?
- Set your budget and the timeframe
- Outline your proposal
- Get paid for your work
Hire Freelancers who also bid on this project
Looking for work?
Work on projects like this and make money from home!Sign Up Now
- The New York Times
- Wall Street Journal
- Times Online