*******
This beta search engine must be created and working in 2 weeks. PLEASE Do not even bid if you do not think you can make this timeline...
*******
I am looking for something like this: <[login to view URL]>
but with asp.net and ms sql. User must point crawler to 1 url and get every page - see below for more details.
1) Crawler must run on a basic host (<[login to view URL]>)
2) Crawler can run by a form click, but in the end must run independently, on a scheduler. This can be hardcoded.
3) Installation of crawler must only be to copy files to a webserver.
4) Crawler must have "page delay" - to throttle download requests to the website being indexed.
5) the start page can be hardcoded for this round
All configs in the future will be in a user admin page and configurable via webpage.
Engine/Index:
1) Speed is key.
2) My focus is ASP.NET and MS SQL Server. I would like this developed so that a MySQL Database could also be used with a little extra work (BUT that is out of scope for this request).
3) There should be hit-hilighting of terms on the result pages (like Google).
4) Index should accomodate multiple domains.
5) like Google, the summary under the title should show the first bit of text that is relevant to the terms typed into the search (also like Google).
6) Crawler should accomodate for first 2 page parameters. Example:
[[login to view URL]][1]
Any parameters after the first 2 should be ignored.
I want to start small, and build this up in the next 6 months.
What else have I forgotten?
Thanks, Dale
## Deliverables
1) Complete and fully-functional working program(s) in executable form as well as complete source code of all work done.
2) Deliverables must be in ready-to-run condition, as follows (depending on the nature of the deliverables):
a) For web sites or other server-side deliverables intended to only ever exist in one place in the Buyer's environment--Deliverables must be installed by the Seller in ready-to-run condition in the Buyer's environment.
b) For all others including desktop software or software the buyer intends to distribute: A software installation package that will install the software in ready-to-run condition on the platform(s) specified in this bid request.
3) All deliverables will be considered "work made for hire" under U.S. Copyright law. Buyer will receive exclusive and complete copyrights to all work purchased. (No GPL, GNU, 3rd party components, etc. unless all copyright ramifications are explained AND AGREED TO by the buyer on the site per the coder's Seller Legal Agreement).
## Platform
Must run on [login to view URL]
asp.net
file based index or MS SQL server.
Speed is key.
I want to be able to index 2000-3000 pages
type in 2-3 word queries
and get a response in less than 3-4 seconds