Search Engine Parser

  • Status Closed
  • Budget $200 - $300 USD
  • Total Bids 5

Project Description

The project consist in creating a script that takes search results from another search engine

The site will be a file search engine.

The source search engine needs a login to be able to search. So the script should use cookies in order to get search results from the source site.

I will want to be able to use 5 or more users in the script, so that there wont be too many searches from one account.

The script should display from the source search engine, the results with all the details ( ex: file date, file type (image, music, archive , etc) )

For example when i will search in the script : metallica , the script will do the search in the background ( with curl ) in the source serach engine, get the results and display them to the visitor.

There is a problem with the links from the source search engine. All of the url's for the files are temporary and can be accessed only by logged in users. They are a redirect that goes to the public link.

I want to slow in my scrip only the public links.

for example : for metallica the results would be :

[url removed, login to view] , - archive - added on 13 august 2013

the link for [url removed, login to view] in the source search engine is : [url removed, login to view] ,

if a anonymous visitor ( not logged in to the [url removed, login to view]) visit that script it dosent work

if a logged in user visits it , i will redirect to [url removed, login to view] ( the public link available for all visitors)

I want to show in my results for the script only the public link.

I dont know what would be the best method to do this, i am waiting for solutions from potential bidders what they think would be the best and most efficient method.

evrey search will generate a new page for example the metallica search will generate [url removed, login to view]

there will be on the site "latest 20 seraches" , with will show the latest seaches with links to their pages.

i want to mention also that the searches and temp links from the source site expire in 15 minutes.

So a cache should be put in place for the script

The design of the script should be simple, basically a search field + the serach results and latest searches

The search filed will have the same functions at the source seach field, for example search only for images, or music.

I will give more details about the source site with a test account in the PM.

Im open to any other questions.

Get free quotes for a project like this

Looking to make some money?

  • Set your budget and the timeframe
  • Outline your proposal
  • Get paid for your work

Hire Freelancers who also bid on this project

    • Forbes
    • The New York Times
    • Time
    • Wall Street Journal
    • Times Online