Closed

PHP web crawler

This project was awarded to MarcusPan for $299 USD.

Project Budget
$250 - $750 USD
Total Bids
15
Project Description

Looking for an experienced PHP programmer to write a web crawler script.
The script should take a web URL as input and generate output in CSV format.
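As a rough illustration of the CSV output step, here is a minimal sketch using PHP's built-in fputcsv. The function name writeCsv and the column names in the usage note are assumptions for illustration only; the posting does not fix a column layout.

```php
<?php
// Hypothetical helper: write rows (associative arrays sharing the same keys)
// to an open stream as CSV, with a header line taken from the first row.
function writeCsv(array $rows, $stream): void
{
    if ($rows === []) {
        return; // nothing to write
    }
    // An empty escape character gives RFC 4180 style quoting (PHP 7.4+).
    fputcsv($stream, array_keys($rows[0]), ',', '"', '');
    foreach ($rows as $row) {
        fputcsv($stream, array_values($row), ',', '"', '');
    }
}
```

Usage might look like writeCsv($rows, fopen('output.csv', 'w')), where each row carries keys such as title and thumbnail. Fields containing commas or quotes are quoted by fputcsv itself.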

Example: [url removed, login to view]://ap7am.com/telugu-videos-4-other-videos.html&count=25
The script should crawl the current page and then the following pages in the pager until it has collected the number of items given by count in the URL, which is 25 in the example above.
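The count handling described above could be sketched roughly as follows. Both function names are hypothetical, and $fetchPage is an assumed callback that returns the items found on a page plus the URL of the next pager page (or null when there is none); how those are located depends on the target site's markup.

```php
<?php
// Hypothetical helper: split the input format "<page-url>&count=<n>"
// into the page URL and the requested item count.
function splitCountParam(string $input, int $default = 0): array
{
    if (preg_match('/^(.*?)[?&]count=(\d+)$/', $input, $m)) {
        return [$m[1], (int) $m[2]];
    }
    return [$input, $default];
}

// Follow the pager until $count items are collected or no next page exists.
// $fetchPage(string $url) is assumed to return
// ['items' => array, 'next' => string|null].
function crawlUntilCount(string $startUrl, int $count, callable $fetchPage): array
{
    $items = [];
    $seen = [];
    $url = $startUrl;
    while ($url !== null && count($items) < $count && !isset($seen[$url])) {
        $seen[$url] = true; // guard against pager loops
        $page = $fetchPage($url);
        foreach ($page['items'] as $item) {
            $items[] = $item;
            if (count($items) >= $count) {
                break;
            }
        }
        $url = $page['next'] ?? null;
    }
    return $items;
}
```

Because the example URLs append &count= directly to the path without a ?, the sketch splits the string with a regular expression rather than relying on parse_url.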

The crawler should extract whatever important information it can get. In the example above, it should get the title and thumbnail image URL from the listing page, then crawl the respective inner detail page, such as [url removed, login to view], and get the embed code and tags.
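A minimal sketch of the listing extraction, using PHP's bundled DOM extension. The XPath expression is only an assumption: it treats every link that wraps an image as one video entry, and the real selectors would have to be adapted to each site's markup. The function name extractListing is hypothetical.

```php
<?php
// Hypothetical helper: pull title, thumbnail URL and detail-page URL
// out of a listing page's HTML.
function extractListing(string $html): array
{
    $doc = new DOMDocument();
    // Real-world HTML is rarely well formed; collect parser warnings quietly.
    libxml_use_internal_errors(true);
    $doc->loadHTML($html);
    libxml_clear_errors();

    $xpath = new DOMXPath($doc);
    $rows = [];
    // Assumption: each entry is an <a href> that directly contains an <img>.
    foreach ($xpath->query('//a[@href][img]') as $a) {
        $img = $xpath->query('img', $a)->item(0);
        $rows[] = [
            'title'      => trim($img->getAttribute('alt') ?: $a->getAttribute('title')),
            'thumbnail'  => $img->getAttribute('src'),
            'detail_url' => $a->getAttribute('href'),
        ];
    }
    return $rows;
}
```

Each detail_url can then be fetched and parsed the same way to collect the embed code and tags.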

Multiple values in a column should be separated by a separator like [*#seperator@#]

Note that there are 3 parts in the above example. The crawler should get the embed code from all three and separate them with [*#separator@#]
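Collecting several embed codes from one detail page and joining them into a single column value might look like the sketch below. It assumes the embed codes are iframe or embed elements, which may not hold for every site, and the function names are hypothetical. The separator is passed in as a parameter because the description spells the token two ways.

```php
<?php
// Hypothetical helper: return the HTML of every <iframe> or <embed>
// element found in a detail page, in document order.
function extractEmbedCodes(string $html): array
{
    $doc = new DOMDocument();
    libxml_use_internal_errors(true); // tolerate malformed markup
    $doc->loadHTML($html);
    libxml_clear_errors();

    $xpath = new DOMXPath($doc);
    $codes = [];
    foreach ($xpath->query('//iframe | //embed') as $node) {
        $codes[] = trim($doc->saveHTML($node));
    }
    return $codes;
}

// Join multiple values for one CSV column with the agreed separator token.
function joinValues(array $values, string $separator): string
{
    return implode($separator, $values);
}
```

For a three-part video, joinValues(extractEmbedCodes($html), '[*#seperator@#]') would yield one string holding three embed codes with two separators between them.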

Similarly, it (or another script with slight modifications) should work for [url removed, login to view]://videomasti.net/page/3/&count=35
For an inner detail page like [url removed, login to view],
it should pick the first link (because that is the most recently added one). Because the first link has 2 parts, the multiple embed codes should be separated with [*#seperator@#]

It should also work for [url removed, login to view]://videomasti.net/telugu-movie-index-a/&count=35

Use standard crawler engines (free, open source) like
[url removed, login to view]
[url removed, login to view]

If you have worked on a similar task and can show me a sample, that would be great.
