PHP command-line (CLI) script to crawl a website and log data

IN PROGRESS
Bids: 9
Avg Bid (USD): $148
Project Budget (USD): $30 - $250

Project Description:
I need a very small PHP CLI (command-line PHP) script that crawls a website, starting from the site's default home page and following links until it has found every web page on that site.
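
As a rough illustration of the crawl described above, here is a minimal sketch, not a finished implementation. The function names `resolveSameHost` and `crawl`, the `$fetch` callback, and the `$maxPages` limit are all assumptions made for this example. It follows only links on the same host, ignores fragments, and does not normalize `../` segments or honor robots.txt.

```php
<?php
// Hypothetical helper: turn an href found on $pageUrl into an absolute URL,
// or return null when the link leaves the site or is not a web link.
function resolveSameHost(string $href, string $pageUrl): ?string
{
    $href = explode('#', trim($href), 2)[0];          // drop any #fragment
    if ($href === '') {
        return null;
    }
    $page   = parse_url($pageUrl);
    $origin = $page['scheme'] . '://' . $page['host'];
    if (preg_match('#^[a-z][a-z0-9+.-]*:#i', $href)) { // already has a scheme
        $sameHost = parse_url($href, PHP_URL_HOST) === $page['host'];
        return $sameHost ? $href : null;               // mailto:, other hosts: skipped
    }
    if (substr($href, 0, 2) === '//') {
        return null;                                   // protocol-relative: skipped here
    }
    if ($href[0] === '/') {
        return $origin . $href;                        // root-relative path
    }
    $path = $page['path'] ?? '/';
    $dir  = substr($path, 0, strrpos($path, '/') + 1); // directory of the current page
    return $origin . $dir . $href;                     // simple relative path
}

// Breadth-first crawl. $fetch is any callable that returns the HTML of a URL
// as a string, or false on failure. Returns an array of url => html.
function crawl(string $startUrl, callable $fetch, int $maxPages = 500): array
{
    libxml_use_internal_errors(true);                  // tolerate real-world markup
    $queue = [$startUrl];
    $seen  = [$startUrl => true];
    $pages = [];
    while ($queue && count($pages) < $maxPages) {
        $url  = array_shift($queue);
        $html = $fetch($url);
        if (!is_string($html) || $html === '') {
            continue;
        }
        $pages[$url] = $html;
        $doc = new DOMDocument();
        $doc->loadHTML($html);
        foreach ($doc->getElementsByTagName('a') as $a) {
            $next = resolveSameHost($a->getAttribute('href'), $url);
            if ($next !== null && !isset($seen[$next])) {
                $seen[$next] = true;                   // never queue a URL twice
                $queue[] = $next;
            }
        }
    }
    return $pages;
}
```

On the command line, passing `'file_get_contents'` as `$fetch` would be the simplest option, assuming `allow_url_fopen` is enabled. Keeping the fetcher as a parameter also lets the crawl logic be tried against canned HTML without touching the network.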

As it follows links to crawl the entire site, I need it to record the following data for each page in a CSV file:

1. url
2. meta title
3. meta description
4. meta keywords
5. H1 tags separated by pipes (in one field)
6. H2 tags separated by pipes (in one field)
7. H3 tags separated by pipes (in one field)
8. If it is simple and affordable (this is on a tiny budget): the images on the page that are not layout related and do not also appear on the other pages of the site. Ideally, only images unique to that page. Record the file names in one cell, separated by pipes, and put the image alt texts in the last cell, also separated by pipes.
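
The fields listed above could be gathered per page roughly as follows. This is a sketch under stated assumptions: the function names `pipedText`, `metaContent`, and `pageRow` are invented for illustration, "meta title" is read as the page's `<title>` element, and every `<img>` on the page is recorded without filtering out layout or shared images.

```php
<?php
// Hypothetical helper: trimmed text of every <$tag> element, joined by pipes.
function pipedText(DOMDocument $doc, string $tag): string
{
    $texts = [];
    foreach ($doc->getElementsByTagName($tag) as $node) {
        $texts[] = trim(preg_replace('/\s+/', ' ', $node->textContent));
    }
    return implode('|', $texts);
}

// Hypothetical helper: content of the first <meta name="$name">, or ''.
function metaContent(DOMDocument $doc, string $name): string
{
    foreach ($doc->getElementsByTagName('meta') as $meta) {
        if (strcasecmp($meta->getAttribute('name'), $name) === 0) {
            return trim($meta->getAttribute('content'));
        }
    }
    return '';
}

// Build one CSV row (9 cells, in the order of the list above) for one page.
function pageRow(string $url, string $html): array
{
    libxml_use_internal_errors(true);   // tolerate real-world markup
    $doc = new DOMDocument();
    $doc->loadHTML($html);
    $files = [];
    $alts  = [];
    foreach ($doc->getElementsByTagName('img') as $img) {
        // Keep only the file name: strip the directory and any query string.
        $path    = parse_url($img->getAttribute('src'), PHP_URL_PATH);
        $files[] = basename((string) $path);
        $alts[]  = trim($img->getAttribute('alt'));
    }
    return [
        $url,
        pipedText($doc, 'title'),
        metaContent($doc, 'description'),
        metaContent($doc, 'keywords'),
        pipedText($doc, 'h1'),
        pipedText($doc, 'h2'),
        pipedText($doc, 'h3'),
        implode('|', $files),
        implode('|', $alts),
    ];
}
```

Each returned row can be written with PHP's built-in `fputcsv()`. Limiting item 8 to images unique to a page would need a second pass over all rows, for example counting how many pages each file name appears on and dropping those seen more than once. Text that itself contains a pipe would also need escaping, which this sketch does not do.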

Please message me if there's anything you need.

Skills required:
Anything Goes, Perl, PHP, Python, Software Architecture
About the employer:
Verified


$100 in 3 days
$240 in 3 days
Hire idleswell
$136 in 6 days
Hire cyberguard
$200 in 7 days
Hire EvanKos
$121 in 7 days
$140 in 3 days
$150 in 3 days
Hire pvdenis76
$95 in 3 days
Hire ruben77
$150 in 3 days