SEO HTML Web Scraper/Parser and Data Extraction

This project was successfully completed by cosmicwheels for $2200 USD in 14 days.

Get free quotes for a project like this
Project Budget
$1000 - $3000 USD
Completed In
14 days
Total Bids
Project Description

This project is for a web scraper tool. The entire project can be in Python or PHP. I'm interested to hear your thoughts on which is the preferred language.

The idea is to start with a web page that requests email and url on a web form. Once submit is pressed, on-page SEO values such as title, H1, meta description and more would be analyzed for their existence and character counts, all using AJAX/jQuery (no page refresh).

Complete list of data to be analyzed and parsed:

- meta title, description, keyword, robots (count characters on title and desc)

- title and description should be compared to url for keywords

- canonical url exist? what is it?

- is seo friendly url? no variables such as [url removed, login to view]

- facebook open graph tags exist?

- twitter card tags exist?

- h1, h2, h3, number times used and contents

- alt tags on images and contents along with image filename

- existence of facebook, twitter, pinterest, linkedin, youtube, google+ links

- presence of any inline code (javasript and css)

- existence of [url removed, login to view] file at root directory

- does flash exists?

- count W3C validation errors

- count backlinks, get mozrank using SEOMoz API

- grab top keywords using SEMRush

Code should handle large volume as it will be used in a CRM in the future.

Data extracted should have the ability to add to a mysql database and a export to a PDF for display and sent to the email from the starting web form.

See these links for PHP examples:

[url removed, login to view]

[url removed, login to view]

This is a good example of a live site:

[url removed, login to view]

Thank you for reading this project. I'm looking to find a coder for a long term relationship on future projects. NDA will be required to start work and I prefer US/UK developers.

Please let me know if you have any questions on the requirements.

Completed by:
Skills Required

Looking to make some money?

  • Set your budget and the timeframe
  • Outline your proposal
  • Get paid for your work

Hire Freelancers who also bid on this project

    • Forbes
    • The New York Times
    • Time
    • Wall Street Journal
    • Times Online