Completed

SEO HTML Web Scraper/Parser and Data Extraction

This project was successfully completed by cosmicwheels for $2200 USD in 14 days.

Get free quotes for a project like this
Employer working
Completed by:
Skills Required
Project Budget
$1000 - $3000 USD
Completed In
14 days
Total Bids
21
Project Description

This project is for a web scraper tool. The entire project can be in Python or PHP. I'm interested to hear your thoughts on which is the preferred language.

The idea is to start with a web page that requests email and url on a web form. Once submit is pressed, on-page SEO values such as title, H1, meta description and more would be analyzed for their existence and character counts, all using AJAX/jQuery (no page refresh).

Complete list of data to be analyzed and parsed:
- meta title, description, keyword, robots (count characters on title and desc)
- title and description should be compared to url for keywords
- canonical url exist? what is it?
- is seo friendly url? no variables such as [url removed, login to view]
- facebook open graph tags exist?
- twitter card tags exist?
- h1, h2, h3, number times used and contents
- alt tags on images and contents along with image filename
- existence of facebook, twitter, pinterest, linkedin, youtube, google+ links
- presence of any inline code (javasript and css)
- existence of [url removed, login to view] file at root directory
- does flash exists?
- count W3C validation errors
- count backlinks, get mozrank using SEOMoz API
- grab top keywords using SEMRush

Code should handle large volume as it will be used in a CRM in the future.

Data extracted should have the ability to add to a mysql database and a export to a PDF for display and sent to the email from the starting web form.

See these links for PHP examples:
[url removed, login to view]
[url removed, login to view]

This is a good example of a live site:
[url removed, login to view]

Thank you for reading this project. I'm looking to find a coder for a long term relationship on future projects. NDA will be required to start work and I prefer US/UK developers.

Please let me know if you have any questions on the requirements.

Looking to make some money?

  • Set your budget and the timeframe
  • Outline your proposal
  • Get paid for your work

Hire Freelancers who also bid on this project

    • Forbes
    • The New York Times
    • Time
    • Wall Street Journal
    • Times Online