Summarize web pages with content scraping
Budget $30-250 USD
Job Description:
My goal is to find the highest earning pages on a freelance writing website.
I am open to suggestions on how best to accomplish my goal. Here is my solution:
===
I need a PHP or Python program that will create a simple report output as a HTML table with the following column titles:
Page Title (hyperlinked to the page being scraped)
Content Creator
Content Manager
Amount Earned
===
I have attached 2 sample images from the website that should help you understand what needs to be scraped. I will provide exact and clear information when this project starts.
On this website, every user has a public member area page that links to the pages they have created. Contained on each page is all of the information that needs to be scraped including: The page title, the username of the person who created the page the person who manages the content and the total amount of money that a page has earned.
I would like to input a series of usernames. I would like a plain, simple HTML table output. I would like every row of the table stored in a simple mysql database with the date that the row was inserted for a future addition.
===
Speed is very important. Only 1 request for the page should be made and any tricks that you have to make the script work faster will be appreciated.
This should be a simple script for someone who knows how to scrape content from a website, store it in a mysql database and output that data in a simple html table.
I will not pay more than $100 for this project.