Screen scrape data collection from affiliate sites

IN PROGRESS
Bids
19
Avg Bid (GBP)
£536
Project Budget (GBP)
£250 - £750

Project Description:
I would like an automatic web scraping software built to get affiliate data.

It will need to scrape the following sites:

AppThemes
BluChic
Magazine3
Gabfire
Themify
FameThemes
ElegantThemes
CSSIgniter
ThemeFuse
MyThemeShop
Obox Themes
ThemeTrust
Tokokoo
Theme Furnace
Organic Themes
Event Manager Blog
WPZOOM
Theme Junkie
Templatic
ColorLabs
TeslaThemes
Theme Spectrum
UpThemes
AppifyWP
InkThemes
HermesThemes
Engine Themes
DailyWP
GavickPro
DIYthemes
Headway
StudioPress
Cobalt Apps
Premium Press
WP Engine
ThemeForest
Mojo Themes

Getting template information and posting it into my WordPress website under a custom post type I have created.

You will need to get:

- Theme title
- Features
- Main image
- Demo & Download Link + Adapt it to have my affiliate code in
- Tags
- Category


It will then need to do the following:

Add a new “Theme”
Title = Theme Name
Body = Theme features / bullet points
Short Description = The description of the theme from affiliate
Add tags relevant to the theme
Add the theme category
Add the Affiliate from the list
Upload the image as “Featured Image”
Demo Link format is themetitle-demo, i.e. http://wpthemify.com/go/sympathique-demo
Download link format it themetitle-download, i.e.
http://wpthemify.com/go/sympathique-download
Yoast:
Focus Keyword = #ThemeTitle #AffiliateName
SEO Title = #ThemeTitle #AffiliateName - WPThemify.com
Meta description = Meta description from affiliate site
Same for Facebook + Google in social


Output:
Will be a report on number of new items, number of duplicates, number of themes with validation error (plus details of those themes) in a CSV delivered to email. Must be executable from a cron job so it can run daily.

Validation:
- Must be a unique theme that is not already on the website (Name + Affiliate)
- Must have all fields
- Must have affiliate link, plus validate it and returns a 200 http response

Skills required:
Data Mining, PHP, Software Architecture, Web Scraping
About the employer:
Verified
Public Clarification Board
Bids are hidden by the project creator. Log in as the employer to view bids or to bid on this project.
You will not be able to bid on this project if you are not qualified in one of the job categories. To see your qualifications click here.


£ 257
in 5 days
£ 736
in 10 days
£ 747
in 21 days
Hire ithinksolutions
£ 721
in 30 days
£ 526
in 20 days
£ 750
in 45 days
£ 315
in 3 days
£ 750
in 13 days
£ 555
in 3 days
£ 444
in 10 days