Build a FB Scraper using Macros, PHP cURL or anything else!

Status: In progress
Bids: 18
Avg Bid (USD): $1119
Project Budget (USD): $750 - $1500

Project Description:
** As FB uses Ajax to load its content, it is important that you make sure that you are able to do this job before placing a bid. **

Summary

This project is to build a scraper for Facebook's News Feed ("NF") ads, using macros (e.g. iMacros) or PHP cURL.
--
Identifying NF Ads

Facebook's NF ads are the ones that appear between your friends' status updates (see http://tiny.cc/ystr6w). The challenge is that there are several different formats of NF ads (see http://tiny.cc/cttr6w). What we're looking to do is identify the story/update DIVs that contain the word "Sponsored", as this is the common attribute among all NF ads, and then extract data from those DIVs only.
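
The "Sponsored" filter described above could be sketched as follows, in Python and on already-rendered HTML. This is only an illustration: the `story` class name and the sample markup are assumptions, not Facebook's real markup, and the real page would first have to be rendered by a browser or macro because the feed is loaded via Ajax.

```python
from html.parser import HTMLParser


class SponsoredStoryFinder(HTMLParser):
    """Collect the text of each story <div> that mentions "Sponsored"."""

    def __init__(self, story_class="story"):
        super().__init__()
        self.story_class = story_class  # hypothetical class name (assumption)
        self.depth = 0       # <div> nesting depth inside the current story; 0 = outside
        self.chunks = []     # text pieces of the story currently being read
        self.sponsored = []  # text of the stories that contain "Sponsored"

    def handle_starttag(self, tag, attrs):
        if tag != "div":
            return
        if self.depth:
            self.depth += 1  # nested <div> inside a story
        elif self.story_class in (dict(attrs).get("class") or "").split():
            self.depth = 1   # a new story starts here
            self.chunks = []

    def handle_endtag(self, tag):
        if tag != "div" or not self.depth:
            return
        self.depth -= 1
        if self.depth == 0:  # the story <div> just closed
            text = " ".join(" ".join(self.chunks).split())
            if "Sponsored" in text:
                self.sponsored.append(text)

    def handle_data(self, data):
        if self.depth:
            self.chunks.append(data)


def find_sponsored_stories(html, story_class="story"):
    """Return the normalized text of every story <div> containing "Sponsored"."""
    finder = SponsoredStoryFinder(story_class)
    finder.feed(html)
    finder.close()
    return finder.sponsored


# Made-up sample markup: one ordinary story and one sponsored story.
SAMPLE_HTML = (
    '<div class="story"><div>Alice</div> posted a photo</div>'
    '<div class="story"><span>Sponsored</span>'
    '<div><a href="http://example.com">Buy now</a></div></div>'
)
sponsored_stories = find_sponsored_stories(SAMPLE_HTML)
```

Matching on the visible text rather than on a specific tag or class is what lets the same check cover the different ad formats.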

Here are a few sample NF ads:

http://s23.postimg.org/8kgpalvln/Sample1.gif
http://s23.postimg.org/dx5joqji3/Sample2.gif
http://s23.postimg.org/7xhsl2ypn/Sample3.gif

We will provide you with the accounts that display these ads.
--
Logging Into Accounts

When logging in to each account, the macro must use the proxy assigned to that account; we will provide the proxies. When switching between accounts, the macro must clear all Temporary Internet Files (including cookies) so that the accounts cannot be linked to one another.
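
For an HTTP-client approach (as opposed to a browser macro), the isolation described above might look like the Python sketch below: each account gets its own proxy and its own empty cookie jar. The account names and proxy addresses are placeholders, and a browser-based macro would need the equivalent step of a fresh profile or cleared cache per account.

```python
import urllib.request
from http.cookiejar import CookieJar


def make_account_session(proxy_url):
    """Return (opener, cookie_jar) routed through proxy_url with no shared cookies."""
    jar = CookieJar()  # fresh and empty: nothing carried over from another account
    opener = urllib.request.build_opener(
        urllib.request.ProxyHandler({"http": proxy_url, "https": proxy_url}),
        urllib.request.HTTPCookieProcessor(jar),
    )
    return opener, jar


# Hypothetical account-to-proxy mapping (placeholder values).
ACCOUNT_PROXIES = {
    "account_a": "http://127.0.0.1:8001",
    "account_b": "http://127.0.0.1:8002",
}

# One isolated session per account; discard it when switching accounts.
sessions = {
    name: make_account_session(proxy) for name, proxy in ACCOUNT_PROXIES.items()
}
```

Building a new opener and jar for every account, rather than reusing one and clearing it, makes it harder to leak cookies between accounts by accident.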
--
Extracting NF Ads

The scraper will need to keep scrolling toward the bottom of the page until it encounters one of the following messages:
* Old FB Format: "Add your friends to see more of their photos and stories in your news feed."
* New FB Format: "There are no more posts to show right now."
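
The scroll-until-done loop could be sketched in Python as below. The two callables, `get_page_text` and `scroll_once`, are hypothetical hooks into whatever browser automation is used, and the `max_scrolls` cap is an added safeguard (an assumption, not a stated requirement) so the loop cannot run forever if neither message appears.

```python
END_MARKERS = (
    "Add your friends to see more of their photos and stories in your news feed.",
    "There are no more posts to show right now.",
)


def reached_end(page_text):
    """True if either end-of-feed message is present in the page text."""
    return any(marker in page_text for marker in END_MARKERS)


def scroll_to_end(get_page_text, scroll_once, max_scrolls=500):
    """Scroll until an end marker appears.

    Returns True if a marker was found, False if max_scrolls was used up first.
    """
    for _ in range(max_scrolls):
        if reached_end(get_page_text()):
            return True
        scroll_once()
    return reached_end(get_page_text())


class FakeFeed:
    """Stand-in for a real browser: each scroll reveals one more chunk of text."""

    def __init__(self, pages):
        self.pages = pages
        self.shown = 1

    def text(self):
        return " ".join(self.pages[: self.shown])

    def scroll(self):
        self.shown = min(self.shown + 1, len(self.pages))


feed = FakeFeed(["post 1", "post 2", END_MARKERS[1]])
finished = scroll_to_end(feed.text, feed.scroll)
```

In a real run, `scroll_once` would also need to wait for the Ajax request to finish before the text is checked again.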
It will then need to identify the DIVs that contain the text "Sponsored" and:
* Identify a unique parameter (to keep track of when it was first seen, last seen, times seen)
* Extract all the content within the DIV
* Click the advertiser's link and record all URL redirects
* Load and save any images displayed within the DIV
* Record which account saw the advertisement
* Record when the advertisement was first seen
* Record when the advertisement was last seen, if the same ad has been seen several times
* Record the number of times the advertisement was seen, if the same ad has been seen several times
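
The tracking requirements in the list above (unique parameter, first seen, last seen, times seen, which account) could be sketched in Python like this. Using a hash of the advertiser link plus the ad text as the unique key is an assumption made for illustration; the names `ad_key`, `AdRecord` and `record_sighting` are hypothetical.

```python
import hashlib
from dataclasses import dataclass, field
from datetime import datetime, timezone


def ad_key(advertiser_url, ad_text):
    """Hypothetical unique key: SHA-256 of the advertiser link and the ad text."""
    raw = f"{advertiser_url}\n{ad_text}".encode("utf-8")
    return hashlib.sha256(raw).hexdigest()


@dataclass
class AdRecord:
    content: str
    first_seen: datetime
    last_seen: datetime
    times_seen: int = 0
    accounts: set = field(default_factory=set)


def record_sighting(store, advertiser_url, ad_text, account, seen_at=None):
    """Insert or update the record for one ad sighting and return it."""
    seen_at = seen_at or datetime.now(timezone.utc)
    key = ad_key(advertiser_url, ad_text)
    rec = store.get(key)
    if rec is None:
        rec = store[key] = AdRecord(ad_text, seen_at, seen_at)
    rec.first_seen = min(rec.first_seen, seen_at)
    rec.last_seen = max(rec.last_seen, seen_at)
    rec.times_seen += 1
    rec.accounts.add(account)
    return rec


# Usage: the same ad seen by two accounts at two made-up times.
store = {}
t1 = datetime(2013, 1, 1, tzinfo=timezone.utc)
t2 = datetime(2013, 1, 2, tzinfo=timezone.utc)
record_sighting(store, "http://example.com", "Buy now", "account_a", t1)
latest = record_sighting(store, "http://example.com", "Buy now", "account_b", t2)
```

The in-memory dictionary stands in for whatever storage format is chosen; the same fields map directly onto a database table or a CSV row.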

However, we do not need to record social information such as the number of likes, comments, etc. All this information will need to be saved in whichever format you prefer.

Skills required:
AJAX, Anything Goes, PHP, Web Scraping, Website Design