This project is concerned with scraping data from public Facebook URLs. If the project goes well, the winner will be offered a subsequent similar project for Twitter data scraping.
The project will pay a total $375 plus a variable performance bonus of up to $125.
This project is split into three parts and requires a submission prior to the project being awarded (there is no benefit to bidding lower on this project) :-
(1) Write your own or use a commercial website scrapaing tool to provide example output from two facebook test profile URLs (target test profiles are provided below) into a text file, plus a description of how the tool works (manual/automatic) and what web-server scripting environment it operates in, specifying any subscription or other operating costs, if we were to run it on our web-severs.
The aim of the project is to collect as much publicly-available data as possible for specific profiles, including when each piece of data was created and/or [login to view URL] is no limit to the amount and type of data that you can collect (posts, image-names,friend-lists, likes, etc..... anything)
You can, if the data is available, perform the same data scraping on the immediate (first-level) "friends" of the target profile, to the same degree as the target profile, but you should not collect data from second-level friends (friends-of-friends), only first-level (immediate) friends.
There should be no limit to the amount of data you can collect, other than when there is no more. Part of the task is to explore every mechanism to capture as much data as is possible from a public profile, to the level described above.
It is important to note that we are only interested in data that is "Publicly and Freely" available. The use of the data is for analytical research purposes and is not concerned with the identity of the Facebook profile and no commercial use of data will be made.
The target profiles are:-
[login to view URL]
[login to view URL]
The project will then be awarded to the candidate that provides the best output, judged on:-
(a) quantity of scraped info,
(b) appropriateness of format of data in readiness for upload into database, and
(c) quality of field naming and content (field/content)
(d) speed of response to this request
(e) ability to work independently
Milestone 1 released: $75
(3) For the selected project winner, run the same scraping tool against 1100 profile URLs that will be supplied and submit the results, one file per target profile, with all files zipped into single file for submission.
Milestone 2 released: $125
(3) Install the scraping tool on our web-servers with documentation as to how it works - it should be simple and should be able to read a list of profiles and execute the scraping into individual text files.
Project Completed : Final Milestone released: $175
13 freelancers are bidding on average $509 for this job
I am confident I am the right candidate for this project as I have done many similar projects in the past. With years of experience in this field, I believe this project will be very easy for me.