Project Description:
I need someone to create a database of Facebook pages with: their ID number, page name, type, and number of likes. Here's my suggested approach, which requires 2 scripts to complete. If you have a better method, by all means go ahead.
A.
First, create a script (in the language of your own preference) that will scrape data from this page http://www.facebook.com/pages/browser.php and gather Page_ids for every Facebook page.
The script needs to:
1) eliminate duplicates (each page id is duplicated several times in the source)
2) load successive pages ad infinitum
3) insert data into a database (mySQL)
B.
Next, we will need another automated script to run this query in FQL (Facebook Query Language) for every page_id returned from the previous script. We will be querying for page name, type, and number of likes, to fill the other columns of the Lexicon
https://api.facebook.com/method/fql.query?query=SELECT page_id, name, type, fan_count FROM page WHERE page_id = ?
This data should be automatically used to update the mySQL database and fill in the remaining columns.
***************
Please don't hesitate to ask questions or make suggestions. I am very open to new approaches to solving this problem.