I need a script written that will login to Facebook and scrape data from a specific Private group that I am a part of. I will enter the permalink ID into the tool and it will begin scraping that content until it is told to stop. It needs to store the parent comment as well as all child comments associated with it.
End goal is to be able to see the conversation history (everything that it was able to scrape) prior to the comment thread being deleted.
Example of URL:
[url removed, login to view]
The ID on the end (Permalink) is what I will enter into the tool to begin the process. A script will then begin checking that post every X (something I can configure such as seconds/minutes/Hours) and update the comments table accordingly.
This will probably need to be some type of CURL/HTTP call that will constantly check for data and update the tables accordingly.
I understand that it might miss a few comments before between scrapes if it is deleted but having most of the data is important.
This will only ever be groups that the user account is a part of so they will have access to the post.
I do not have access to a user token for the private group so I cannot use Facebook Graph. None of the admins are willing to generate a token on their behalf and due to the group being Private, permission to see the group feed is not an option in the API.
To summarize, I will enter the GROUP ID and the PERMALINK ID into the tool (a group ID that the user scraping is a part of). At every interval (something I can configure) it will check to see if the parent post has been updated since the last time it checked (such as adding/removing data). If it has been, it will store the data so that I can see the edit history of a specific Parent or Child as well).
Once it is no longer able to find the comment thread (it gets deleted) the tool will then stop running.
******If you bid on this project and intend to message me, please briefly explain how you will approach this task and the technology you plan to use.******