I need to gather some data from the CMIE CAPEX website.
The task is very simple. For each Indian district, the site lists the number of capital investment projects that were active in a particular quarter of the year. One just needs to click through the list of districts to display a table for each one. This table needs to be copied, pasted into a spreadsheet and saved. We can use Google Docs for that instead of Excel. No further processing of the data is needed.
This task needs to be repeated three times for each district to get different kinds of project counts (active projects, stalled projects and balance of outstanding projects).
There are roughly 600 districts in India, so the full job is 600*3 = 1800 click-and-copy-paste operations. Having done it myself on a subset of districts, I reckon that 30-40 districts can be done in one hour. We will start out with a subset of only 200 districts, which comes to 200*3 = 600 click-and-copy-paste operations in total.
The only difficulty is that one needs a free account on the CMIE website to get access to the data, and each account is restricted to only 50 pageviews. One therefore needs to set up multiple email logins: the full set of 1800 tables would need 1800/50 = 36 accounts.
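To make the numbers above easy to check, here is a rough sketch of the arithmetic in Python. It assumes that each table costs exactly one pageview, which is how the 36-account figure comes out; the function name `workload` is only illustrative.

```python
import math

TABLE_TYPES = 3             # active, stalled, balance of outstanding projects
PAGEVIEWS_PER_ACCOUNT = 50  # per-account limit mentioned in this posting

def workload(districts, table_types=TABLE_TYPES,
             views_per_account=PAGEVIEWS_PER_ACCOUNT):
    """Return (copy-paste operations, accounts needed) for a number of districts.

    Assumption: one table equals one pageview, so the accounts needed are the
    operations divided by the per-account limit, rounded up.
    """
    operations = districts * table_types
    accounts = math.ceil(operations / views_per_account)
    return operations, accounts

if __name__ == "__main__":
    for n in (200, 600):
        operations, accounts = workload(n)
        print(f"{n} districts: {operations} operations, {accounts} accounts")
```

Under that assumption, the first batch of 200 districts works out to 600 operations and 12 accounts, and the full set of roughly 600 districts to 1800 operations and 36 accounts.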
Please get in touch if you have any further questions.
I will provide detailed instructions to whoever takes up the job and walk you through the task via Skype screen sharing. You can save the data in Google Docs so that I can have a look straight away. The first milestone will consist of getting the first 200 districts. You will get a list of the districts/states I am interested in.