Convert the financial data in the Moody/Mergent manuals into a custom dataset (likely from the beginning of each manual to roughly 1955):
-- Industrial Manual since 1920
-- Public Utility Manual since 1914
-- Transportation Manual since 1909
-- Bank & Finance Manual since 1928
This can be done entirely by data entry, but hopefully some type of OCR technology can get the project started, supplemented by a data entry service to finish and check all the data points before our final review.
We picture this project taking place in two stages. First, a test phase in which we do 2-3 years of manuals and then review the database. Then, if it works out, we move forward on all 30+ years of data.
In the attachments I have supplied a few sample pages from some of the manuals. I tried to include the table of contents of a few sample manuals so that potential bidders can see the size of each manual. The Excel document contains a subset of what the database will look like. The companies listed are only those in the CRSP database; the actual manuals contain thousands more companies each year.
Another point worth mentioning is that the financial fields are named very inconsistently, and the companies that provided data vary a fair amount in the format they submitted it in. For example, the top line on the Income Statement may be called Revenue, Sales, Total Income, Gross Income, Operating Income, etc., and it may or may not be net of operating costs. Another example is PPE: some companies report PPE in one line, while others break out Real Estate, Machinery, Mining Property, Equipment, etc. into separate lines.
I am not sure which of two approaches is best. The first is to enter every field as it is labeled; once that is done, I would go through the process of consolidating (this would reduce the chance of mislabeling, but the number of fields would grow dramatically).
The second is for me to come up with a set of instructions (as simple as possible) to guide those doing the data entry in correctly consolidating fields that may carry multiple labels (Sales vs. Revenue vs. Total Income, etc.).
The second option may be difficult if the work is outsourced overseas, and also difficult for the OCR if one is used. So we may have no choice but to enter each field as it is labeled. We can discuss this more once the project starts.
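To make the "enter as labeled, consolidate later" idea concrete, here is a rough sketch of how the consolidation step might work after the raw entry is done. Everything in it is an assumption for illustration only: the consolidated field names (`top_line`, `ppe`), the label mapping, the row layout, and the sample company and numbers are all made up and would need to be agreed on during the test phase.

```python
# Hypothetical mapping from labels as printed in the manuals to
# consolidated field names. The real list would be far longer.
LABEL_MAP = {
    "revenue": "top_line",
    "sales": "top_line",
    "total income": "top_line",
    "gross income": "top_line",
    "operating income": "top_line",
    "ppe": "ppe",
    "real estate": "ppe",
    "machinery": "ppe",
    "mining property": "ppe",
    "equipment": "ppe",
}

# Fields whose broken-out lines can safely be added together.
# top_line is NOT additive: two different top-line labels for the same
# company and year need a human to decide which one to keep.
ADDITIVE_FIELDS = {"ppe"}


def consolidate(rows):
    """Consolidate rows entered exactly as labeled.

    rows: iterable of (company, year, raw_label, value).
    Returns (consolidated, needs_review), where consolidated maps
    (company, year, field) to a value and needs_review lists the raw
    rows that could not be placed automatically.
    """
    consolidated = {}
    needs_review = []
    for company, year, raw_label, value in rows:
        field = LABEL_MAP.get(raw_label.strip().lower())
        key = (company, year, field)
        if field is None:
            # Label not in the mapping yet: leave it for manual review.
            needs_review.append((company, year, raw_label, value))
        elif field in ADDITIVE_FIELDS:
            consolidated[key] = consolidated.get(key, 0) + value
        elif key in consolidated:
            # Second non-additive line for the same field: do not guess.
            needs_review.append((company, year, raw_label, value))
        else:
            consolidated[key] = value
    return consolidated, needs_review


# Made-up sample rows, not taken from any manual.
rows = [
    ("Example Co", 1930, "Sales", 1000),
    ("Example Co", 1930, "Real Estate", 200),
    ("Example Co", 1930, "Machinery", 300),
    ("Example Co", 1930, "Goodwill", 50),
]
consolidated, needs_review = consolidate(rows)
```

The point of this design is that the raw entry stays untouched, so a wrong mapping can be corrected later without re-keying anything. Note that it does not address whether a given top line is net of operating costs; that would still need to be recorded separately.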
(We can also get copies of the manuals on DVD, but they are digital images (like photocopies) of every page and the quality is poor, so we assume microfiche is a better option.)
1 database with 30 years of data 1925-1955
Sept or October 2011
First Quarter 2012
42 freelancers are bidding on average $5251 for this job
Dear Sir We can do this [url removed, login to view] are ready for [url removed, login to view] have strong experience on same kind of projects. Please check PMB. We have some problem in project. Waiting for your positive reply. Thank you
Hi, I am an expert in MS-Office, OCR Tools. I am also good at English. All your work will be proofread and delivered with good quality. I will stick to the deadlines. Thanks and Regards, Venkat.
Sir, I am a new freelancer but am an experienced CPA who has worked with Excel and Word for almost 20 years now. Can deliver quality output. Can work immediately. Thanks.