This project involves writing an ETL tool specifically for conversion to and from the STATA data format. STATA is a statistical package with native functions for loading and writing text files. However, the data we deal with is in the 100-million-record range, and these STATA functions are simply too slow and do not handle many data formats. Fortunately, STATA has a plugin API.
The project should therefore, at a minimum, replace the existing STATA functions and offer more than they do.
In addition, we have many other software development partnership opportunities, so the ideal bidder should have depth in other areas of application development, such as web applications. We want to use this project to evaluate the quality of work and the process of the development vendor we choose.
We expect the following deliverables:
- design document of the ETL tool
- the plugin will have to convert to and from, at a minimum, the following formats:
  + delimited text
  + fixed-width text
  + XML format
  + Excel format
- the plugin must perform better than the native STATA functions
- deployment/user documentation
- test plan
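
To make the two text formats in the list above concrete, here is a minimal sketch of turning delimited and fixed-width input into records. It is an illustration only and does not use the STATA plugin API; the function names `parse_delimited` and `parse_fixed_width` and the `(name, start, end)` column layout are assumptions made up for this example, and a real plugin handling 100 million records would need streaming I/O and typed columns.

```python
import csv
import io


def parse_delimited(text, delimiter=","):
    """Yield one dict per data row, keyed by the header row (hypothetical helper)."""
    reader = csv.DictReader(io.StringIO(text), delimiter=delimiter)
    for row in reader:
        yield dict(row)


def parse_fixed_width(text, columns):
    """Yield one dict per non-blank line (hypothetical helper).

    columns is a list of (name, start, end) tuples with 0-based,
    end-exclusive character offsets; values are whitespace-stripped strings.
    """
    for line in text.splitlines():
        if not line.strip():
            continue  # skip blank lines
        yield {name: line[start:end].strip() for name, start, end in columns}


if __name__ == "__main__":
    for record in parse_delimited("id|name\n1|Ann\n2|Bob\n", delimiter="|"):
        print(record)
    layout = [("id", 0, 3), ("name", 3, 8)]
    for record in parse_fixed_width("001Ann  \n002Bob  \n", layout):
        print(record)
```

Both helpers return every value as a string; mapping those strings to STATA variable types would be part of the design document.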