Parse genetic data file and upload to SQL Server 2008 R2

CLOSED
Bids
2
Avg Bid (USD)
$515
Project Budget (USD)
$30 - $5000

Project Description:
We require a contractor to develop PERL code that will parse and upload the following example file to a SQL Server 2008 R2 instance. The database design is required for this project and must be normalized to minimize redundancy (i.e., multiple tables with unique IDs and foreign keys). We will import many files of this type so the code must obtain and assign unique IDs. Database design must also incorporate appropriate indexes.

The example file may be accessed through the vendor's FTP server site at:

<ftp://ftp2.completegenomics.com/YRI_trio/ASM_Build36/NA19240/>.

It is located within the compressed TAR file named: GS19240-1100-36-ASM-VAR-files.tar. Once extracted, follow the directory structure below:

.\GS19240-1100-36-ASM

GS0028-DNA_C01

ASM

Extract the compressed BZ2 file named: var-GS19240-1100-36-ASM.tsv.bz2. The table contains the header rows which must be contained in one or more separate tables.

Skills required:
MySQL, Perl, Software Architecture, SQL
About the employer:
Verified
Public Clarification Board
Bids are hidden by the project creator. Log in as the employer to view bids or to bid on this project.
You will not be able to bid on this project if you are not qualified in one of the job categories. To see your qualifications click here.