I need the project completed in 24 hours.
This programming task is part of a research project. I have an Excel file (MASTER FILE) with observations on thousands of entries (UNITS). For each entry I have an identifier code (ID) and a list of variables (POINTS). Each UNIT has at least two POINTS. An example is:
ID  POINT1  POINT2  POINT3  POINT4
1   101     102     103
2   102     104     105     101
3   105     101     102
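As a rough illustration of how the MASTER FILE rows could be held in memory, here is a minimal Python sketch. It assumes the sheet has been exported to whitespace-separated text and that IDs and POINTS are whole numbers; `parse_master` is a hypothetical helper name chosen for this example.

```python
def parse_master(text):
    """Map each ID to its list of POINTS (rows may have different lengths)."""
    units = {}
    for line in text.splitlines():
        fields = line.split()
        # Skip blank lines and the header row (its first field is not a number).
        if not fields or not fields[0].isdigit():
            continue
        unit_id, *points = (int(f) for f in fields)
        units[unit_id] = points
    return units


sample = """ID POINT1 POINT2 POINT3 POINT4
1 101 102 103
2 102 104 105 101
3 105 101 102
"""
units = parse_master(sample)
print(units[2])
```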
I need a program that automates the following procedures for the thousands of UNITS.
STEP 1: For each UNIT, I need to create all possible combinations of POINTS, meaning every ordered pair, including each POINT paired with itself. For the above example, the combinations for ID 1 are:
101-101, 101-102, 101-103, 102-101, 102-102, 102-103, 103-101, 103-102, 103-103
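The pairing in STEP 1 can be sketched in Python with `itertools.product`, which yields every ordered pair, self-pairs included. The function name `point_pairs` is an assumption made for this example.

```python
from itertools import product


def point_pairs(points):
    """Return every ordered pair of POINTS, including self-pairs."""
    return list(product(points, repeat=2))


pairs = point_pairs([101, 102, 103])
# Prints the nine combinations listed for ID 1, in the same order.
print(", ".join(f"{a}-{b}" for a, b in pairs))
```

A UNIT with n POINTS produces n * n combinations, so ID 2 in the example (four POINTS) would produce 16.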
STEP 2: I have an Excel file (USE FILE) that lists all possible combinations and the corresponding value for each combination.
The file looks like this:
101 101 0.5
101 102 0.06
101 103 0
102 101 0.22
102 102 0.03
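One possible way to hold the USE FILE for fast lookups is a dictionary keyed by the pair of POINTS. The sketch below assumes the file has been exported to whitespace-separated text with exactly three fields per row; `parse_use_file` is a hypothetical name.

```python
def parse_use_file(text):
    """Map each (X, Y) combination to its value."""
    values = {}
    for line in text.splitlines():
        fields = line.split()
        if len(fields) != 3:
            continue  # assumption: rows that do not have three fields are ignored
        x, y, value = int(fields[0]), int(fields[1]), float(fields[2])
        values[(x, y)] = value
    return values


sample = """101 101 0.5
101 102 0.06
101 103 0
102 101 0.22
102 102 0.03
"""
use_values = parse_use_file(sample)
print(use_values[(102, 101)])
```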
I need to compare the combinations of POINTS created from the MASTER FILE (STEP 1) with the values in the USE FILE. If the average of the values for the combinations (X-Y) and (Y-X) is above 0.1, the combination is written to a newly created output file (OUTPUT); otherwise it is left out of the output file. For example, 101-102 has a value of 0.06 and 102-101 has a value of 0.22. The average of the two is 0.14, which is above the threshold of 0.1, so the combinations 101 102 and 102 101 will both be written to the output file. The output file MUST be a text file that has the combinations in two columns. An example is:
101 102
102 101
So each UNIT (ID) will have its own .txt file.
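The STEP 2 rule can be sketched as below. This is only an illustration under stated assumptions: the USE FILE values are already in a dictionary keyed by (X, Y), a combination is skipped if either direction is missing from the USE FILE, and the two columns are separated by a single space. The names `keep_by_average`, `pairs_to_text` and `write_output` are hypothetical.

```python
from itertools import product

THRESHOLD = 0.1


def keep_by_average(points, use_values, threshold=THRESHOLD):
    """Keep (X, Y) when the mean of value(X-Y) and value(Y-X) exceeds the threshold."""
    kept = []
    for x, y in product(points, repeat=2):
        if (x, y) not in use_values or (y, x) not in use_values:
            continue  # assumption: combinations missing from the USE FILE are dropped
        average = (use_values[(x, y)] + use_values[(y, x)]) / 2
        if average > threshold:
            kept.append((x, y))
    return kept


def pairs_to_text(pairs):
    """Format the kept combinations as two space-separated columns."""
    return "".join(f"{x} {y}\n" for x, y in pairs)


def write_output(path, pairs):
    """Write one OUTPUT text file for a single UNIT."""
    with open(path, "w") as handle:
        handle.write(pairs_to_text(pairs))


# Values copied from the USE FILE excerpt above.
use_values = {
    (101, 101): 0.5, (101, 102): 0.06, (101, 103): 0.0,
    (102, 101): 0.22, (102, 102): 0.03,
}
kept = keep_by_average([101, 102], use_values)
print(pairs_to_text(kept), end="")
```

Note that the self-pair 101 101 is also kept here, because its value (0.5) is its own average; whether self-pairs belong in the OUTPUT file is something the brief does not settle.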
STEP 3: A publicly available program uses the OUTPUT file from STEP 2 above as its input to calculate a certain value (INDEX). The program is an .exe file that runs in a Windows console, like the command prompt (a black screen with questions that are answered by typing). The first question asks for the name of the input file (the OUTPUT from STEP 2 above). IMPORTANT: The extension of the input file (OUTPUT) must be changed from .txt to .dat. The next two questions ask for specific numbers, which will be identical. I will provide you with these numbers. The program then runs and stops automatically once it is finished. The results are stored in a .dat file named "modules.dat". IMPORTANT: Each time the program runs, the results are stored in a file with the same name, "modules.dat". If the program previously ran in the same folder, the newly created file will overwrite the existing one.
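One way STEP 3 might be automated is sketched below with Python's `subprocess` module. It rests on assumptions that would need checking against the real program: that the .exe accepts its three answers through piped standard input (some console programs read the keyboard directly and will not), and that it writes "modules.dat" into its working folder. The helper names and the `modules_<name>.dat` renaming scheme are hypothetical; the renaming is there only so the next run does not overwrite the previous result.

```python
import shutil
import subprocess
from pathlib import Path


def prepare_input(output_txt, workdir):
    """Copy the STEP 2 OUTPUT file into workdir with a .dat extension."""
    workdir = Path(workdir)
    workdir.mkdir(parents=True, exist_ok=True)
    dat_path = workdir / (Path(output_txt).stem + ".dat")
    shutil.copyfile(output_txt, dat_path)
    return dat_path


def build_answers(dat_name, numbers):
    """Build the text typed at the prompts: file name, then the two numbers."""
    return "\n".join([dat_name, *(str(n) for n in numbers)]) + "\n"


def run_program(command, dat_path, numbers):
    """Run the program in the input file's folder and rename its result file."""
    dat_path = Path(dat_path)
    subprocess.run(
        command,                      # e.g. a list holding the absolute path of the .exe
        input=build_answers(dat_path.name, numbers),
        text=True,
        cwd=dat_path.parent,          # the program writes modules.dat here
        check=True,                   # raise if the program exits with an error
        capture_output=True,
    )
    result = dat_path.parent / "modules.dat"
    renamed = result.with_name(f"modules_{dat_path.stem}.dat")
    result.replace(renamed)           # protect the result from the next run
    return renamed
```

A call would look something like `run_program([exe_path], prepare_input("unit_1.txt", "runs"), numbers)`, where `exe_path` is the absolute path to the program and `numbers` holds the two values to be provided.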
The result file "modules.dat" is a text file; the important information is written on its last line. An example of a results file is:
1 4 7 5 2 7.000000 5.000000 2.000000 --- 4 2 1 3
2 10 16 15 1 16.000000 15.000000 1.000000 --- 100 6 9 8 10 5 11 12 7 13
4 4 6 5 1 6.000000 5.000000 1.000000 --- 23 21 22 20
# Modularity = 0.505487
The important information is the modularity INDEX, 0.505487. I need to run the program for each and every UNIT or ID (thousands).
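Reading the INDEX from a results file could look like the sketch below. It assumes the last non-empty line always has the form `# Modularity = <number>`, as in the example above; `read_modularity` is a hypothetical name, and it returns `None` when the line does not match so that a failed run can be spotted.

```python
import re

# Matches a line such as "# Modularity = 0.505487".
MODULARITY_RE = re.compile(
    r"^#\s*Modularity\s*=\s*([-+]?\d*\.?\d+(?:[eE][-+]?\d+)?)$"
)


def read_modularity(text):
    """Return the INDEX from the last non-empty line, or None if absent."""
    lines = [line.strip() for line in text.splitlines() if line.strip()]
    if not lines:
        return None
    match = MODULARITY_RE.match(lines[-1])
    return float(match.group(1)) if match else None


sample = """1 4 7 5 2 7.000000 5.000000 2.000000 --- 4 2 1 3
2 10 16 15 1 16.000000 15.000000 1.000000 --- 100 6 9 8 10 5 11 12 7 13
4 4 6 5 1 6.000000 5.000000 1.000000 --- 23 21 22 20
# Modularity = 0.505487
"""
print(read_modularity(sample))
```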
STEP 4: For each UNIT or ID, I need to read the # modularity (INDEX) and write it back to the original file (MASTER FILE) or a new file as long as the ID is retained so I can identify the UNIT. The target file should look like this:
STEP 5: I want to repeat the steps above with a small change in STEP 2. Instead of averaging the values of the two combinations (X-Y) and (Y-X), I simply want to take each combination by itself and keep it if its value is above 0.1.
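The STEP 5 variant of the filter can be sketched as follows. As an assumption, a combination that is missing from the USE FILE is treated as having a value of 0 and is therefore dropped; `keep_by_own_value` is a hypothetical name.

```python
from itertools import product


def keep_by_own_value(points, use_values, threshold=0.1):
    """Keep (X, Y) when its own value exceeds the threshold, ignoring (Y, X)."""
    return [
        (x, y)
        for x, y in product(points, repeat=2)
        if use_values.get((x, y), 0.0) > threshold  # assumption: missing means 0
    ]


# Values copied from the USE FILE excerpt in STEP 2.
use_values = {
    (101, 101): 0.5, (101, 102): 0.06, (101, 103): 0.0,
    (102, 101): 0.22, (102, 102): 0.03,
}
kept = keep_by_own_value([101, 102], use_values)
print(kept)
```

Under this rule 102 101 (0.22) is kept while 101 102 (0.06) is not, so the OUTPUT file is no longer symmetric the way it is under the averaging rule.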
Additional Project Description:
07/28/2013 at 19:58 HKT
For STEP 2, I also need the number (count) of combinations saved in the output file. The final deliverable will have three columns, as follows:
ID - Number of combinations from STEP 2 - the INDEX from STEP 4
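The three-column deliverable could be assembled along these lines. The sketch assumes a tab-separated text layout with a header row, which the brief does not specify, and the counts and INDEX values in the demo are made-up placeholders rather than real results. The function and column names are hypothetical.

```python
import csv
import io


def deliverable_rows(counts, indexes):
    """Pair each ID with its STEP 2 count and its STEP 4 INDEX (None if missing)."""
    return [(unit_id, counts[unit_id], indexes.get(unit_id)) for unit_id in sorted(counts)]


def write_deliverable(rows, handle):
    """Write the rows as tab-separated text; a missing INDEX becomes an empty cell."""
    writer = csv.writer(handle, delimiter="\t", lineterminator="\n")
    writer.writerow(["ID", "COMBINATIONS", "INDEX"])
    writer.writerows(rows)


# Placeholder values for illustration only.
counts = {1: 3, 2: 5}
indexes = {1: 0.505487}

buffer = io.StringIO()
write_deliverable(deliverable_rows(counts, indexes), buffer)
print(buffer.getvalue(), end="")
```

Leaving the INDEX cell empty for an ID whose run failed keeps the ID in the table, so the UNIT can still be identified and rerun.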