login
Forgot?

Don't have an account? Register one now!

Login

boolean search tool to search csv files -Single and 2 Byte

Bids 
10
Avg Bid
$154 NZD
$128 USD
CLOSED
  • Project ID:

    1254396
  • Project Type:

    Fixed
  • Budget:

    $75-$175 NZD
    (Approx. $62-$146 USD)

Project Description:

I need a tool which will allow me to search through about 4000 csv files each one being about 30 mb (total is about 100 gigabytes of data) for keywords. I would like to be able to use wildcards as well. I need a visual GUI as well. also a drag and drop interface. I am attaching a sample csv file. The search should be FAST. I want the search to be fast enough to at least be able to search through all the files for a single keyword in 90 minutes.

I want to have searches like
("red barron" or "green curry" or ("hot potato" and "cold soup")) and "sour dough" and NOT "french fried potatoes"

I would like to have wild cards for prefixes, suffixes, and within the words or phrases.

Please tell me the wild card functionality you can provide in your PM bid.

Notes: If I search for the word 'wine' I don't want a word like 'swine' unless specified by some sort of wild card.

THIS TOOL MUST BE ABLE TO SEARCH 2-byte languages like Japanese, Korean, Chinese. NOTICE that in the sample input file (attached) there are both two byte and single byte text in there. This must be handled properly. It can have an option box to say whether the character set is Western or not if that will speed up Western character searches.

Of course, there should be an option for giving a name and location to the resulting output file.

THE OUTPUT FILE MUST BE (see attached sample output.csv file):
return the same fields as the input file
Date, Username,Text, location fields must all be quoted by default but there should be an option for whether a field will be quoted or not for any of the fields.
All commas except field delimters must be removed by default but there should be an option for each field to leave them in.

The key to whether the output file is in the correct format is whether it can be opened in Excel, saved by Excel, and then reopened and still have the correct number of fields. Also, Excel should read the date field as a date.

There should be some sort of progress bar to show how many files have been searched.
In the attached file there are 7 fields. The main field to be searched is field number 4. Other fields should be searchable as well however. Any field in the file should be searchable by selecting that field number (the default should be column 4, the "Text" field.)
If I have a new format for the csv files with additional columns then the tool should be flexible enough so they should also be searchable.
There should also be a Username, Location, Tweet ID range, Date Range search boxes on the tool.

Skills required:

C# Programming, C++ Programming, Java, Perl

Additional Files:

output.csv sample+Input+file.csv

Project posted by:

woody2010 Japan
(43 Reviews)

Online now

Public Clarification Board

1 messages

  • tks73

    Hello!

    Have a look at the first steps which I have done to solve your problem. In example program you can fill a list of files (drag&drop from explorer or through open file dialog), remove file from list, and you can set the encoding (for your example file - "Thai (Mac)") for look how your file "sample+Input+file.csv" will be displayed in interface (i hope everything will be displayed correct).

    To run program you will need a .Net Framework 2.0

    I can finish this project :)

    Good luck...

    Attachment: searchtool.zip

    4 months ago


If you are the project creator or one of the bidders, please Log In for more options.


Awarded Bids

aboltinsh Latvia
Facebook_980898.jpg
aboltinsh
Latvia From Latvia     Offline
  Foundation EUFreelance.com Member
 Accepted
$150 in 2 days 
$75 Milestone Requested
4 months ago
4.9

3.1

4 Reviews
77% Completion Rate
I can create this tool. See details in details in PM.

All Bids ()

wittyDeveloper India
default.jpeg
wittyDeveloper
India From India     Gold Member     Online
  Freelancer Orientation (85%, 99th percentile)
  Employer Orientation (75%, 97th percentile)
$250 in 5 days 
0
4 months ago
5.0

4.5

34 Reviews
95% Completion Rate
I'm ready to start right now.
greggfletcher Bulgaria
greggfletcher
Bulgaria From Bulgaria     Gold Member     Offline
  Freelancer Orientation (95%, 100th percentile)
$175 in 5 days 
0
4 months ago
5.0

2.0

2 Reviews
89% Completion Rate
Hello, sounds like an interesting and challenging project. I am a C# programmer with over 4 years of experience. If interested please contact me, so we can discuss further. Regards.
javagroups India
javagroups
India From India     Offline
$80 in 5 days 
$40 Milestone Requested
4 months ago
4.6

1.8

1 Review
100% Completion Rate
I can do this with parser . I will use javacc to generate parser program for wild card string search.
tomky Poland
tomky
Poland From Poland     Offline
$330 in 25 days 
0
4 months ago
5.0

1.0

1 Review
50% Completion Rate
I am sending details of my bid in the private message. Regards.
rakhitt20 India
Facebook_2870620.jpg
rakhitt20
India From India     Offline
$80 in 20 days 
0
4 months ago
5.0

1.0

1 Review
100% Completion Rate
Hi, I am capable of deliver this project with high quality and on time.Post deployment support will be provided free of cost. Regards Rakhi Guha
matoliamk India
M1.JPG
matoliamk
India From India     Offline
$100 in 20 days 
0
4 months ago
Dear sir, Please check private message.
tks73 Russian Federation
cat-1.jpg
tks73
Russian Federation From Russian Federation     Offline
$100 in 3 days 
0
4 months ago
Hello! Please see PMB - there is an example program. I use C# on VS2008. I think the best choice for searching words is regular expressions (aka Regex). Good luck!
eduardooo Romania
ompGCl819862-02.jpg
eduardooo
Romania From Romania     Offline
$120 in 7 days 
$12 Milestone Requested
4 months ago
0.0

0.0

0 Reviews
83% Completion Rate
Hi. A student at a technical university which studies computer science here. Please check the PMB.
taivt123 Viet Nam
defaultAvatar.jpg
taivt123
Viet Nam From Viet Nam     Offline
$150 in 8 days 
$150 Milestone Requested
4 months ago
- I want bid this project. - I will using vs2010 implement.