In Progress

boolean search tool to search csv files -Single and 2 Byte

I need a tool which will allow me to search through about 4000 csv files each one being about 30 mb (total is about 100 gigabytes of data) for keywords. I would like to be able to use wildcards as well. I need a visual GUI as well. also a drag and drop interface. I am attaching a sample csv file. The search should be FAST. I want the search to be fast enough to at least be able to search through all the files for a single keyword in 90 minutes.

I want to have searches like

("red barron" or "green curry" or ("hot potato" and "cold soup")) and "sour dough" and NOT "french fried potatoes"

I would like to have wild cards for prefixes, suffixes, and within the words or phrases.

Please tell me the wild card functionality you can provide in your PM bid.

Notes: If I search for the word 'wine' I don't want a word like 'swine' unless specified by some sort of wild card.

THIS TOOL MUST BE ABLE TO SEARCH 2-byte languages like Japanese, Korean, Chinese. NOTICE that in the sample input file (attached) there are both two byte and single byte text in there. This must be handled properly. It can have an option box to say whether the character set is Western or not if that will speed up Western character searches.

Of course, there should be an option for giving a name and location to the resulting output file.

THE OUTPUT FILE MUST BE (see attached sample [url removed, login to view] file):

return the same fields as the input file

Date, Username,Text, location fields must all be quoted by default but there should be an option for whether a field will be quoted or not for any of the fields.

All commas except field delimters must be removed by default but there should be an option for each field to leave them in.

The key to whether the output file is in the correct format is whether it can be opened in Excel, saved by Excel, and then reopened and still have the correct number of fields. Also, Excel should read the date field as a date.

There should be some sort of progress bar to show how many files have been searched.

In the attached file there are 7 fields. The main field to be searched is field number 4. Other fields should be searchable as well however. Any field in the file should be searchable by selecting that field number (the default should be column 4, the "Text" field.)

If I have a new format for the csv files with additional columns then the tool should be flexible enough so they should also be searchable.

There should also be a Username, Location, Tweet ID range, Date Range search boxes on the tool.

Skills: C# Programming, C++ Programming, Java, Perl

See more: search csv, search word csv file, byte csv file, boolean csv files, search csv file, boolean search phrases, words number prefixes, search prefixes suffixes, search red, search prefixes, red search, prefixes word search, prefixes suffixes search, notice leave, correct username, to search, about wine, visual search, soup, search files, search file, potato, hot cold, french japanese, CSV

About the Employer:
( 53 reviews ) Beppu shi, Japan

Project ID: #1254396

Awarded to:

aboltinsh

I can create this tool. See details in details in PM.

$150 NZD in 2 days
(10 Reviews)
4.6

9 freelancers are bidding on average $143 for this job

tomky

I am sending details of my bid in the private message. Regards.

$330 NZD in 25 days
(79 Reviews)
5.7
greggfletcher

Hello, sounds like an interesting and challenging project. I am a C# programmer with over 4 years of experience. If interested please contact me, so we can discuss further. Regards.

$175 NZD in 5 days
(3 Reviews)
2.2
javagroups

I can do this with parser . I will use javacc to generate parser program for wild card string search.

$80 NZD in 5 days
(1 Review)
1.8
rakhitt20

Hi, I am capable of deliver this project with high quality and on time.Post deployment support will be provided free of cost. Regards Rakhi Guha

$80 NZD in 20 days
(1 Review)
1.0
taivt123

- I want bid this project. - I will using vs2010 implement.

$150 NZD in 8 days
(0 Reviews)
0.0
eduardooo

Hi. A student at a technical university which studies computer science here. Please check the PMB.

$120 NZD in 7 days
(0 Reviews)
0.0
tks73

Hello! Please see PMB - there is an example program. I use C# on VS2008. I think the best choice for searching words is regular expressions (aka Regex). Good luck!

$100 NZD in 3 days
(0 Reviews)
0.0
matoliamk

Dear sir, Please check private message.

$100 NZD in 20 days
(0 Reviews)
0.0