Text similarity comparison + data analyzer

Closed

Need to create software(script) that will be store, filter and analize data. The number of records in the database can be up to hundreds of thousands of records, you should use the most optimized algorithms and development technologies.(New data will be upload every day) The process of working has next structure: .CSV data file -> Database and text comparation analizer-> Processing macros and filters -> output in .TXT format 1 step) Loading data from a .CSV file with a fixed structure in the current database 2) After data uploaded, it should compare one field(text) for each record with another records in !ALL DATABASE for similarity (!!! This is the most difficult part of this project, it need to compare two text(two records), and return similirity of it in percents) Example of working you can find at: [url removed, login to view] [url removed, login to view] (There is russian language interface(can be translated in Google translate)) After current record was compared with all records in DB, it add info of MAXIMUM percent of similiry and ID of the record that is most similiar to. So we saved this info for each record in db. 3) One record has next structure: Field 1;Field 2;Field 3;...;Max percent of simility;ID of most similiar record 4)The ability to create flexible filters (macros) to sort the data (filters (macros) should be able to save) Macro consists of several filters (fields has different types: date, text, numerical).. For example Macro = ( Field 1 contains "John" And Field 4 is equal to "address" OR field 4 is equal to "Andy" ) So macros has a complex structure with the logical relations between the filters inside AND \ OR 5) After processing the macro data that we received, export in .TXT file !!!ALL ADDITIONAL INFO AND DATA SAMPLE WILL BE PROVIDED!!!

Skills: Big Data, C# Programming, PHP, Software Architecture, Visual Basic

See more: use of data structure, types of data structure, types of algorithms, types data structure, translate ru, to find translate russian, text comparison algorithms, text algorithms, step up technologies, software development algorithms, process data structure, new data structure, need to translate text from russian, google translate find language, google ru, find translate google, example of algorithms, different types of data structure, different types of algorithms, different types data structure, development of basic algorithms, development of algorithms, data structure with c, data structure types, data structure sort

Project ID: #4028930

9 freelancers are bidding on average $189 for this job

ultrasonicsoft

Hi There, I have very strong hands on C#.NET, WCF, SQL Server, VC++. I could deliver you this project with good quality and unit testing. Thanks!

$225 USD in 20 days
(11 Reviews)
4.3
zeromaxsolution

Please see PMB.

$250 USD in 20 days
(7 Reviews)
4.0
renanreis

Hi, I can help you

$150 USD in 6 days
(15 Reviews)
3.9
pragyaatech

in simpler words you need CopyScape, you need it as desktop app or web app ?

$250 USD in 30 days
(2 Reviews)
3.0
rishijain83

I have executed a similar project for a logistics company in which I had to match customer names and addresses. It required daily updation as in your case. Matching algorithm is certainly tricky, but I have seen it More

$200 USD in 15 days
(0 Reviews)
0.0
OZM60bR3h

We are freelance software developers. If you contact me I can give a quote for your project and we can discuss the details. www.removed by admin

$140 USD in 1 day
(0 Reviews)
0.0
Yr12U7EuS

We are freelance software developers. If you contact me I can give a quote for your project and we can discuss the details. www.<b><i>Removed by Admin</i></b>

$140 USD in 1 day
(0 Reviews)
0.0
ksharpvw

If you have SAS software. I could code it for you.

$200 USD in 2 days
(0 Reviews)
0.0
trinity09

I can develop a optimized solution to fulfill your requirements.

$150 USD in 3 days
(0 Reviews)
0.0