Data analyzer + text similarity comparison

Budget $250 - $750 USD
Bids 12
Average Bid $584
Status Closed

Need to create software(script) that will be store, filter and analize data. The number of records in the database can be up to hundreds of thousands of records, you should use the most optimized algorithms and development technologies.(New data will be upload every day)

The process of working has next structure:

.CSV data file -> Database and text comparation analizer-> Processing macros and filters -> output in .TXT format

1 step) Loading data from a .CSV file with a fixed structure in the current database

2) After data uploaded, it should compare one field(text) for each record with another records in !ALL DATABASE for similarity

(!!! This is the most difficult part of this project, it need to compare two text(two records), and return similirity of it in percents)

Example of working you can find at:

[url removed, login to view]

[url removed, login to view] (There is russian language interface(can be translated in Google translate))

After current record was compared with all records in DB, it add info of MAXIMUM percent of similiry and ID of the record that is most similiar to.

So we saved this info for each record in db.

3) One record has next structure:

Field 1;Field 2;Field 3;...;Max percent of simility;ID of most similiar record

4)The ability to create flexible filters (macros) to sort the data (filters (macros) should be able to save)

Macro consists of several filters (fields has different types: date, text, numerical)..

For example

Macro =


Field 1 contains "John"


Field 4 is equal to "address" OR field 4 is equal to "Andy"


So macros has a complex structure with the logical relations between the filters inside AND \ OR

5) After processing the macro data that we received, export in .TXT file


Get Free Quotes For A Project Like This

This project was awarded to


Great Job....Thanks [20 June, 2015] Great Job as always... Thank You... [20 June, 2015] Always does great work.. [04 September, 2015] Great Work agan... Thank You [07 September, 2015] Awesome As Usual... Thank You [09 September, 2015] OUTSTANDING.................. [19 September, 2015] Real Good Work.. Thanks [27 September, 2015] Good Job as Usual.....Thanks [25 October, 2015] Fantastic Job as Always..Thank You [02 November, 2015] Does Great Work. Thanks A Lot.. [28 November, 2015] Once Again He Saves The Day....Thank You... [30 November, 2015] Great Work & Fast. Thank You... [18 December, 2015] Super FreeLancer...Does Great Work...Thank You [05 January, 2016] Great Job as Usual. Thanks [05 January, 2016] Great Work as Usual..Thanks.. [19 February, 2016] Does Great Work.. Thank You [02 March, 2016] Great Job Once Again. Thank You Very Much. [15 March, 2016] Great Work...Thanks [18 March, 2016] Fantastic Work.. Thank You for the Quick Service..
About the Freelancer
sveralex Profile Picture

I possess 10+ years experience in Web and Database Development, including Delphi, PERL, LAMP, CMS, Smarty, JavaScript, AJAX, jQuery, HTML, CSS, XML, JSON. MySQL, Delphi, PHP Certified. I have a proven track record of successful projects in various programming fields, including such CMS as WordPress, OpenCart, Prestashop, Magento. Communication and Feedback's is the most important input from you. So I am available around 16-18 hours a day. I provide 24 hours supports and free revisions.

Looking to make some money?

  • Set your budget and the time frame
  • Outline your proposal
  • Get paid for your work

Bids on this Project

  • ngcomp Profile Picture


    San Jose,  United States

    NGComp is a team of highly skilled people in a) Map Reduce using Hadoop, HBase, CouchDB, MongoDB. b) Cloud Computing across IAAS, PAAS and SAAS layers on Amazon EC2, Xen Platform including. Citrix Xenserver VMWare vSphere, vCloud Director. Auto-Scaling c) Super NOSQL expertise using CouchDB, MongoDB, Redis, MemBase, MemCache, HBase, d) Data Mining/Analytics Data mining using various algorithms and NOSQL databases. e) Virtualization using Xen, KVM, ESXi f) Expert for VMware stack (VSphere, VCloud Director, VSM, Vmware Horizon)

  • eugene2006 Profile Picture


    Dniepropetrovsk,  Ukraine

    15 years of Lotus Domino developing and administration. Expert in MS Office VBA

  • pcman1ac Profile Picture


    Lviv,  Ukraine

    Architect of ERP systems with Data Mining options. Platform - Delphi/Firebird. Website developer/hoster using LAMP/Drupal with SEO full service.

  • rishijain83 Profile Picture


    Bangalore,  India

    -Have got rich experience in executing Analytcis projects from conceptualizing to final delivery. -Have executed projects for various fortune 500 companies across sectors. -Have a strong team to work with me on different projects -Have worked with people across geographies and hence customers can easily get things done!

  • AlosDeveloper Profile Picture


    Kiev,  Ukraine

    ASP.NET, PHP, iPhone/iPad, MSSQL, MySQL, Oracle, C# - 5 years experince, Delphi - 11 years experince. If you select me, you will be 100% satisfied. This is my Guarantee.

  • Eb2THqM14 Profile Picture


    Adana,  Turkey


  • boyet0911 Profile Picture


    Virac,  Philippines

    Experienced application developer using VB6, VB2008, VB2010, VB2013, MS Access, MS SQL, MySQL Databases

  • aegansys Profile Picture


    aegan,  India

    RELIABLE & HONEST Windows developer (C/C++/C#) ... 100% SUCCESS GUARANTEED! details about my past jobs are available here:

  • greggfletcher Profile Picture


    Popovo,  Bulgaria

    5 years of experience in web scraping .

  • spyrosn Profile Picture


    Agia Paraskevi,  Greece

    12 years of developing experience in various languages, including, but not limited to, C#, C++, Java, VB.NET, VB 6, and C. Have been designing and developing multitier applications (both WinForms & ASP.NET) since 2003. Additional experience in database development (SQL Server 7 / 2000 & MySQL). Have participated in most stages of an IT project lifecycle: analysis, design, implementation, testing, as well as user support. Acted as project leader in small (3-4) developer & tester groups.