Regex for international dates and company entity types

  • Status: Open
  • Prize: $900
  • Entries Received: 28

Contest Brief

I stress that this project involves as much RESEARCH as code-writing. The different formatting used internationally is vital to get right first; the coding that follows is easy.

This project is to write a script containing a series of regex scans that will extract metadata from plain documents. The purpose is to scan a big-data archive of document and retrieve specific meta data such as dates and company name.

Script execution speed is essential. Ideally the script is written in Perl, but python or php is okay.

This must work with international meta data! Please do not expect your entry to win without this prerequisite. The plain text source will be in UTF8 to cope with international characters.

The script can return in any practical format, as long as the format can be imported:-
E.g. Json, Serialized array

The document describes script sections - which means create one include file that can be included in a different project, and demonstrate how to call the functions. So there would be at least 4 main functions.

To make good use of your time, I'd suggest you first research the international dates and international company types, and send me privately a document. Obviously don't include what is already in Wiki. I can then give you feedback on whether that is comprehensive. Once you have that feedback it becomes worthwhile to code.

Good luck!

Recommended Skills

Top entries from this contest

View More Entries

Public Clarification Board

  • kabapy
    kabapy
    • 1 week ago

    #51 multi threaded python script with JSON output, please review the screenshots and contact me on the chat to discuss how to send you the script privately

    • 1 week ago
  • kabapy
    kabapy
    • 1 week ago

    Please contact me on the chat to discuss how to send you the script privately

    • 1 week ago
  • kabapy
    kabapy
    • 1 month ago

    #46 the script is ready for you to review

    • 1 month ago
  • kabapy
    kabapy
    • 1 month ago

    #46 the results of the 4 tasks in 4 excel files. please contact me on the chat if you have any comments or modifications

    • 1 month ago
  • kabapy
    kabapy
    • 1 month ago

    #46 Finished The 4 Tasks, Please Review

    • 1 month ago
  • NabeelShaikhh
    NabeelShaikhh
    • 1 month ago

    Hi Sir Please let me know
    You need on Web or Local ?

    • 1 month ago
    1. sunnyguptahotels
      Contest Holder
      • 1 month ago

      Local

      • 1 month ago
  • kabapy
    kabapy
    • 1 month ago

    entry #43

    • 1 month ago
  • naveendurai
    naveendurai
    • 1 month ago

    Can you check my entry #42 ? See whether I am going in right direction.

    • 1 month ago
    1. sunnyguptahotels
      Contest Holder
      • 1 month ago

      Please never participate in my contests.

      • 1 month ago
    2. naveendurai
      naveendurai
      • 1 month ago

      Why? What did I do ?

      • 1 month ago
  • HDevCrea
    HDevCrea
    • 1 month ago

    We can't post our entries if the contest is not sealed. Everyone will see it.
    #sealed

    • 1 month ago
    1. HDevCrea
      HDevCrea
      • 1 month ago

      I already did. It was entry #26.

      • 1 month ago
    2. sunnyguptahotels
      Contest Holder
      • 1 month ago

      Somehow it has gone. Can you skype me - sunnygupta1000

      • 1 month ago
  • StromlightTech
    StromlightTech
    • 2 months ago

    Entries are no way related to contest.. lol

    • 2 months ago
    1. sunnyguptahotels
      Contest Holder
      • 1 month ago

      You might prefer this project. no one has even come close.

      • 1 month ago
  • HDevCrea
    HDevCrea
    • 1 month ago

    Please check Entry #26 so I can send you the first script.

    • 1 month ago
  • ikobir
    ikobir
    • 1 month ago

    Sir, if you need any change tell me. thanks

    • 1 month ago
  • ikobir
    ikobir
    • 1 month ago

    Sir,Kindly check my entry#23,#24,#25. and if you need any change tell me. thanks

    • 1 month ago
  • sunnyguptahotels
    Contest Holder
    • 1 month ago

    Unfortunately I can't work with Node\JS\Java - its not the language but the coding support that I can't do.

    • 1 month ago
    1. KishuPro
      KishuPro
      • 1 month ago

      Hmm I understand, thanks for the response.

      • 1 month ago
  • sunnyguptahotels
    Contest Holder
    • 1 month ago

    Has anyone started or should I invite others?

    • 1 month ago
    1. KishuPro
      KishuPro
      • 1 month ago

      "Script execution speed is essential" - Do you think Node/JS/Typescript or Java would be too slow/out of scope for your purposes?

      • 1 month ago
  • ethmain
    ethmain
    • 2 months ago

    Hello, what do I do when I am done? I don't want to put it in a contest because people can take my work and do more with it..

    • 2 months ago
    1. ethmain
      ethmain
      • 2 months ago

      is it possible that you attach a sample of the business document? so we can have a better idea on what you will need?

      • 2 months ago
    2. ArnabGuchait
      ArnabGuchait
      • 2 months ago

      Can we place some type of watermark/authentication proof/set a final (non-editable) copy of our work ?

      • 2 months ago
  • sssalim018152347
    sssalim018152347
    • 2 months ago

    please check my entry #14

    • 2 months ago
    1. sunnyguptahotels
      Contest Holder
      • 2 months ago

      How is that possibily an entry. My suggestion is not to waste your time.

      • 2 months ago
  • sunnyguptahotels
    Contest Holder
    • 2 months ago

    Please do not spam my contests!

    • 2 months ago
  • RajakScripts
    RajakScripts
    • 2 months ago

    Now, I assume you actually want the regex to process any text content on a document, NOT the literal metadata from a file (of a document) itself, correct?

    • 2 months ago
  • RajakScripts
    RajakScripts
    • 2 months ago

    Hi, could you please attach some docs to firstly reveal its metadata so I will have a better picture before doing the regex?

    • 2 months ago
    1. sunnyguptahotels
      Contest Holder
      • 2 months ago

      Just look for any business contracts. E.g. https://www.printablecontracts.com/General_Contracting.php

      • 2 months ago

Show more comments

How to get started with contests

  • Post your contest

    Post Your Contest Quick and easy

  • Get tons of entries

    Get Tons of Entries From around the world

  • Award the best entry

    Award the best entry Download the files - Easy!

Post a Contest Now or Join us Today!