JAVA utility class - search for human names in Polish texts

  • Status Closed
  • Budget N/A
  • Total Bids 16

Project Description

Goal is to create a utility class in Java with one public method:

public List findNames(String text);


NamePl is class representing information about human name and has these properties:

private String firstname;

private String middlename; // optional

private String surname;

private Gender gender;

private int[] positions; // where was the name found (all positions)

Gender is enum {MALE, FEMALE, UNKNOWN}

Position is index of the first character in original text, where the name was found.

The method should be as reliable and as fast as possible and should return names in its basic form. It can use some form of a dictionary if it needs to. It will be used for batch processing of a large amount of data (news articles). It must be thread-safe (multiple threads will call this method in parallel). Method shall return empty List if nothing is found.

Successful implementation must pass real-world test on sample corpus.

Knowledge of NLP and Polish language would be quite an advantage.

Get free quotes for a project like this
Skills Required

Looking to make some money?

  • Set your budget and the timeframe
  • Outline your proposal
  • Get paid for your work

Hire Freelancers who also bid on this project

    • Forbes
    • The New York Times
    • Time
    • Wall Street Journal
    • Times Online