Closed

JAVA utility class - search for human names in Polish texts

This project received 16 bids from talented freelancers with an average bid price of $367 USD.

Get free quotes for a project like this
Employer working
Skills Required
Project Budget
N/A
Total Bids
16
Project Description

Goal is to create a utility class in Java with one public method:

public List findNames(String text);

where

NamePl is class representing information about human name and has these properties:

private String firstname;
private String middlename; // optional
private String surname;
private Gender gender;
private int[] positions; // where was the name found (all positions)

Gender is enum {MALE, FEMALE, UNKNOWN}
Position is index of the first character in original text, where the name was found.

The method should be as reliable and as fast as possible and should return names in its basic form. It can use some form of a dictionary if it needs to. It will be used for batch processing of a large amount of data (news articles). It must be thread-safe (multiple threads will call this method in parallel). Method shall return empty List if nothing is found.

Successful implementation must pass real-world test on sample corpus.

Knowledge of NLP and Polish language would be quite an advantage.

Looking to make some money?

  • Set your budget and the timeframe
  • Outline your proposal
  • Get paid for your work

Hire Freelancers who also bid on this project

    • Forbes
    • The New York Times
    • Time
    • Wall Street Journal
    • Times Online