Script identifier for Unicode strings in Perl - Linux

I want to build a Perl script that identifies the "script" a particular UTF-8 string is written in. For example, given the strings:

دجنبر --> Arabic

децембар --> Cyrillic

João --> Latin

נאָװעמבער --> Hebrew

กันยายน --> Thai

цембJair --> Mixed

The script should look at the "Script" property of each character and check whether they all belong to the same script; if so, it should report that script's name. If the string mixes two or more scripts, it should return "Mixed".
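A minimal sketch of the per-character check described above, assuming a reasonably recent Perl with the core module Unicode::UCD. The function name script_of, the "Unknown" fallback, and the choice to skip characters whose script is Common or Inherited (spaces, digits, punctuation, combining marks) are illustrative assumptions, not requirements stated in the post.

```perl
#!/usr/bin/perl
use strict;
use warnings;
use utf8;                                  # string literals below are UTF-8
use Unicode::UCD qw(charscript);
binmode STDOUT, ':encoding(UTF-8)';

# Return the single script used by $text, "Mixed" if more than one script
# appears, or "Unknown" if no character carries a specific script.
# (script_of and "Unknown" are hypothetical names chosen for this sketch.)
sub script_of {
    my ($text) = @_;
    my %seen;
    for my $char (split //, $text) {
        # Assumption: Common and Inherited characters are shared between
        # scripts, so they should not make a string count as mixed.
        next if $char =~ /[\p{Script=Common}\p{Script=Inherited}]/;
        my $script = charscript(ord $char);
        next unless defined $script;       # no script data for this code point
        $seen{$script} = 1;
    }
    my @scripts = keys %seen;
    return 'Unknown' unless @scripts;
    return @scripts == 1 ? $scripts[0] : 'Mixed';
}

for my $word ('دجنبر', 'децембар', 'João', 'กันยายน', 'цембJair') {
    printf "%s --> %s\n", $word, script_of($word);
}
```

The input is assumed to be already decoded into Perl characters; a string read from a file or from STDIN would first need decoding from UTF-8, for example with an ':encoding(UTF-8)' layer on the input handle.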

The best way to get there would be to use the program "uniname", echoing the string into it:

echo กันยายน | uniname -b -g -c -e -r -u -n

and then process the output:









Basic Latin

The processing has to eliminate the first line (a header) and the last line (which corresponds to the LINE FEED at the end of the word). If all characters belong to the same range, report that range. If not, return the word "Mixed".
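The filtering step above might be sketched as follows. The exact layout of the uniname output is not shown in the post, so this sketch assumes that every line between the header and the final LINE FEED entry holds nothing but a range name (as the surviving "Basic Latin" line suggests). The function name range_of, the "Unknown" fallback, and the sample list are hypothetical.

```perl
use strict;
use warnings;

# Take the lines printed by uniname and return the one range they share,
# or "Mixed" if they differ.
# Assumption: each line between the header and the trailing LINE FEED
# entry contains only a range name.
sub range_of {
    my @lines = @_;
    shift @lines;                    # drop the header line
    pop @lines;                      # drop the entry for the trailing LINE FEED
    my %seen;
    for my $line (@lines) {
        $line =~ s/^\s+|\s+$//g;     # trim surrounding whitespace and newline
        next if $line eq '';         # ignore blank lines
        $seen{$line} = 1;
    }
    my @ranges = keys %seen;
    return 'Unknown' unless @ranges;
    return @ranges == 1 ? $ranges[0] : 'Mixed';
}

# Hand-written sample for illustration, not captured uniname output.
my @sample = ('(header)', 'Thai', 'Thai', 'Thai', 'Basic Latin');
print range_of(@sample), "\n";
```

One caveat with this approach: a range is a block of code points, not a script. The word "João", for instance, combines letters from the Basic Latin range with "ã" from the Latin-1 Supplement range, so comparing ranges alone would likely report it as "Mixed" even though every letter is Latin.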

The program is available from here:

[url removed, login to view]

Paulo Ney

Skills: Perl


About the Employer:
( 63 reviews ) Oakland, United States

Project ID: #5019738

10 freelancers are bidding on average $55 for this job


Interesting problem :) I'd just like to work it out. I suppose the uniutils you're referring to are available on the platform where you want to run the code, so this project can use them as they are? Thank you.

$30 USD in 3 days
(11 Reviews)

Easy enough if using the 'uniname' program. At most a few dozen lines of Perl. Best regards, Radu

$15 USD in 2 days
(2 Reviews)

Hi, I am a full-time developer of Perl and CGI, and I work with most platforms (Linux, Solaris and Windows) and would like to help with your task. You can check my existing ratings on Freelancer and also see that...

$20 USD in 1 day
(5 Reviews)

Hello, so basically you need to feed each line to the uniname tool and compare its output lines, except the first one and the last one, and then print the result, or 'Mixed' if there are characters from different scripts...

$30 USD in 2 days
(3 Reviews)

Can help... I am an Expert... Please check the past projects I have handled and check my reviews for what employers have to say about my work... Can start right now...

$300 USD in 7 days
(1 Review)

Hi, I have implemented this Perl script according to your description. I can give you the file right now when you award this job to me. Thanks!

$30 USD in 1 day
(3 Reviews)

Hello, my skills: developer in Perl and PHP, administrator of Linux systems and databases. I can do your job quickly. You can have this by Sunday.

$50 USD in 1 day
(2 Reviews)

New on [url removed, login to view], but 15 years of Perl programming in IT areas. Very simple program, I'd be glad to complete this task.

$25 USD in 1 day
(0 Reviews)

I can do this within the specified time using Perl. I have 2 years of experience processing files for a Cisco client in my regular job.

$30 USD in 6 days
(0 Reviews)

I am basically a Linux admin. I love to do some coding in Perl. I have experience in QA. I have some questions: 1) How do you intend to furnish your output (e.g. file, string of multiple words, or string of s...

$20 USD in 3 days
(0 Reviews)