Adapt a Perl script to extract content from a web site (regex)

In Progress

We have a Perl script that extracts content from a web site using curl and pdftotext.

This script needs to be adapted for each new target.

We would like to have it adapted for another web site.

It uses some loops and some regex to extract the content.
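The actual script is not shown here, so the following is only a rough sketch of the kind of pipeline described above (fetch a page with curl, find links with a regex, loop over them, convert each PDF with pdftotext). It is written in Python for illustration rather than Perl. The pattern `PDF_LINK_RE`, the function names, and the `download.pdf` path are all assumptions, and a real target site would need its own expressions.

```python
import re
import subprocess
from urllib.parse import urljoin

# Hypothetical pattern: captures href values that end in ".pdf".
# Each new target site would need its own expression here.
PDF_LINK_RE = re.compile(r'href="([^"]+\.pdf)"', re.IGNORECASE)


def find_pdf_links(html, base_url):
    """Return absolute URLs for every PDF link found in the page source."""
    return [urljoin(base_url, href) for href in PDF_LINK_RE.findall(html)]


def fetch(url):
    """Download a page with curl (silent, following redirects) as text."""
    result = subprocess.run(["curl", "-sL", url],
                            capture_output=True, check=True)
    return result.stdout.decode("utf-8", errors="replace")


def pdf_to_text(url, pdf_path="download.pdf"):
    """Download one PDF with curl and convert it to text with pdftotext."""
    subprocess.run(["curl", "-sL", "-o", pdf_path, url], check=True)
    # Passing "-" as the output file makes pdftotext write to stdout.
    result = subprocess.run(["pdftotext", pdf_path, "-"],
                            capture_output=True, check=True)
    return result.stdout.decode("utf-8", errors="replace")


def scrape(index_url):
    """Loop over every PDF linked from index_url and collect its text."""
    html = fetch(index_url)
    return {link: pdf_to_text(link)
            for link in find_pdf_links(html, index_url)}
```

In a layout like this, adapting to a new site mostly means changing the regular expression and the starting URL, which may be why the original script has to be reworked for each target.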

We potentially have many similar scripts to implement as next steps if this first one goes well.

Skills: Linux, Perl, Regular Expressions


Project ID: #9567647

Awarded to:

bhatkiran

I am a software engineer with over 5 years of extensive development experience. I am well versed in Perl and can modify the program. Please share the file and the target site with me and I will do what is needed.

$20 USD in 3 days
(2 Reviews)
1.5

6 freelancers are bidding on average $28 for this job

tomkusvw

Hi! My name is Tomasz Kustra, and I am from Poland. I am interested in this project. Can you send your current script and the new webpages, and describe what has to be changed? I am an experienced programmer, you can ...

$30 USD in 2 days
(20 Reviews)
4.7

artgerasimov

Hello! I am an experienced Perl programmer (15+ years). I'll be glad to discuss the details in the chat.

$35 USD in 1 day
(2 Reviews)
3.0

raulbehl

Hello! I've used Perl for a variety of purposes. It would be great if I could help you out as well. I hope you will get in touch. Thanks!

$20 USD in 1 day
(4 Reviews)
2.4
$35 USD in 3 days
(0 Reviews)
0.0
elbruninh

Hi, I'm interested. Could you give me more details, please? What is the new site to scrape? Could you send me the script? Regards, http://elbruninh.sytes.net/

$30 USD in 1 day
(0 Reviews)
1.6