We have a list of potential customers for our online geotagging platform. The list primarily includes online content publishers like news organizations and publishers. We need to make a list of the website URLs for these potiential customers as well as regular expressions (RegEx) for identifying links to articles on these sites. The purpose of doing this is to automate the process of running demonstrations of our service to these potential customers. We will provide a list of the company names and a spreadsheet for entering the URLs and RegEx for each company/publisher. The regex's should be constructed in the Python regular expressions dialect. Given a list of HREFs for each site, the regex should be able to indicate whether or not each HREF is a native article/post of the associated URL. In other words, the regex should not match any links to external sites or articles.
The task also includes finding the appropriate URL for each company which in some cases could require some web searching.
Example: Given the company name "TV2 (Norway)" the task will be to find and enter the url "[login to view URL]" along with a regex /\w+/\d+/ which will match native articles on tv2.no.
7 freelancers are bidding on average €15/hour for this job
Greetings!We are ready to deliver you few samples as per the requirement. Can you please reply on our bid, so we can able to share a sample file with you. Thanks Vishal