
Bot Needed to Extract Email & Save as CSV File w/Existing Data

$250-750 USD

Cancelled
Posted over 11 years ago

Paid on delivery
Read everything below to fully understand the project, and do not bid until you have read it in full. Personal messages sent along with the bid will be given extra attention. If a requested feature drastically increases the price, state the price with and without it so that I can compare your bid fairly with the others. During the process, it is very important that we stay in contact with one another.

Thanks,
Steve

OUTLINE

I need a program that I can run on Windows to extract email addresses from the URLs in an existing CSV file and save the results into the same file, which contains other data. The CSV has this column structure:

A - URL
B - Email
C - Company
D - Contact
E - Address
F - Phone

EXAMPLE DATABASE

[login to view URL]

FEATURES

- I need these: .com, .co, .net, .biz, .us
- Use a comma as the separator if more than one email is found.
- Multi-threading, with a thread count the user can adjust (1-30).
- Must load the data into a database (e.g. SQLite) for scraping. Sometimes I will use this for 100 URLs and sometimes for 100k URLs, so it is important that results are saved to either the CSV or the DB as they come in, in case of a loss of internet or a PC restart.
- Must be able to read URLs in these formats: http, www, and [login to view URL]
- Scrape email from the page source and also screen scrape (for email that is output with JavaScript). If this increases the price, let me know.

FUNCTION

The program will pull the URL (which I can always make column A), scrape the website for email, and post the results into the Email column (column B). The program needs three scraping modes to help with speed. Do not scrape external URLs or redirects.

1) Slow - Full scan of the entire website (50 URL max).
2) Medium - Scrape only the links found on the initial landing page, and stop scraping after 30 URLs.
3) Fastest - Scrape only these pages: landing page, contact-us, contact, contactus, about, about-us, aboutus, staff. These pages may have extensions (php, jsp, htm, aspx, html, etc.), and case matters, so the following variants must also be checked: Contact-us, Contact, Contactus, About, About-us, Aboutus, Staff, ContactUs, Contact-Us, About-Us. Sometimes the "contact" page is a folder such as [login to view URL] (max 15 domains)

I will use as many threads as I can and run all URLs in Fastest mode. Then, for any domains that still have no email, I will run Slow or Medium (since those take longer).

I want one GUI where I can select the file, watch the progress, and, if possible, specify the URL or time limit for each mode (Slow, Medium, Fastest). If that increases the price, let me know. I may later decide that a time limit is better than a URL limit, and I will want the ability to change this without rewriting the program.

The program will save the results into a new CSV file whose name defaults to the original file name with the word RESULTS added to the end of it. If it cannot default to the original file name, it should call itself [login to view URL]

Since many websites only have contact forms, it would be nice to know this so that I do not keep trying to process those sites. Maybe the program can detect the <form> code and put FORM in column B so that I can skip those sites and keep them for my records.

DEMO

I will want to test this along the way. The demo you provide will need to handle at least 50-100 URLs. It is much harder to get a good idea of performance with a smaller list.

SOURCE CODE

I want the source code once the project has been completed. As long as you are available, I will continue to work with you if changes are needed, but if you cannot be reached, I will need to take it to someone else for help.

SUPPORT

Two weeks of support once the project is finalized. Some emails will be missed, so revisions will be needed.
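For bidders, the core of the FUNCTION section could be sketched roughly as below. This is only an illustration under stated assumptions, not the required design: the function names, the simplified email pattern, the choice to let found emails take priority over the FORM marker, and the underscore in the RESULTS file name are all hypothetical. It covers only email found in page source (not JavaScript output), leaves out the file-extension variants, and does no fetching, threading, or database work.

```python
import re
from pathlib import Path
from urllib.parse import urlsplit

# Simplified pattern (an assumption): it only sees addresses present in the
# raw page source and will miss obfuscated or JavaScript-generated ones.
EMAIL_RE = re.compile(r"[A-Za-z0-9._%+-]+@[A-Za-z0-9.-]+\.[A-Za-z]{2,}")
FORM_RE = re.compile(r"<form\b", re.IGNORECASE)

# Page names and case variants listed for Fastest mode in the posting.
FAST_PAGES = [
    "contact-us", "contact", "contactus", "about", "about-us", "aboutus", "staff",
    "Contact-us", "Contact", "Contactus", "About", "About-us", "Aboutus", "Staff",
    "ContactUs", "Contact-Us", "About-Us",
]


def normalize_url(raw: str) -> str:
    """Prefix a scheme when the CSV cell lacks one (e.g. 'www.example.com')."""
    url = raw.strip()
    if not url.lower().startswith(("http://", "https://")):
        url = "http://" + url
    return url


def fastest_mode_urls(url: str) -> list[str]:
    """Landing page first, then each candidate page under the site root."""
    landing = normalize_url(url)
    parts = urlsplit(landing)
    root = f"{parts.scheme}://{parts.netloc}/"
    return [landing] + [root + page for page in FAST_PAGES]


def extract_emails(html: str) -> list[str]:
    """Unique addresses in first-seen order, lowercased for de-duplication."""
    found = (match.lower() for match in EMAIL_RE.findall(html))
    return list(dict.fromkeys(found))


def column_b_value(html: str) -> str:
    """Comma-joined emails, else 'FORM' if a <form> tag exists, else empty."""
    emails = extract_emails(html)
    if emails:
        return ",".join(emails)
    return "FORM" if FORM_RE.search(html) else ""


def results_path(csv_path: str) -> Path:
    """Original file name with RESULTS appended (underscore is an assumption)."""
    path = Path(csv_path)
    return path.with_name(f"{path.stem}_RESULTS{path.suffix}")
```

For example, `column_b_value` would be called on the downloaded HTML of each URL returned by `fastest_mode_urls`, and the result written to column B of the file named by `results_path`.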
Project ID: 2336622

About the project

8 proposals
Remote project
Active 12 yrs ago

8 freelancers are bidding on average $500 USD for this job
We can help in your project, please check PMB and our ratings/reviews to get idea of our experience.
$250 USD in 7 days
4.8 (239 reviews)
7.8
....................
$250 USD in 0 day
5.0 (87 reviews)
6.5
HI, Kindly see details in PMB Thank you
$550 USD in 4 days
5.0 (39 reviews)
6.1
If you are looking for an expert - I am the person for the job. Please check your PMB.
$650 USD in 7 days
5.0 (13 reviews)
5.0
Please see PMB.
$700 USD in 10 days
5.0 (11 reviews)
4.8
Please check your private messages.
$600 USD in 7 days
5.0 (1 review)
2.0
Custom software development - Removed by Admin
$750 USD in 1 day
0.0 (0 reviews)
0.0
Please check the PMB
$250 USD in 1 day
0.0 (0 reviews)
0.0

About the client

Lexington, United States
5.0
45
Payment method verified
Member since Apr 6, 2011

Freelancer ® is a registered Trademark of Freelancer Technology Pty Limited (ACN 142 189 759)
Copyright © 2024 Freelancer Technology Pty Limited (ACN 142 189 759)