I need a tool that will extract contact information for job posters from [url removed, login to view], [url removed, login to view], and HotJobs.com. Specifically, contact information means name, email, fax, and possibly address.
These sites do not present this information in a structured format, so when you PM me, tell me what approach you are going to take. If you can't describe your approach, don't bother PM'ing.
I don't intend to scan all the resumes on the sites (there are millions), so I need a suggestion on how to narrow this down (for example, search on an attribute and store the contact information from all of the search results).
I also need a mechanism to make sure duplicate contact information is not stored.
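To illustrate the kind of duplicate check I have in mind, here is a minimal sketch. It is in Python purely for illustration, and the names Contact, dedup_key, and ContactStore are hypothetical. The matching rule (normalized email first, otherwise normalized name plus fax digits) is an assumption, not a requirement, so propose something better if you have it.

```python
import re
from dataclasses import dataclass


@dataclass(frozen=True)
class Contact:
    # Hypothetical record layout; the fields mirror the ones listed in this post.
    name: str
    email: str = ""
    fax: str = ""
    address: str = ""


def dedup_key(contact):
    """Build a normalized key so trivially different copies compare equal."""
    email = contact.email.strip().lower()
    if email:
        # Assumption: the same email address means the same contact.
        return ("email", email)
    # Fallback when no email is present: collapsed lowercase name plus fax digits.
    name = " ".join(contact.name.lower().split())
    fax_digits = re.sub(r"\D", "", contact.fax)
    return ("name_fax", name, fax_digits)


class ContactStore:
    """In-memory store that refuses contacts whose key was already seen."""

    def __init__(self):
        self._by_key = {}

    def add(self, contact):
        key = dedup_key(contact)
        if key in self._by_key:
            return False  # duplicate, not stored
        self._by_key[key] = contact
        return True

    def __len__(self):
        return len(self._by_key)
```

In a real deployment the same key could back a unique index in whatever database you choose, so duplicates are also rejected across separate runs.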
I am not bothered about the technology, which is why I selected all the Job Types. However, deployment of the application must be very easy, so state which technology you will use.