online
Online now
Hire Me!
Rate: $44.00 USD/hour
Follow Invite to Project
 

lenzai

Webscraping expert. Call me for data extraction, data mining , web robots.

Username: lenzai

  • Has made a deposit.
  • Has verified their email address.
  • Has completed their profile.
  • Has verified their secure phone number.
  • Verified
  • Payment is verified.

Location: TRESBOEUF, France

Member since: October 2010

Reputation:

5.0/5

(21 reviews)

4.6
[see more]

No user has recommended this freelancer.

My projects:

  • $50.00 USD
    5.0
    Profile image for Seller cuteprince85

    cuteprince85

    Sep 15, 2012

    Very nice communication and training skills, deep knowledge of software design, I ll use lenzai when I need coaching for architecture and methodology.

    Project Description:I'm looking for PHPunit & GIT expert to provide training over skype,. Feel free ask question by pm! No time wasters please !
    [more]
  • $385.00 USD
    5.0
    Profile image for Seller locafroid

    locafroid

    Sep 9, 2011

    Someone who really knows what they are doing. Hard to find on freelancer. Excellent work thankyou

    Project Description:Robot for posting on sales website products from us
    [more]
  • $175.00 USD
    5.0
    Profile image for Seller appzman

    appzman

    Apr 20, 2011

    Was great working with this coder! He was very professional to deal with, and delivered very clean and efficient code. Highly recommended.

    Project Description:[Project Description hidden]
    [more]
  • $7.00 USD
    5.0
    Profile image for Seller chenal

    chenal

    Mar 17, 2011

    Great advice -- just what we needed at this stage.

    Project Description:This project is to help us decrease load times for <http://www.kleargear.com> You will analyze the content delivered -- including the efficient use HTML -- and provide specific recommendations (CSS, JS, HTML, multimedia etc.) to maximize load speed...
    [more]
  • $99.00 USD
    5.0
    Profile image for Seller vw7468940vw

    vw7468940vw

    Feb 24, 2011

    Rating: 5.0/5.0

    Project Description:[Project Description hidden]
    [more]
  • $114.75 USD
    5.0
    Profile image for Seller vw7468940vw

    vw7468940vw

    Feb 24, 2011

    We are very happy with the product, work, and communication of lenzai. His application does exactly what we need it to do. Code was audited by in-house QOS and passed flawlessly. I would recommend lenzai to associates and will be awarding future work. Thank you.

    Project Description:Using the SMF Add-in for Excel 2003 or similar automated tool. ? Scrape the website premium.econoday.com for all economic events back to the? beginning? of the websites records (sometime in 1999-2000)...
    [more]
  • $45.00 USD
    5.0
    Profile image for Seller appzman

    appzman

    Feb 21, 2011

    Rating: 5.0/5.0

    Project Description:[Project Description hidden]
    [more]
  • $76.50 USD
    5.0
    Profile image for Seller appzman

    appzman

    Feb 21, 2011

    Very responsive, and work was complete faster than expected. He was able to creatively find a solution and effectively build the script. Highly recommended!

    Project Description:I'm looking for a coder to write a PHP script that acquires a list of all apps in the Android Market. Does not need to parse apps, just return a list in the format: com.zedd.game.goround com.coding.Superflightcontrol...
    [more]
  • $200.00 AUD
    5.0
    Profile image for Seller henrysample

    henrysample

    Dec 30, 2010

    Lenzai did exactly what he said he would. Great service. I plan on using him again. He also took the initiative and is one smart fellow.

    Project Description:I require a Freelancer to copy and paste data from a website into an Excel spreadsheet. The website relates to the performance of different sharemarket investment recommendations by different people (I simply wish to find out which picks performed the best)...
    [more]
  • $63.75 USD
    5.0
    Profile image for Seller amazonphp

    amazonphp

    Dec 14, 2010

    Good job. First delivery was final and ready to run on production server.

    Project Description:[Project Description hidden]
    [more]
    lenzai has not completed any projects.
  • $40 USD/hr In Progress

    Small bit of work on data retrieval

  • $500 USD In Progress

    We need a price comparison site designing and building. The wesbite will compare ticket prices from 3 major websites.The website will need to collect data in real-time and not stored aggregated data.It will compare ticket prices collected from data from 3 websites. The customer will have the abiliity to click on the ticket to purchase, which will then direct them to the correct website... at the correct page, so all they need to do is check details and pay (our website will not be taking payments at all so please do not price for that). It must not take them to the homepage and have them go through the whole process again, IT MUST TAKE THEM TO THE CORRECT PAGE OF THE CORRECT WEBSITE, FOR THE COORECT TICKET.It will need the ability to track purchases so we have an exact record of how many people we have succesfully tranfered to purchase and the amount who have actually purchased.Google - Once active, the website must list top on google for its sector and sub-sectors.Payment as follows:50% will be paid once the website is online and functioning correctly.25% will be paid once listed at top of google as per above.25% after 3 months correct operationOr if you would just like to bid for the scraping part the following payment plan is in-place:50% when published and after 1 -2 weeks testing50% after 4 weeks of correct funtioning.PLEASE NOTE THE INTERIM PAYMENT SCHEDULE ABOVE IS VERY MUCH ONLY PART OF THE BIGGER PICTURE AS WE SEE THIS AS AN ONGOING PROJECT, WHICH WILL BE FAVOURED TOWARDS THE ORIGINAL SCRAPE DESIGNER.We will need to own the rights to the source code.Any Questions Please ask,Many Thanks, SGI

    [more]
  • $1100 USD In Progress

    We are looking for a historical database of all applications ever to appear on android market including price and ranking histories.

  • $1500 USD In Progress

    I need a php script that reads URLs (e.g. www.ibm.de) from a MySQL Database and alternatively via a simple php function call that receives a single URL as well as the country code of the website-operator and then 1.) fetches the website2.) searches for the occurance of a link (a href and onclick=location.href) to the contact page / imprint / about us page (list of words/strings to search for should be defined in an array in the config file for the script including priority what means if a word with higher priority will be found, the Script will search on this page first, if the Script finds contact data there it stops searching for that domain). Priority can be done by order of array values. 3.) Extract contact data from the site and save it in the suitable database fields like Company Name, Street, postcode, city, phone, Fax, Mail, CEO, tax id, company registration id in case the URL was read from DB or return tha data as Array in case the URL came from a function call. To make extraction easier I provide a Country Code for each url. Based on this code there have to be rules how e.g. adresses might be formated (Syntax). In this project only German (AT, DE, CH-country codes), UK and US adresses have to be extracted. Each country code should have a OEM configuration file, so that the script could be adopted and extended in the future. If no Country Code is given or there is no cnfiguration for the given Country Code, the Script should Report an Error. If data could not be extracted, it should save the URL to a table for further optimization of the script trough configuration or future development. The script should extract at least 85% of the data in a random selection of 500 URLs for each of the 5 country codes. Script should be documented very well and I need a written explantation for the configuration files.Finaly I need a second script to compare the perfoemance of two different configuration files for the script above. This script should take URLs and Country Codes from an own MySQL table and run the extraction for both configuration files. It should write the extracted data to two different tabels and compare how many data objects could be extracted (e.g. counting phone numbers in table a and b and so on) and should show database rows with different values in a table view highlighting the differences. This script is just to optimize configuration files and needs just basic GUI Design.

    [more]
  • $5222 USD Today

    I need someone who can visit merchant websites and compile a list of brands that are sold at each retailer. The total amount of merchants we have is 47 and the list is attached.This can be scraped or it can be hand-researched. Please reply in your proposal with the method that you"ll be using.Please note that not every merchant will have a list of brands that are sold. For example, Guess will only sell Guess brand. It is your job to pick the stores from the list that sell a variety of brands and come up with the list.Also in your proposal, please reply with a total price and an estimated timeline for completion.Finally, data must be returned in an excel spreadsheet.

    [more]
  • £4777 GBP Today

    This is the main fund raising efforts of a project that I am doing to raise money to plant some trees in Africa. I am looking for a coder to write the script. The tree has been drawn and will be sent along as artworkIf at all possible, I would like this written so that it can sit on a wordpress page as part of my website - www.treeplantingholidays.comThe project needs to be mysql database driven with integration to paypal.I would like a box somewhere on the screen - possibly bottom left hand corner and large enough to be easily read so that I can put instructions on how to pay and so forth. Text only. To start with there will be ten trees on the page - in a circle- with one large one at the front taking up about 1/3 of the page or more if that does not stand out enough- all in black.There are 1000 leaves per tree, 20 twigs, 10 branches and 1 trunk. The idea is that - leaves, twigs, branches and trunk all cost different prices.And the person comes to the front page and can pick what they want to sponsor from the tree at the front. I also want them to be able to choose one of the other trees if they want to.The idea is that they come on choose to sponsor a twig, click on the twig that they want to sponsor and are taken to paypal to pay for it. X + Y coordinates are remembered. Once they have paid, they need to be taken to a registration screen, where they are then able to put in their details - email address, name they want displayed on the twig, and what they want displayed. I want to give them the option to have the leaf/twig/branch/ trunk with a clickable link once they have paid.I also want to collect their data in the database.They are able to upload a graphic if they have bought a trunk or a branch,leaf or a twig. Image size preferred info must be supplied so that they know what size image to upload.Once they have paid, the item that they have paid for - so a leaf or a twig, changes to the colour it was before so green or brown. This will tell new visitors that those items have been bought, and also gives the visitor the opportunity to hover over the paid item, and see who has bought it…The leaf or branch expands when you hover over it to display the image. Once the tree at the front of the page has filled up more than 90%, it is rotated to the left, and a new tree is put in place. I also want an FAQ page and a who has paid page. If this is not able to be put on wordpress, I would like those two pages added. If at all possible, when someone enters the information that they would like displayed, it also gets added to the page which will list participants as follows:TRUNKS-John Smilth of www.johnsmith.com --Jane Doe of www.janedoe.cometc-BRANCHES-John Brown of www.johnbrown.com-Jane Smith of www.janesmith.comTWIGS Juliet Green of www.julietgreen.comJonathan Porridge of www.jopo.comLEAVES etc etc etc I also would like a page to display “Our Sponsors” Logos and links to their sites or a section under the trees.Also, needs it to be mobile friendly - as responsive as it can be. If you have any questions at all, please do not hesitate to ask.

    [more]
  • $35 USD Today

    BUDGET $30Have a data base of 14 million usa businesses that need to be separated in smaller partsand then uploaded on a wordpress directory web siteNeed to separate only 100 categories for now Example AC RepairASID Interior Decorators & DesignersAdjustable BedsAir Cleaning & Purifying Equipment DealersAlarm ServicesAlarm Systems DealersAnimal SheltersAnimal TrappersAntique DealersAppliance Parts & Supplies DealersAppliance RepairAppliance RepairAppliancesAquariums & Supplies RetailArboristsAutomatic GatesBaby Furniture RetailBabysittingBamboo, Rattan, & Wicker Home FurnishingsBar StoolsBarbecue Equipment & Supplies RetailBathtubs & Sinks Repair & RefinishingBathtubs RetailBedding RetailBedroom FurnitureBeds RetailBee Control & Removal ServicesBirds & SuppliesBlinds Installation, Cleaning, & RepairBlinds Retail & CustomBurglar Alarms Installation, Service, & RepairCable TVCanopiesCarpetCarpet CleaningCarpet InstallationCarpet, Rug, & Upholstery ServicesCarpet, Rug, & Upholstery Storage & RepairCarpets & Rugs Wholesale & ManufacturersCeiling FansChildren"s Furniture StoresChimney CleaningCleaningCleaning Equipment & Supplies RetailClosed Circuit SystemsContemporary & Modern Furniture StoresCurtainsDecorative & Specialty ConcreteDinette SetsDishwasher Sales & ServiceDog & Cat Supplies & ServicesDog TrainingDomestic ServicesDoor & Gate Operating Devices RepairDrain CleaningDrain ServicesDrapery & Curtain Fabrics RetailDuct CleaningElectric CompaniesFencingFire Alarm Sales & ServiceFire Alarms Service & RepairFire ExtinguisherFireplace & Chimney Building & RepairFireplace AccessoriesFirewoodFlowers WholesaleFurnacesFurnitureFurniture CleaningFurniture Dealers" ShowroomsFurniture Designers & Custom BuildersFurniture Rental & LeasingFurniture Repair

    [more]
  • $222 USD Today

    I need the underlying information from the subsites following webppages:http://zorgverleners.agisweb.nl/index/17/gelderland http://zorgverleners.agisweb.nl/index/17/overijsselhttp://zorgverleners.agisweb.nl/index/17/noord-brabantI need the names of the contact plus the above information (if provided) for all the contacts under the upper three links. And I want to be able to check it out periodically to be able to keep the information up to date. See attached document for more information.Perhaps the following link helps for an easier way to collect/crawl thise data? http://213.132.176.190/VGZ/zoek2.asp Would like to discuss the options.

    [more]
  • $455 USD Yesterday

    This will can grow, but want to start with a smaller budget and see the results. Objective is to scrape various grocery sites to collect product catalog detail such as product description information including brand, size, price, UPC, images, etc.Project / Role is to develop and maintain (as needed) screen scraping programs. Our existing scrapes are implemented using the screen-scraper.com framework, which is customizable via a Java-like programming language. You are free to use the same platform and iterate on it or start from scratch using any programming language or platform you like. We can provide files of related existing scrapes if that is helpful built using screen-scraper.com framework.Some libraries we might suggest are ones which emulate a Web browser: PHP - https://github.com/fabpot/gouttePython - http://doc.scrapy.org/en/latest/intro/tutorial.html Any work you do on the scrapers, be it on our current screenscraper.com system or a new piece of code you decide to write, we would like to have as part of the arrangement. There are a number of data points to be captured, shown on subsequent pages and in the sample .csv output file.  All scraped data points will be expected to meet certain criteria, for example if we ask you to capture a URL, you must supply us with a valid URL or an empty field if a valid URL could not be captured. The important thing is we don"t want the output from the scraper program to be full of useless data. Required Skills:* Understanding of HTTP concepts including cookies, sessions and the like* Ability to write "scraper" software to crawl Web pages and extract specified dataThe following pages provide an example of the next scrape we are looking to build, and where to pull the data.

    [more]
  • $30 USD 2 days ago

    I need someone who has access to premium accounts on company lead services such as ZoomInfo, Data.com, Spokeo, Lead411, InsideView or more and can get me the contact information for employees that includes NAME, PHONE, EMAIL and ADDRESS for a COMPETITIVE price.I need someone with experience in list building and lead generation with access to these sites to gather all or as many names from SPECIFIC COMPANIES in particular locations (mostly the Philadelphia, southern New Jersey and Delaware area). Datamining and web scraping skills are a great plus. Other methods of acquiring leads are also encouraged so long as the records are accurate, complete, and include most of the company.If you have this skills write me a proposal with the following: -Lead services you have access to (i.e. Zoominfo, Data.com, etc.) -Your experience with lead generation and link building -If you have skills in datamining or webscraping or other methods to gather leads -Your COMPETITIVE RATE for 0-100 contacts, 100-250 contacts, 250-500 contacts, and 500+ contacts (FIXED PRICE)I am looking for a long term relationship. If you do a good job and provide a competitive rate you can have the opportunity to work with my entire company and partners over a long term period.Feel free to ask me any questions.

    [more]
  • $277 USD 2 days ago

    I"m looking to build a web crawler that can handle millions of pages, and a database structure that can support a large number of entries. Right now, I only want 3 simple things1. My needs for the crawler are very basic - I want to index content in the body section of web pages, and basic meta information like page title and description. While that"s a really simple task, I"m more concerned with the size of the database and making sure that it doesn"t get so large that it becomes slow. How would you structure a database for something that needs to handle storing millions of pages?2. For the crawler, like I mentioned, I"m indexing entire pages, so this should be very quick. I do have a large number of sites that I want to crawl though (20,000+). Preferred language and framework are Python + Scrapy but if you have experience with another language (like Java) for large scale crawling, I am open to considering other things. I want to scrape anything between HTML body tags and basic meta information, and store the time and date that the page was crawled. No other specifications at this time. The question here is, how long would it take for you to build a crawler that can handle crawling and scraping a large number of web pages?3. I"m thinking in a different direction than I was before, and want to handle any parsing or searching for specific information through code that is separate from the crawler. I want to build an API so specific information is standardized and can be used by other websites. Do you have any experience in this area?Right now, I want to get an idea of how long you see this taking, how you would handle a large dataset, and when you would be available to work on this. Looking only for a detailed estimate here, no code, no other work right now which is why the rate is so low.If I invited you to bid, I"m considering your listed hourly rate, not the price on this project.

    [more]
  • $555 AUD 2 days ago

    Stagecoach Software Pty Ltd (trading as RateMyAgent) has fully functioning ruby Nokogiri scrapers populating a MySQL database from certain websites. The scrapers are based on a VPS and run on daily cron jobs. The scrapers generate thousands of rows of data per day in two key tables plus further rows in related tables. The database is designed to function as a backend to a forthcoming website.This project requires a freelancer to adapt the existing scraper code to operate on a seventh website operating in the same industry and displaying the same type of data.The existing code is available on github. A sample development database is available on Heroku. These may be inspected by approved bidders before they accept the project but development is to take place on another development database on the VPS. All code must be posted to and deployed through github. The new scarper must follow and use existing code as much as practicable.Sundry other minor changes to the code and database may be required as the project progresses to help achieve the corporate objective of providing a useful and functioning website. The freelancer must ensure that the current production database and existing scrapers continue to operate without interference throughout the project. Fixing bugs identified within a week after production are to be included in the price.Further written instructions including identification of the site to be scraped will be provided to approved bidders. Bidders are free to revise their bids upon studying the instructions, the code or the existing database.This project follows freelancer Project ID’s 5477803, 5129765, 5066285, 4975478, 5799583, 6103569 and 6195613. Further similar scraping projects for other sites will be offered to the successful bidder if this project is completed successfully. Bidders without demonstrable experience in Ruby, MySQL and Web Scraping will be rejected. Bids placed within minutes of the project being posted will also be rejected as indicative of no thought being applied to the project requirements.

    [more]
  • £311 GBP 2 days ago

    I will regularly have documents and receipts that I need to be scanned and then forwarded to my P.A..You must be local to Broadstairs in Kent to either collect the paperwork or to have it dropped off to you.You will need to have a background check and preferably be a more mature person

    [more]
  • $944 USD 2 days ago

    I have a client that has a searchable database application online that displays aircraft components they are qualified to produce. The database includes several thousand part numbers and Google has crawled this database. Sometimes their target demographic searches Google for a specific part number and their site is typically one of the few sites that comes up for any given part number. They did nothing special to invite Google to crawl the database, it just happened. If you Google "170-00759-405" for instance, one of the results in the top 5 will link to an Applied Composites page with that part number in a list of Embraer parts they are approved to make.Okay, now they can also make tens of thousands of other aircraft parts, but they can only list them once they build the part and self-certify. So they are considering creating a separate database and a simple interface to populate it with parts they are not yet approved to make, but COULD make upon request. Their hope is that Google would crawl and catalog this database, like the other one, so that when their target demographic searches, that they are taken to the site and told that they can make it even though it is not yet in the certified list. We are looking for a way to add that second database AND to create a more elegant landing experience. Right now the search links just go to a random list of parts presumably generated and harvested by the GoogleBots. However, it would be better if a part number search took them to a landing page presenting a clearer explanation and call to action. Like, "Looking for Embraer 170-00759-405? We can build that! We are currently FAA Certified to produce that Part. Call now to speak to a Representative." Or if the are NOT Certified, it would not say FAA Certified, but would still invite a phone call. Anyway, looking for a proposal on how to accomplish this goal. No idea what budget is appropriate at this point. So I am just putting a standard range. Feel free to propose something outside of that range. Just need to understand what you propose, cost and schedule. Thank you.

    [more]
  • $333 USD 3 days ago

    Looking for someone to take a list of 500 Arizona construction companies and research extensively to find contacts. I don"t have the contacts, just the company names. Looking for CEOs, CFOs, Risk Managers, Office Managers, Owners, etc. Email, phone, direct links to social media profiles, etc. This will require manual research, not data scraping. Multiple contacts per company to be identified (I only have the name of the company) and researched. Would like to see regular updates to confirm accuracy and begin working with the list as it grows. This will take some time.

    [more]
  • $8888 USD 3 days ago

    We want to set up a web and product search resembling Google for a specific segment of some few hundred websites and shops. There are 4 content types:- Web pages (to be crawled e.g. by Nutch)- Products (supplied as XML, Formats: Google Shopping, Amazon, more XML+CSV formats later)- Classifieds (scraping of 6 websites, splitting up approx. 15 fields like start-/end date, price, title, description)- Pictures (OPTIONAL, may be dropped initially - store URL and alt-tag content only)The content will be in 4 languages: EN, DE, FR, IT (more to follow), content language one fieldThe search index will be stored in structured form with up to 20 fields, slightly varying by content type. Web data will need to identify rich snippets with product, breadcrumb and review data. Product data will be loaded daily. Classifieds need to be scraped at least daily. The search engine UI shall get a UI somewhat close to Google search (AJAX-based), providing 4 search areas: Web, Shopping, Classifieds, Images. Classifieds are formatted like Web pages with a mid-sized (approx. 180 px x 150 px) image. The UI will be prepared for translation. The URLs to crawl and the filenames to import for shopping data will be preset in this version (also: no personalisation, LBS, ...).The service (crawler, search engine and web UI) is to be set up with a cloud service (AWS, Azure,..) to be agreed upon.This is an attempt to have the entire service developed in one job. We may split it up depending on your feedback. Please state your experience with the technology involved.

    [more]
  • $155 USD 4 days ago

    I want to have the ability to pull data from the internet in order to create local business directories, perhaps real estate directories, etc.As this service is offered, I assume the software exists. I"m looking for someone to direct me to the right software and then train me on it so I can do it myself in the future.

    [more]
  • $255 USD 4 days ago

    I need help crawling craigslist now that the show contact info button is on every posting instead of the actual phone number. I used web data collector to crawl other sites in the past, but not craigslist. doesn"t anyone know how with this new format?

    [more]
  • $1333 USD 4 days ago

    Can you scrape sites and build Wordpress pages from the results using WP-ALL-IMPORT? I want someone who has experience of Custom Post types, scraping and building killer page templates. You must understand how to manipulate the databases.We will be scraping yacht charter sites to get descriptions of destinations, suppliers, etc.

    [more]
  • $1055 USD 5 days ago

    I Need a website that will show the benefits of wake up now that people will call me after seeing my many videos on youtube, wanting to make residual income at home.. I"ve got the domain I"ve got Everything! Just a site so people go to it see amazing collect leads or call me up all the time To Get In On This Awesome Opportunity!! Call me up and discuss opportunity while im on the run, on the phone in the car..

    [more]
  • $1444 USD 5 days ago

    Need a java developer to create a java agent. should have experience with instrumentation APIs to collect all the metrics from JVM.

  • $1422 USD 5 days ago

    I am looking for a programmer to help build a piece of software to be integrated with Linux and with the following features/functionality:- Ability to upload documents including different file types (PDF, .doc, jpeg, bmp, png, etc) and compress.- Enable OCR to read and extract certain defined information from those documents.- Using that extracted data, output compressed and standardized PDF"s, than categorize using pre-determined naming & file structure.

    [more]
  • [Sealed] 5 days ago

    [This is a Private Project. You must be logged in to view the Project Description]

  • $155 USD 6 days ago

    Need a spider script that crawls a list of domains and saves gathered information to a flat file. Information to be obtained is 1) record of external links on each domain in the list 2) record of anchor text or image alt text of each external link 3) google page rank of domain. Script should be written in php and saved server side to a flat file (csv or similar). Item "3" is optional, but items "1" and "2" are required.

    [more]
    lenzai does not have any open projects.
    lenzai does not have any work in progress.
[see more]

Portfolio

[see more]

Résumé

Experience

software engineering consultant

Jul 2009 - Present (5 years)

Cloverise Limited

data extraction, web automation, robots, spiders , crawlers. On demand services.<br /><br />consulting for SCRUM, LEAN start up and training developers

Software engineering manager

Sep 2004 - Apr 2009 (4 years)

SFR Business Team

team leader

Jul 2000 - Aug 2004 (4 years)

Oxone Technologie

Education

master's degree

Institut national polytechnique de Grenoble

1997-2001