Nutch custom search application wanted jobs

    264 nutch custom search application wanted jobs found, pricing in USD

    We are looking for OpenEphyra and Fusion experts to design a search engine architecture. We will provide the base architecture documentation. The candidate should be an expert in Spark, Scala, Lucene, Solr, UIMA, ZooKeeper, Kafka, Nutch, OpenNLP and Apache Mahout.

    $30 - $250
    0 bids

    Hello, I have an Apache Solr, Nutch, and Hadoop setup and I need help crawling a large-scale CrawlDb. The crawl currently takes too long because of the 7.8M CrawlDb, which should grow even larger, and the results are then indexed in Solr. The first help needed is Nutch tuning, then Solr tuning. Do you have experience with this?

    $15 / hr (Avg Bid)
    1 bids

    I want to crawl a huge website and index it into Apache Solr. Tasks to be done: crawling, ranking, indexing, recrawling (how it goes), rank changes depending on the requirements, and optimization. Please get in touch if you have prior experience; this needs to be done ASAP.

    $12 / hr (Avg Bid)
    2 bids

    I need you to develop some software for me: build a specialized search engine using Elasticsearch and Apache Nutch.

    $180 (Avg Bid)
    6 bids

    I need to crawl data and store it in HDFS using Apache Nutch integrated with Hadoop.

    $244 (Avg Bid)
    6 bids

    I want to extract files from an AJAX-loading page using Nutch.

    $9 - $23
    0 bids

    ... For an upcoming we need a website search and a special search based on Elasticsearch. Website: - Set up Elasticsearch for a website - At the end we will have around 17 different websites with the same functionality, but they need to have separate indexes. - We need a crawler to crawl the websites (possibly Nutch) - Languages should be identified and

    $2207 (Avg Bid)
    1 bids

    For an upcoming we need a website search and a special search based on Elasticsearch. Website: - Set up Elasticsearch for a website - At the end we will have around 17 different websites with the same functionality, but they need to have separate indexes. - We need a crawler to crawl the websites (possibly Nutch) - Languages should be identified

    $3212 (Avg Bid)
    9 bids

    I need a Nutch installation and configuration to set up a small search engine.

    $10 - $30
    0 bids

    Hello all, I need a distributed web crawler with indexing that can handle crawls of any size. For example, the crawler must be able to crawl and index a single website (a few web pages) as well as the whole web (over a billion web pages). Installation and configuration: Apache Nutch. Thank you.

    $41 (Avg Bid)
    4 bids

    We need an Apache Nutch process built to monitor price data on competitor and/or vendor websites and feed it into some type of reporting or integration with our catalog for updates. We are open to suggestions on how to approach this.

    $430 (Avg Bid)
    15 bids

    I'm looking to have a backend with cron that can search two sites for a list of sentences and scrape the results, skipping some values I don't need and adding the scraped results to a database, while being able to catch hashes so the data will be updated. I would like to use Docker and Hadoop with Nutch. Let me know if we can start working together.

    $250 (Avg Bid)
    1 bids

    Looking for blogger-developers with lots of experience writing about Elasticsearch. You will be invited as a guest blogger and paid for your contributions. Our content director will work with you on topic outlines and put your submissions through a funnel where they will be edited for English narrative and technical competency. When your piece is published

    $287 (Avg Bid)
    15 bids

    I am experimenting with Apache Nutch and Solr to crawl specific websites and then index them in Solr. Later I want to be able to retrieve the content from Solr using search queries.

    $176 (Avg Bid)
    9 bids

    ...must be able to crawl a single website (a few web pages) as well as the whole web (over a billion web pages). We have found four solutions that may fit our use case: - Apache Nutch - StormCrawler - Heritrix - Mixnode We need someone to go through all these options and provide us with answers to the following questions: - Total cost of ownership for each

    $77 (Avg Bid)
    15 bids

    New company logo name: "Costa Rica Green Airways" . We are a charter company that is now opening a sister scheduled airline for domestic and r...on the internet, instagram is carmonair charter, and also facebook. Please try to catch our peace and love vibe and also as the owner loves nature conservation and a top nutch service. Warm Regards

    $100 (Avg Bid)
    1036 entries

    ...setup an ELK server, it will: 1. Crawl the web, where, (a) I should be able to define the URLs to start the crawling from, and limit the crawl space (e.g., search just the configured site, search configured site and linked webpages), and (b) Index all metatags in the document head section. 2. Index Twitter streams, where, (a) I should be able to configure

    $239 (Avg Bid)
    3 bids

    Project 1) I need someone to install Apache Nutch and Apache Solr and index the Nutch crawl into Solr. Also provide step-by-step instructions on the process that will allow me to duplicate the install on another server. Project 2) Create a web UI for a Solr frontend using Django or another framework, with an admin backend.

    $536 (Avg Bid)
    34 bids

    Hi, we are looking for a programmer who can write/configure a web crawler to crawl a website and retrieve the records list. We are thinking of using Apache Nutch (with Selenium) to do the crawling (other tools are possible). These records need to be parsed so the information (id, title, introtext, date,...) can be stored in a database. If this job is done

    $171 (Avg Bid)
    14 bids

    The whole requirement is to build a job search engine, e.g. [login to view URL], possibly with the capability to grab jobs from any type of site. Points to consider: suggest which is feasible, a real-time crawl or a delay of up to 24h. Writing screen-scraping rules for each website/group, or suggest an alternative. Sites change and XPaths become invalid. Some kind

    $92 (Avg Bid)
    2 bids

    Hi attilapados, I am building a setup using Hadoop, Solr and Nutch, where Nutch crawls websites. I want to optimize Nutch for search and I came across your profile. I hope you can help me. Thanks, Niels

    $15 / hr (Avg Bid)
    1 bids

    We need a Nutch specialist to configure the software (v1.12) to crawl outlinks recursively based on a seed list. The results will be indexed into Solr with only: url, http code

    $175 (Avg Bid)
    5 bids

    ...to build on WordPress a power search engine. The engine will first index local websites and retrieve them as results based on defined categories. Second, the search engine will also act as an advertiser. The website layout will be similar to Google's, with a few differences. The smart power search must be built on top

    $477 (Avg Bid)
    40 bids

    Hi, I need to install and set up on Linux a web crawler based on Apache Lucene, Nutch, or any other suitable solution. It should work with any golden page catalog and should be able to save important information and then, according to keywords, search for the appropriate URL on Bing or Yahoo. Also, from a list of URLs the engine should identify keywords from the menu

    $6 / hr (Avg Bid)
    4 bids

    A search engine with Apache Nutch, using MongoDB as the data store. The web crawler will search Facebook for my friends' locations (check-ins, where they live, where they are now) and store the location data (latitude, longitude) in MongoDB. The web crawler will run automatically and update or insert my friends' information in MongoDB.

    $243 (Avg Bid)
    11 bids

    ...project is to build, on WordPress, a power local search engine. The main goal of the search engine is first of all to index (crawling/scraping) all existing local websites and retrieve them as search results based on defined categories. Second, the search engine will also act as an advertiser. The search engine will have a layout similar to Google's

    $316 (Avg Bid)
    2 bids

    ...built on top of Apache Solr. But we need an Apache Solr and Nutch expert to implement the Solr/Nutch part. So, I wonder if you would be available for this task? **** Project ****** The purpose of this small project is to build, on WordPress, a power local search engine. The main goal of the search engine is first of all to index (crawling/scraping) all

    $150 (Avg Bid)
    1 bids

    ...project is to build, on WordPress, a power local search engine. The main goal of the search engine is first of all to index (crawling/scraping) all existing local websites and retrieve them as search results based on defined categories. Second, the search engine will also act as an advertiser. The search engine will have a layout similar to Google's

    $300 (Avg Bid)
    1 bids

    The purpose of this small project is to build, on WordPress, a power local search engine, in French for the first stage. The goal of the search is first to index local websites and retrieve them as results based on defined [login to view URL] second point, the search engine will also act as an advertiser. The website layout will be similar to the one

    $460 (Avg Bid)
    21 bids

    ...project is to build, on WordPress, a power local search engine, in French for the first stage. The main goal of the search engine is first of all to index local websites and retrieve them as results based on defined categories. Second, the search engine will also act as an advertiser. The search engine will have a layout similar to Google's

    $331 (Avg Bid)
    35 bids

    ...project is to build, on WordPress, a power local search engine, in French for the first stage. The main goal of the search engine is first of all to index local websites and retrieve them as results based on defined categories. Second, the search engine will act as an advertiser. The search engine will have a layout similar to Google's, with

    $250 (Avg Bid)
    1 bids

    ...SetEnvIfNoCase User-Agent ([login to view URL]|binlar|casper|checkpriv|choppy|clshttp|cmsworld|diavol|dotbot|extract|feedfinder|flicky|g00g1e|harvest|heritrix|httrack|kmccrew|loader|miner|nikto|nutch|planetwork|postrank|purebot|pycurl|python|seekerspider|siclab|skygrid|sqlmap|sucker|turnit|vikspider|winhttp|xxxyy|youda|zmeu|zune) bad_bot Order Allow,Deny Allow from All

    $37 (Avg Bid)
    17 bids

    ...to solve access to qbox using Nutch xml and accessing elasticsearch. You need skills as follows - 1. Qbox general knowledge 2. curl, ubuntu, xml 3. nutch/elasticsearch The task is only to resolve issues of access to qbox using properties, port, and cluster details. There is no need to understand more about nutch or elasticsearch. You need skills

    $44 / hr (Avg Bid)
    1 bids

    ...to implement Nutch and to get Nutch running as a demo to scrape data and store the data in HDFS. You will need skills as follows - 1. Ubuntu 14.04, this is command based with commands to manipulate files, install software, and log data 2. Java 1.7 with Eclipse, understanding classes, methods, debugging, Maven, jar files. 3. Nutch, understanding

    $21 / hr (Avg Bid)
    21 bids

    Wanted: a professional who can build a highly scalable search engine using Apache Solr. The crawler can be built using Nutch or any other library.

    $253 (Avg Bid)
    5 bids

    Wanted: a professional who can build a highly scalable search engine using Apache Solr. The crawler can be built using Nutch or any other library.

    $46 (Avg Bid)
    1 bids

    ...installed. Nutch installed. Solr installed. Linux flavour is ubuntu 14.04. The requirement for this job or your task is to do the following below: 1. Do/Fix: Configure Solr to talk to Nutch, that is full integration of solr with nutch. 2. Do/Fix: Configure nutch to integrate with MySQL, that is configured MySQL stores data crawled by nutch - and

    $154 (Avg Bid)
    6 bids

    Hello, we are looking for someone to install Apache Solr and integrate Nutch (crawler) on a Windows machine. TeamViewer access will be given. People with experience only. Regards,

    $25 (Avg Bid)
    2 bids

    I want to first install Apache Nutch 1.9 on my system with Solr. After installation I want a working demo of crawling any website and indexing the data into Solr. I also want to scrape only selected tags using Nutch. Don't waste your time and mine if you will use a trial-and-error approach to the installation.

    $119 (Avg Bid)
    1 bids

    ...support, please study all installation guides from apache org [login to view URL] You can install versions that you are familiar with, but the required releases must be Nutch 2.x.x, Solr 4.x.x and HBase 0.9x.x. All installation steps should be documented / written down in Word or readme.txt. The script will be tested on a clean VPS before payment / completion

    $93 (Avg Bid)
    14 bids

    Hi ever...Template. We have already done the design of our website with Adobe Illustrator, along with the website plan. We need someone who is very professional; our website needs to be top-notch. Here's the theme structure of the Crate Joy template: [login to view URL] The theme file to modify is in the attachment. Thanks in advance!

    $405 (Avg Bid)
    25 bids

    I'm looking for a freelancer that would be able to set-up a web crawling stack on a CentOS7 server. - Apache Nutch 2.x / Tika - MongoDB (Gora) - Elasticsearch It should be a ready to use solution with a very basic REST service upfront allowing to pass a domain name to launch the process (crawling, parsing, indexing...). Ideally, you are

    $52899 (Avg Bid)
    14 bids

    Secure web-based application: web front end in Python or Java (responsive), back end is Elasticsearch. Business rules will generate reports from Elasticsearch. Elasticsearch will be fed by Nutch and web-based questionnaires. This runs on AWS Elastic Beanstalk. Authy 2FA. PayPal and credit card payment on checkout. More details available with NDA.

    $7860 (Avg Bid)
    28 bids

    The project overall is much larger, but initially I am looking for someone to set up Nutch to crawl a set of about 10 websites; the retrieved content needs to be stored in an Elasticsearch index. There are lots of guides online on how to do this. I just need someone to do the legwork for me. The end deliverable is a document with steps on doing

    $151 (Avg Bid)
    12 bids

    I need a Webcrawler to gather sport statistics from a specific website and save that informat...an open-source programme (e.g. Scrapy). Therefore it would be necessary to write the crawler either with Python (I can run it with Scrapy), or with Java (I can run it with Nutch). For a detailed explanation of my wish please find the attached ppt-file!

    $156 (Avg Bid)
    32 bids

    Install a server with the following elements: - Apache (or alternate web server) - Apache Tomcat (or alternate java server) - Apache Nutch - Apache Tika

    $107 (Avg Bid)
    10 bids

    I am looking for a wget-type software crawler. It will be written in Python and be able to crawl sites using specific criteria. Experience with Nutch, Common-crawler or Heritrix is preferred.

    $460 (Avg Bid)
    10 bids

    I need a developer who can code plugins for me for adding custom logic to parsing and fetching crawled content from a list of pre-defined domains.

    $405 (Avg Bid)
    1 bids

    ... I'm interested in stats around how the sites are organized content-wise (the most popular categories and tags), etc... From a crawling standpoint I would prefer either Nutch 2.3 or Scrapy. We then need to parse relevant pieces from the HTML (roughly 10-15 fields that are easily identifiable in the dom), and get the data into an ElasticSearch index

    $480 (Avg Bid)
    8 bids