Nutch custom search application wanted jobs

Filter

My recent searches
Filter by:
Budget
to
to
to
Skills
Languages
Job State
243 nutch custom search application wanted jobs found, pricing in USD

...setup an ELK server, it will: 1. Crawl the web, where, (a) I should be able to define the URLs to start the crawling from, and limit the crawl space (e.g., search just the configured site, search configured site and linked webpages), and (b) Index all metatags in the document head section. 2. Index Twitter streams, where, (a) I should be able to configure

$218 (Avg Bid)
$218 Avg Bid
4 bids

Project 1) I need someone to install Apache Nutch and Apache Sorl and index Nutch to Solr. Also provide step by step instructions on the process that will allow me to duplicate the install on another server. Project 2) Create web UI for Solr frontend using Django or other program with admin backend.

$535 (Avg Bid)
$535 Avg Bid
38 bids

Hi, We are looking for a programmer that can write/configure a webcrawler to crawl a website and retrieve the records list. We are thinking to use Apache Nutch (with selenium) to do the crawling (other possible). These records need to be parsed, so the information (id, title, introtext, date,...) can be stored in a database. If this job is done

$167 (Avg Bid)
$167 Avg Bid
15 bids

The whole requirement to build a job search engine e.g. [url removed, login to view] Possibly having capability to grab jobs from any type of sites. Points to consider: Suggest between real time crawl, or say delay of up to 24h whats feasible. Writing screen scrapping rules for each web site/ group ..or suggest. Sites change and xpath's become invalid. Some kind

$92 (Avg Bid)
$92 Avg Bid
2 bids

Hi attilapados, I am building a setup where I use Nutch for crawling websites. Using hadoop, Solr and Nutch and I want to optimize Nutch for the search and I came across your profile. Hope that you maybe can help me. Thanks Niels

$15 / hr (Avg Bid)
$15 / hr Avg Bid
1 bids

We need a Nutch Specialist for Configure the software [url removed, login to view] for crawl Outlinks recursively based on seed list. The result will be indexed into solr with only : url, http code

$190 (Avg Bid)
$190 Avg Bid
6 bids

...to build on wordpress a powes search engine. The related engine will first index local websites and retrieve them as result based on defined categories. At second time, the search engine will also acts as advertiser. The website layout will be similar as the one of google with few differences. The smart power search will obligatory be built on top

$477 (Avg Bid)
$477 Avg Bid
42 bids

Hi, i need to install on linux and setup Web crawler based on Apache Lucene or Nutch or any solution suitable for me. It should work with any golden page catalog and should be able to save important information and then according keywords search appropriete URL on bing or yahoo. also from list of urls engine should indentify keywords from menu

$5 / hr (Avg Bid)
$5 / hr Avg Bid
5 bids

A search engine with Apache Nutch and use MongoDB as the data-store. The web crawler will search in facebook my friends location (check-ins, where they are living, where they are now), and store the location data (latitude - longitude) in mongodb. The web crawler will run automatically and update or insert my friends informations in mongodb.

$225 (Avg Bid)
$225 Avg Bid
11 bids

...project is to build on wordpress, a power local search engine. The main goal of the search engine is first of all to index (crawling/scraping) all local existing websites and retrieve them as search-results based on defined categories. And at second point, the search engine will also acts as advertiser. The search engine will have a similar layout as google

$316 (Avg Bid)
$316 Avg Bid
2 bids

...built on top of apache Solr. But we need an apache solr and nutch expert to implement the Solr/nutch part. So, I wonder if you could be available for this task? **** Project ****** The purpose of this small project is to build on wordpress, a power local search engine. The main goal of the search engine is first of all to index (crawling/scraping) all

$150 (Avg Bid)
$150 Avg Bid
1 bids

...project is to build on wordpress, a power local search engine. The main goal of the search engine is first of all to index (crawling/scraping) all local existing websites and retrieve them as search-results based on defined categories. And at second point, the search engine will also acts as advertiser. The search engine will have a similar layout as google

$300 (Avg Bid)
$300 Avg Bid
1 bids

The purpose of this small project is to build on wordpress power local search engine in french language for the first stage. The goal of the search is first to index local websites and retrieve them as results based on defined [url removed, login to view] second point, the search engine will also acts as advertiser. The website layout will be similar as the one

$460 (Avg Bid)
$460 Avg Bid
21 bids

...project is to build on wordpress power local search engine in french language for the first stage. The main goal of the search engine is first of all to index local websites and retrieve them as results based on defined categories. And at second point, the search engine will also acts as advertiser. The search engine will have a similar layout as google

$331 (Avg Bid)
$331 Avg Bid
35 bids

...project is to build on wordpress power local search engine in french language for the first stage. The main goal of the search engine is first of all to index local websites and retrieve them as results based on defined categories. And at second point, the search engine will acts as advertiser. The search engine will have a similar layout as google with

$250 (Avg Bid)
$250 Avg Bid
1 bids

...SetEnvIfNoCase User-Agent ([url removed, login to view]|binlar|casper|checkpriv|choppy|clshttp|cmsworld|diavol|dotbot|extract|feedfinder|flicky|g00g1e|harvest|heritrix|httrack|kmccrew|loader|miner|nikto|nutch|planetwork|postrank|purebot|pycurl|python|seekerspider|siclab|skygrid|sqlmap|sucker|turnit|vikspider|winhttp|xxxyy|youda|zmeu|zune) bad_bot Order Allow,Deny Allow from All

$36 (Avg Bid)
$36 Avg Bid
18 bids

...to solve access to qbox using Nutch xml and accessing elasticsearch. You need skills as follows - 1. Qbox general knowledge 2. curl, ubuntu, xml 3. nutch/elasticsearch The task is only to resolve issues of access to qbox using properties, port, and cluster details. There is no need to understand more about nutch or elasticsearch. You need skills

$44 / hr (Avg Bid)
$44 / hr Avg Bid
1 bids

...to implement Nutch and to get Nutch running as a demo to scrap data and store the data in hdfs. You will need skills as follows - 1. Ubuntu [url removed, login to view], this is command based with commands to manipulate files, install software, and log data 2. Java 1.7 with Eclipse, understanding classes, methods, debugging, maven, jar files. 3. Nutch, understanding

$21 / hr (Avg Bid)
$21 / hr Avg Bid
22 bids

具体需求: 1.在指定服务器安装nutch和抓取我方会提供的20网址。 2.提供nutch具体安装步骤和使用说明 3.抓取内容可导入mysql或solr等,用于查询 4.提供如何查询抓取内容的说明 交付需求: 希望在3天内完成安装和抓取。 (补充说明,我方可以提供服务器供测试使用)

$35 - $295
$35 - $295
0 bids

Wanted a professional who can make a highly scalable search engine using apache solr , crawler can be made using nutch or any other library

$260 (Avg Bid)
$260 Avg Bid
6 bids