
Big data entity resolution in NoSQL database

¥240-2000 CNY

Closed
Posted almost 7 years ago

Paid on delivery
There are several collections storing documents that contain company entity information. These collections record different kinds of information about company entities, such as executives, accountants, products and investments. Depending on the type of information stored, the collections differ in field names and structure, but they also overlap in certain fields, such as company name, geolocation, contact info, industry keyword and official website.

We now need to link documents about the same entity across all of these collections. The obstacles we have encountered are:

1. Because the data comes from different sources, the name of the same company entity appears differently across collections. Some names are full names, some are abbreviations, some are in Pinyin and some are simply the initials of the English name, so it is hard to match all documents for the same entity.
2. Different collections contain different fields, and not all collections have contact information and website fields. Company name may be the only field common to all collections, so it is hard to establish a unified matching rule.

If we use the Apache Spark framework to solve this entity resolution problem, which algorithms offer the best performance in terms of precision and feasibility? The largest collection contains around 20,000,000 documents.

We need to find an outsourced specialist who has:

1. Project experience in big data entity resolution in NoSQL databases
2. Over two years of experience with Apache Spark and MongoDB
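For illustration only, the matching problem described above can be sketched as a three-step pipeline: normalize company names, group records into blocks so that only plausible candidates are compared, then score each candidate pair. The sketch below is plain Python using only the standard library, not the client's method or a Spark job. The suffix list, the `INITIALS_SCORE` constant, the 0.8 threshold and the sample records are all hypothetical choices made for the example. Pinyin variants are not handled; they would need a separate transliteration step before normalization.

```python
import re
import unicodedata
from collections import defaultdict
from itertools import combinations

# Hypothetical list of legal-form tokens to ignore when comparing names.
LEGAL_SUFFIXES = {"co", "ltd", "inc", "corp", "corporation", "company", "limited", "llc"}
# Hypothetical score assigned when one name is the initials of the other.
INITIALS_SCORE = 0.9


def normalize_name(name):
    """Lowercase, unify Unicode forms, drop punctuation and legal suffixes."""
    text = unicodedata.normalize("NFKC", name).lower()
    tokens = [t for t in re.findall(r"\w+", text) if t not in LEGAL_SUFFIXES]
    return " ".join(tokens)


def initials(normalized):
    """First character of each token, e.g. 'acme trading' -> 'at'."""
    return "".join(t[0] for t in normalized.split())


def char_ngrams(normalized, n=3):
    """Set of character n-grams of the name with spaces removed."""
    compact = normalized.replace(" ", "")
    if len(compact) < n:
        return {compact} if compact else set()
    return {compact[i:i + n] for i in range(len(compact) - n + 1)}


def jaccard(a, b):
    """Jaccard similarity of two sets; 0.0 if either is empty."""
    if not a or not b:
        return 0.0
    return len(a & b) / len(a | b)


def name_similarity(a, b):
    """Score two normalized names: initials match first, else n-gram Jaccard."""
    for full, short in ((a, b), (b, a)):
        if len(full.split()) > 1 and initials(full) == short.replace(" ", ""):
            return INITIALS_SCORE
    return jaccard(char_ngrams(a), char_ngrams(b))


def blocking_keys(normalized):
    """Keys used to group candidates: the first token and the initials."""
    tokens = normalized.split()
    if not tokens:
        return set()
    return {tokens[0], initials(normalized)}


def match_records(records, threshold=0.8):
    """Link records across collections.

    records: iterable of (collection, doc_id, raw_name) tuples.
    Returns a dict mapping a sorted pair of (collection, doc_id) to its score.
    """
    blocks = defaultdict(list)
    for collection, doc_id, raw in records:
        norm = normalize_name(raw)
        for key in blocking_keys(norm):
            blocks[key].append((collection, doc_id, norm))

    matches = {}
    for members in blocks.values():
        for left, right in combinations(members, 2):
            if left[0] == right[0]:
                continue  # only link documents from different collections
            score = name_similarity(left[2], right[2])
            if score >= threshold:
                pair = tuple(sorted([left[:2], right[:2]]))
                matches[pair] = score
    return matches


# Made-up sample records standing in for documents from three collections.
sample = [
    ("executives", 1, "Acme Trading Co., Ltd."),
    ("products", 7, "ACME Trading Limited"),
    ("products", 8, "Zenith Logistics Inc"),
    ("investment", 3, "IBM"),
    ("executives", 2, "International Business Machines Corp."),
]
matches = match_records(sample)
```

Blocking is what keeps the comparison feasible at the scale mentioned in the brief: only records that share a key are compared, instead of every pair. In Spark the same idea would typically be expressed as a join or group on the blocking key, and Spark MLlib's `MinHashLSH` with `approxSimilarityJoin` is one existing option for approximate set-similarity joins. Whether either fits this data is an open question that would need testing on real samples.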
Project ID: 14290277

About the project

5 proposals
Remote project
Active 7 yrs ago

5 freelancers are bidding on average ¥1,195 CNY for this job
Hello. Good to see another serious posting. I don't usually look for new clients, but I happened to see your job post and wanted to contact you. I've read your brief and I could absolutely help you with your goal. I have 10+ years of experience designing and developing mobile apps for iPhone and Android and building websites, so we can make your idea a success. I would approach your project by starting with wireframes and getting the design completed before starting the actual development phase. I am highly qualified for this project and would love to speak with you further about taking it on. If you'd like to view my previous work, take a look at my Freelancer Portfolio. I hope you will contact me on chat. Thank you for taking the time to read my application. Cheers, Lang
¥1,244 CNY in 3 days
5.0 (1 review)
3.2
Hey, we are a team of technical developers with expertise in this kind of work. Ping me if you are looking for a quick resolution.
¥1,248 CNY in 7 days
0.0 (0 reviews)
0.0
Hi, this is Fatima. I have been researching and have found two native Spark solutions for your problem, plus Duke. It will work. Best regards.

Relevant Skills and Experience
I have been working with Spark and Scala for 3 years.

Proposed Milestones
¥200 CNY - Analysis
¥800 CNY - Test run, small data set
¥800 CNY - Test run, big data set
¥200 CNY - Project finished

Additional Services Offered
¥200 CNY - Program maintenance

Would you like an expandable solution? Hire me.
¥2,000 CNY in 14 days
0.0 (0 reviews)
0.0

About the client

China
0.0
0
Member since Jun 9, 2017

Freelancer ® is a registered Trademark of Freelancer Technology Pty Limited (ACN 142 189 759)
Copyright © 2024 Freelancer Technology Pty Limited (ACN 142 189 759)