Develop a Photo Clustering System

£250-750 GBP

In Progress
Posted about 11 years ago

Paid on delivery
The requirement is to build a process/pipeline that can take a table (literally a database table) of information about geographically located photos and place them into meaningful but subjective groups, or clusters.

There are many 'dimensions' in the data that could be used for the clustering, including geographical coordinates, locality (town, country, etc.), date taken, textual tags (folksonomy), and photographer. There is also a freeform title and description, but we have already extracted automated terms from these, so there is no need to process freeform text. Any or all of these dimensions could be used, e.g. "taken by Joe Bloggs in April 2012" could be an arbitrary cluster. Clustering should ideally make use of the geographical coordinates to create clusters of nearby photos that share some other theme (such as being taken by a particular user), but it should not be limited to coordinates; where possible, multiple dimensions should be used. The photographer is a good candidate for clustering because a given photographer will often take similar photos in the same geographical area on any given day.

It will require two modes:
1) 'priming', where a large number of photos (over 3 million ultimately!) are taken and put into clusters.
2) 'updates', where batches of images are added (about 1000 at a time) and need placing into the existing clusters or into newly created ones.

The 'update' mode should, where possible, add to current clusters. It could delete and then recreate some clusters if that gives a better fit, and it also needs to be able to create new clusters where needed. In particular, it should be differential: most clusters will remain the same, with only a few changing. It shouldn't just delete all the clusters and start again. The two modes are closely related and will probably be largely similar (e.g. priming could just be lots of 'updates' starting with no clusters), but there could be some optimization possible to tailor for the two modes.
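As a rough illustration of the two modes described above, here is a minimal Python sketch in which priming is simply one large update against an empty cluster list. Everything in it is an assumption made for illustration, not part of the brief: the `Photo` and `Cluster` field names, the rule that a cluster holds one photographer's photos from one day, and the `max_km` radius of 5 km. It also places each photo in exactly one cluster and scans clusters linearly, which would need a spatial index at the 3 million photo scale.

```python
import math
from dataclasses import dataclass, field


@dataclass
class Photo:
    photo_id: int
    lat: float
    lon: float
    user: str  # photographer (hypothetical column name)
    day: str   # date taken as an ISO string, e.g. "2012-04-07"


@dataclass
class Cluster:
    user: str
    day: str
    photo_ids: list = field(default_factory=list)
    lat: float = 0.0  # running centroid latitude
    lon: float = 0.0  # running centroid longitude

    def add(self, p):
        # Update the centroid incrementally, then record the photo.
        n = len(self.photo_ids)
        self.lat = (self.lat * n + p.lat) / (n + 1)
        self.lon = (self.lon * n + p.lon) / (n + 1)
        self.photo_ids.append(p.photo_id)


def haversine_km(lat1, lon1, lat2, lon2):
    """Great-circle distance in kilometres between two coordinates."""
    r = 6371.0
    p1, p2 = math.radians(lat1), math.radians(lat2)
    dphi = p2 - p1
    dlmb = math.radians(lon2 - lon1)
    a = math.sin(dphi / 2) ** 2 + math.cos(p1) * math.cos(p2) * math.sin(dlmb / 2) ** 2
    return 2 * r * math.asin(math.sqrt(a))


def update(clusters, batch, max_km=5.0):
    """Differential update: add each photo to the nearest cluster with the
    same photographer and day within max_km, else start a new cluster.
    Returns the indices of the clusters that changed."""
    touched = set()
    for p in batch:
        best, best_d = None, max_km
        for i, c in enumerate(clusters):
            if c.user != p.user or c.day != p.day:
                continue
            d = haversine_km(p.lat, p.lon, c.lat, c.lon)
            if d <= best_d:
                best, best_d = i, d
        if best is None:
            clusters.append(Cluster(p.user, p.day))
            best = len(clusters) - 1
        clusters[best].add(p)
        touched.add(best)
    return touched


def prime(photos, max_km=5.0):
    """Priming is an update against an initially empty cluster list."""
    clusters = []
    update(clusters, photos, max_km)
    return clusters
```

Because `update` returns only the indices of clusters it changed, a caller could rewrite just those rows in the clusters table and leave the rest untouched.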
The aim would be to have every photo placed in one or more clusters, and ideally clusters should be somewhere on the order of 5-200 images. If a cluster grows much beyond 200, it should be a candidate for splitting. Ideally each cluster should have a label that describes it, e.g. "photos near Reading".

If K-means or similar is used to cluster geographically, it should be an adaptive algorithm that does not require K to be specified, i.e. it works out a good number of clusters to create rather than aiming to create, say, 30 clusters. [login to view URL]~wilkinson/Applets/[login to view URL]

A sample dataset can be supplied (say a table of 120,000 images), but the 'full' data set of 3.4M images could be used too. For a tiny sample showing the range of columns available, see [login to view URL]

It can be written in any language (PHP, Python, Java, etc.), but needs to be able to run fairly self-contained on a Linux server. MySQL would be the ideal backing database (downloading the data from MySQL and creating the clusters in a MySQL table), but others can be considered if they offer a tangible benefit (e.g. PostgreSQL/PostGIS).

The full source code, and the means to compile/run it, will be required. The eventual aim would be to release the source as open source (keep the credit yourself, or assign it to us).

To be clear, the requirement is not to come up with the perfect clustering system; as noted, the clusters are subjective. It is to build the framework, with a working clustering method, in such a way that the exact parameters can be tweaked as required.
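One possible way to avoid fixing K up front, sketched here under stated assumptions, is to split any oversized cluster in two with plain 2-means and repeat until every piece is at or below the size limit (a bisecting approach). The tuple layout `(lat, lon, locality)`, the function names, the use of squared differences on raw latitude and longitude as a crude distance, and the most-common-locality label are all illustrative choices rather than anything the brief prescribes.

```python
import random
from collections import Counter

MAX_SIZE = 200  # tweakable: clusters larger than this are split


def centroid(group):
    """Mean latitude and longitude of a list of (lat, lon, locality) tuples."""
    n = len(group)
    return (sum(p[0] for p in group) / n, sum(p[1] for p in group) / n)


def two_means(photos, iters=20, seed=0):
    """Split photos into two groups with plain 2-means on (lat, lon)."""
    rng = random.Random(seed)
    centres = rng.sample(photos, 2)
    groups = ([], [])
    for _ in range(iters):
        groups = ([], [])
        for p in photos:
            d0 = (p[0] - centres[0][0]) ** 2 + (p[1] - centres[0][1]) ** 2
            d1 = (p[0] - centres[1][0]) ** 2 + (p[1] - centres[1][1]) ** 2
            groups[0 if d0 <= d1 else 1].append(p)
        if not groups[0] or not groups[1]:
            break  # degenerate split, e.g. all photos at one coordinate
        centres = [centroid(g) for g in groups]
    return groups


def split_until_small(photos, max_size=MAX_SIZE):
    """Recursively bisect until no cluster exceeds max_size photos."""
    if len(photos) <= max_size:
        return [photos]
    a, b = two_means(photos)
    if not a or not b:
        # Fall back to halving so the recursion always terminates.
        mid = len(photos) // 2
        a, b = photos[:mid], photos[mid:]
    return split_until_small(a, max_size) + split_until_small(b, max_size)


def label(photos):
    """Label a cluster by its most common locality."""
    locality, _ = Counter(p[2] for p in photos).most_common(1)[0]
    return f"photos near {locality}"
```

The number of clusters falls out of the size limit rather than being chosen in advance, and `MAX_SIZE` is the kind of parameter that could be exposed for tweaking. Merging or dropping clusters below the lower bound of about 5 images is not covered by this sketch.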
Project ID: 4405177

About the project

9 proposals
Remote project
Active 11 yrs ago

Awarded to:
Hi, I'm a statistician from Uruguay and I have plenty of experience in data mining and data analysis.
£935 GBP in 30 days
5.0 (2 reviews)
1.8
9 freelancers are bidding on average £724 GBP for this job
I can do this task for you, please check PMB.
£525 GBP in 15 days
4.5 (28 reviews)
4.9
Hello, I am greatly interested in working for you. Any difficulties in image clustering will be no problem for me.
£750 GBP in 13 days
3.8 (4 reviews)
3.6
Hi, I can help you.
£735 GBP in 3 days
5.0 (1 review)
2.7
Hello, We specialize in Image clustering and will be able to complete the task as per your specifications. Please find complete details over PM. We hope to hear from you at the earliest.
£770 GBP in 20 days
5.0 (1 review)
2.4
I can help you in your project.
£400 GBP in 25 days
5.0 (2 reviews)
1.9
Hire me! I have a master's in CS and have been working at HCL for the past 5 years. I can do this for you in 30 days.
£750 GBP in 30 days
0.0 (0 reviews)
0.0
We have developers with the skills required to do this project and can provide you with the best solution in PHP or Python.
£550 GBP in 25 days
0.0 (2 reviews)
0.0
Hi Barry, I run a company specializing in Machine Learning and Social Media Analysis. I have submitted a detailed approach in my private message. Thank you.
£1,100 GBP in 30 days
0.0 (0 reviews)
0.0

About the client

Flag of UNITED KINGDOM
Ffestiniog, United Kingdom
5.0
3
Payment method verified
Member since Feb 15, 2004

Freelancer ® is a registered Trademark of Freelancer Technology Pty Limited (ACN 142 189 759)
Copyright © 2024 Freelancer Technology Pty Limited (ACN 142 189 759)