You can go to Google Images and type in keywords such as "apple", "bank", "cloud", etc.
Download the returned images (more than 1000 images per keyword); the total will be 100,000 images.
Extract color histogram or SIFT features by using
[url removed, login to view]
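For reference, the color histogram step could look roughly like the sketch below. It is not the tool linked above; it is a minimal illustration that assumes NumPy is available and that each image is already loaded as an RGB array of shape (height, width, 3). The function name color_histogram and the default of 8 bins per channel are hypothetical choices made for this example.

```python
import numpy as np


def color_histogram(image, bins_per_channel=8):
    """Return a normalized joint RGB histogram as a flat feature vector.

    `image` is assumed to be an array-like of shape (H, W, 3) with values
    in 0..255. The name and the default bin count are illustrative only.
    """
    # Flatten the image into a list of (R, G, B) pixels.
    pixels = np.asarray(image, dtype=np.float64).reshape(-1, 3)

    # Count pixels in a 3-D grid of color bins covering 0..255 per channel.
    hist, _ = np.histogramdd(
        pixels,
        bins=(bins_per_channel,) * 3,
        range=((0, 256),) * 3,
    )
    hist = hist.ravel()

    # Normalize so images of different sizes are comparable.
    total = hist.sum()
    return hist / total if total > 0 else hist
```

If Pillow is installed, an image file can be turned into a suitable array with np.asarray(Image.open(path).convert("RGB")). One feature vector per image, stacked row by row, gives the matrix that the clustering step takes as input.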
Apply AP (affinity propagation) clustering to get an optimal partition of the returned images
[url removed, login to view]~mdehoon/software/cluster/[url removed, login to view]
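To make the clustering step concrete, here is a rough sketch of affinity propagation written with NumPy. It does not use the library linked above; it is an illustrative version of the standard responsibility/availability message passing, assuming similarity is the negative squared Euclidean distance between feature vectors and that the preference defaults to the median similarity. The function name, the damping value, and the iteration count are assumptions for this example.

```python
import numpy as np


def affinity_propagation(features, damping=0.7, max_iter=200, preference=None):
    """Cluster the rows of `features`; return (labels, exemplar_indices).

    Illustrative sketch only: dense n x n matrices, fixed iteration count,
    no convergence check. Needs at least two rows.
    """
    X = np.asarray(features, dtype=np.float64)
    n = X.shape[0]

    # Similarity: negative squared Euclidean distance between all pairs.
    diff = X[:, None, :] - X[None, :, :]
    S = -np.sum(diff ** 2, axis=2)

    # Preference (diagonal of S) controls how many exemplars emerge.
    if preference is None:
        preference = np.median(S[~np.eye(n, dtype=bool)])
    np.fill_diagonal(S, preference)

    R = np.zeros((n, n))  # responsibilities
    A = np.zeros((n, n))  # availabilities
    rows = np.arange(n)

    for _ in range(max_iter):
        # Responsibility update: compare each candidate with the best rival.
        AS = A + S
        best = np.argmax(AS, axis=1)
        first = AS[rows, best]
        AS[rows, best] = -np.inf
        second = AS.max(axis=1)
        R_new = S - first[:, None]
        R_new[rows, best] = S[rows, best] - second
        R = damping * R + (1 - damping) * R_new

        # Availability update: sum positive responsibilities per candidate.
        Rp = np.maximum(R, 0)
        np.fill_diagonal(Rp, np.diag(R))
        A_new = Rp.sum(axis=0)[None, :] - Rp
        self_avail = np.diag(A_new).copy()
        A_new = np.minimum(A_new, 0)
        np.fill_diagonal(A_new, self_avail)
        A = damping * A + (1 - damping) * A_new

    # Points whose self-responsibility plus self-availability is positive
    # are taken as exemplars; every other point joins the most similar one.
    exemplars = np.flatnonzero(np.diag(A) + np.diag(R) > 0)
    if exemplars.size == 0:
        return np.full(n, -1), exemplars
    labels = np.argmax(S[:, exemplars], axis=1)
    labels[exemplars] = np.arange(exemplars.size)
    return labels, exemplars
```

Because this version stores full n x n matrices, it is only practical for a few thousand images at a time, for example the set returned for a single keyword, rather than all 100,000 images at once.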