Welcome to the new version of CaltechAUTHORS. Login is currently restricted to library staff. If you notice any issues, please email coda@library.caltech.edu
Published 2012 | Published + Submitted + Accepted Version
Book Section - Chapter Open

Crowdclustering

Abstract

Is it possible to crowdsource categorization? Amongst the challenges: (a) each worker has only a partial view of data, (b) different workers may have different clustering criteria and may produce different numbers of categories, (c) the underlying category structure may be hierarchical. We propose a Bayesian model of how workers may approach clustering and show how one may infer clusters/categories, as well as worker parameters, using this model. Our experiments, carried out on large collections of images, suggest that Bayesian crowdclustering works well and may be superior to single-expert annotations.

Additional Information

This work was supported by ONR MURI grant 1015-G-NA-127, ARL grant W911NF-10-2-0016, and NSF grants IIS-0953413 and CNS-0932392.

Attached Files

Published - 4187-crowdclustering.pdf

Accepted Version - crowd_clust_tech_rep.pdf

Submitted - crowd_clust_tech_rep_final.pdf

Files

crowd_clust_tech_rep.pdf
Files (10.7 MB)
Name Size Download all
md5:c78ff901e10c522f19d54bc5c0e80d5f
1.3 MB Preview Download
md5:e7fa5cd0d50bd0cf4b27fffe2310e475
1.3 MB Preview Download
md5:dac5cafdbc6e198d6360fe1a4326ac94
8.2 MB Preview Download

Additional details

Created:
August 19, 2023
Modified:
October 26, 2023