Published 2012
| Published + Submitted + Accepted Version
Book Section - Chapter
Open
Crowdclustering
Chicago
Abstract
Is it possible to crowdsource categorization? Amongst the challenges: (a) each worker has only a partial view of data, (b) different workers may have different clustering criteria and may produce different numbers of categories, (c) the underlying category structure may be hierarchical. We propose a Bayesian model of how workers may approach clustering and show how one may infer clusters/categories, as well as worker parameters, using this model. Our experiments, carried out on large collections of images, suggest that Bayesian crowdclustering works well and may be superior to single-expert annotations.
Additional Information
This work was supported by ONR MURI grant 1015-G-NA-127, ARL grant W911NF-10-2-0016, and NSF grants IIS-0953413 and CNS-0932392.Attached Files
Published - 4187-crowdclustering.pdf
Accepted Version - crowd_clust_tech_rep.pdf
Submitted - crowd_clust_tech_rep_final.pdf
Files
crowd_clust_tech_rep.pdf
Additional details
- Eprint ID
- 45926
- Resolver ID
- CaltechAUTHORS:20140527-173311294
- ONR MURI
- 1015-G-NA-127
- Army Research Laboratory
- W911NF-10-2-0016
- NSF
- IIS-0953413
- NSF
- CNS-0932392
- Created
-
2014-05-28Created from EPrint's datestamp field
- Updated
-
2020-03-09Created from EPrint's last_modified field
- Other Numbering System Name
- CNS Technical Report