The multidimensional wisdom of crowds
Abstract
Distributing labeling tasks among hundreds or thousands of annotators is an increasingly important method for annotating large datasets. We present a method for estimating the underlying value (e.g. the class) of each image from (noisy) annotations provided by multiple annotators. Our method is based on a model of the image formation and annotation process. Each image has different characteristics that are represented in an abstract Euclidean space. Each annotator is modeled as a multidimensional entity with variables representing competence, expertise and bias. This allows the model to discover and represent groups of annotators that have different sets of skills and knowledge, as well as groups of images that differ qualitatively. We find that our model predicts ground truth labels on both synthetic and real data more accurately than state of the art methods. Experiments also show that our model, starting from a set of binary labels, may discover rich information, such as different "schools of thought" amongst the annotators, and can group together images belonging to separate categories.
Additional Information
©2010 Neural Information Processing Systems. P.P. and P.W. were supported by ONR MURI Grant #N00014-06-1-0734 and EVOLUT.ONR2. S.B. was sup- ported by NSF CAREER Grant #0448615, NSF Grant AGS-0941760, ONR MURI Grant #N00014-08-1-0638, and a Google Research Award.Attached Files
Published - Welinder_NIPS2010_0577.pdf
Files
Name | Size | Download all |
---|---|---|
md5:68d6e2b3d6f05c616462343a2848db57
|
2.5 MB | Preview Download |
Additional details
- Eprint ID
- 49746
- Resolver ID
- CaltechAUTHORS:20140916-124026882
- Office of Naval Research (ONR) Multidisciplinary University Research Initiative (MURI)
- N00014-06-1-0734
- Office of Naval Research (ONR)
- EVOLUT.ONR2.
- NSF
- 0448615
- NSF
- 0941760
- Office of Naval Research (ONR)
- N00014-08-1-0638
- Google Research Award
- Created
-
2014-09-16Created from EPrint's datestamp field
- Updated
-
2020-03-09Created from EPrint's last_modified field
- Series Name
- Advances in Neural Information Processing Systems
- Series Volume or Issue Number
- 23