A Lazy Man's Approach to Benchmarking: Semisupervised Classifier Evaluation and Recalibration
- Creators
- Welinder, Peter
- Welling, Max
- Perona, Pietro
Abstract
How many labeled examples are needed to estimate a classifier's performance on a new dataset? We study the case where data is plentiful, but labels are expensive. We show that by making a few reasonable assumptions on the structure of the data, it is possible to estimate performance curves, with confidence bounds, using a small number of ground truth labels. Our approach, which we call Semisupervised Performance Evaluation (SPE), is based on a generative model for the classifier's confidence scores. In addition to estimating the performance of classifiers on new datasets, SPE can be used to recalibrate a classifier by reestimating the class-conditional confidence distributions.
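To make the idea concrete, here is a minimal sketch of semisupervised performance estimation, assuming the classifier's scores follow a two-component Gaussian mixture (one component per class). This is an illustrative stand-in, not the paper's exact generative model or inference procedure, and all function and variable names are hypothetical.

```python
# Sketch: estimate a precision/recall curve from many unlabeled scores plus a
# handful of ground-truth labels, under an ASSUMED two-Gaussian score model.
import numpy as np
from scipy.stats import norm
from sklearn.mixture import GaussianMixture

def estimate_performance(scores, labeled_idx, labels, thresholds):
    # Fit a 2-component mixture to ALL scores; no labels needed for this step.
    gmm = GaussianMixture(n_components=2, random_state=0)
    gmm.fit(scores.reshape(-1, 1))
    means = gmm.means_.ravel()
    stds = np.sqrt(gmm.covariances_).ravel()
    weights = gmm.weights_.ravel()

    # Use the few labeled examples only to decide which component is
    # "positive": pick the component with the highest mean responsibility
    # among the labeled positives.
    resp = gmm.predict_proba(scores[labeled_idx].reshape(-1, 1))
    pos = int(resp[labels == 1].mean(axis=0).argmax())
    neg = 1 - pos

    # With the mixture identified, performance at any threshold t follows
    # analytically from the class-conditional Gaussians.
    precisions, recalls = [], []
    for t in thresholds:
        tp = weights[pos] * norm.sf(t, means[pos], stds[pos])
        fp = weights[neg] * norm.sf(t, means[neg], stds[neg])
        recall = norm.sf(t, means[pos], stds[pos])
        precision = tp / (tp + fp) if tp + fp > 0 else 1.0
        precisions.append(precision)
        recalls.append(recall)
    return np.array(precisions), np.array(recalls)

# Toy usage: 10,000 unlabeled scores, only 10 ground-truth labels.
rng = np.random.default_rng(0)
scores = np.concatenate([rng.normal(-1, 1, 7000), rng.normal(1.5, 0.8, 3000)])
labeled_idx = np.concatenate([rng.choice(7000, 5, replace=False),
                              rng.choice(np.arange(7000, 10000), 5, replace=False)])
labels = np.array([0] * 5 + [1] * 5)
prec, rec = estimate_performance(scores, labeled_idx, labels,
                                 np.linspace(-3, 3, 50))
```

Recalibration in the same spirit would replace the classifier's raw score with the posterior probability of the positive class implied by the re-estimated class-conditional distributions, though the paper's actual procedure may differ from this Gaussian-mixture sketch.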
Additional Information
© 2013 IEEE. This work was supported by National Science Foundation grants IIS-0914783 and IIS-1216045, NASA Stennis grant NAS7.03001, ONR MURI grant N00014-10-1-0933, and gifts from Qualcomm and Google.
Attached Files
- Accepted Version - CVPR2013.pdf
- Submitted - 1210.2162.pdf
Files
Name | Size | MD5
---|---|---
CVPR2013.pdf | 1.5 MB | md5:0256246e6548ae78c801495e0aa38146
1210.2162.pdf | 1.1 MB | md5:74a61eb1f4ef64f5167fd80bbe84f1ea
Additional details
- Alternative title
- Semisupervised Classifier Evaluation and Recalibration
- Eprint ID
- 60046
- DOI
- 10.1109/CVPR.2013.419
- Resolver ID
- CaltechAUTHORS:20150903-112410599
- NSF
- IIS-0914783
- NSF
- IIS-1216045
- NASA
- NAS7.03001
- Office of Naval Research (ONR)
- N00014-10-1-0933
- Qualcomm
- Created
- 2015-09-09 (from EPrints datestamp field)
- Updated
- 2021-11-10 (from EPrints last_modified field)