Discriminative Clustering by Regularized Information Maximization
Abstract
Is there a principled way to learn a probabilistic discriminative classifier from an unlabeled data set? We present a framework that simultaneously clusters the data and trains a discriminative classifier, which we call Regularized Information Maximization (RIM). RIM optimizes an intuitive information-theoretic objective function that balances class separation, class balance, and classifier complexity. The approach can flexibly incorporate different likelihood functions, express prior assumptions about the relative sizes of the classes, and make use of partial labels for semi-supervised learning. In particular, we instantiate the framework as unsupervised multi-class kernelized logistic regression. Our empirical evaluation indicates that RIM outperforms existing methods on several real data sets, and demonstrates that RIM is an effective model selection method.
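To make the objective concrete, here is a minimal NumPy sketch of the kind of criterion the abstract describes, assuming a linear multinomial logistic classifier with an ℓ2 complexity penalty; the names (`rim_objective`, `W`, `lam`) are illustrative, not from the paper. The entropy of the average predicted label distribution rewards class balance, low conditional entropy rewards confident (well-separated) assignments, and the penalty controls classifier complexity.

```python
# Illustrative sketch of an RIM-style objective, not the authors' reference code.
import numpy as np

def softmax(z):
    z = z - z.max(axis=1, keepdims=True)  # subtract row max for numerical stability
    e = np.exp(z)
    return e / e.sum(axis=1, keepdims=True)

def rim_objective(W, X, lam=0.1, eps=1e-12):
    """Estimated mutual information between inputs and predicted labels,
    minus an l2 complexity penalty.

    X: (n, d) data matrix; W: (d, k) weights for k clusters/classes.
    """
    P = softmax(X @ W)                   # p(y | x_i), shape (n, k)
    p_bar = P.mean(axis=0)               # marginal distribution over predicted labels
    h_marginal = -np.sum(p_bar * np.log(p_bar + eps))               # high => balanced classes
    h_conditional = -np.mean(np.sum(P * np.log(P + eps), axis=1))   # low => separated classes
    mutual_info = h_marginal - h_conditional
    return mutual_info - lam * np.sum(W ** 2)

# Example: evaluate the objective on random data. In practice one would
# maximize it over W with gradient ascent (e.g., via an autodiff library).
rng = np.random.default_rng(0)
X = rng.normal(size=(100, 2))
W = rng.normal(size=(2, 3))
print(rim_objective(W, X))
```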
Additional Information
©2010 Neural Information Processing Systems. We thank Alex Smola for helpful comments and discussion, and Thanos Siapas for providing the neural tetrode data. This research was partially supported by NSF grant IIS-0953413, a gift from Microsoft Corporation, and ONR MURI Grant N00014-06-1-0734.
Additional details
- Eprint ID: 65825
- Resolver ID: CaltechAUTHORS:20160331-165410183
- Funders: NSF (IIS-0953413), Microsoft Corporation, Office of Naval Research (ONR) (N00014-06-1-0734)
- Created: 2016-04-01
- Updated: 2020-03-09
- Series Name: Advances in Neural Information Processing Systems
- Series Volume or Issue Number: 23