Robust Correction of Sampling Bias Using Cumulative Distribution Functions

Creators: Mazaheri, Bijan; Jain, Siddharth; Bruck, Jehoshua

Style

An error occurred while generating the citation.

Abstract

Varying domains and biased datasets can lead to differences between the training and the target distributions, known as covariate shift. Current approaches for alleviating this often rely on estimating the ratio of training and target probability density functions. These techniques require parameter tuning and can be unstable across different datasets. We present a new method for handling covariate shift using the empirical cumulative distribution function estimates of the target distribution by a rigorous generalization of a recent idea proposed by Vapnik and Izmailov. Further, we show experimentally that our method is more robust in its predictions, is not reliant on parameter tuning and shows similar classification performance compared to the current state-of-the-art techniques on synthetic and real datasets.

Additional Information

This work is supported by supported by the National Science Foundation Graduate Research Fellowship under Grant No. DGE-1745301, NSF Grant No. CCF-1717884 and The Carver Mead New Adventure Fund.

Attached Files

Submitted - etr149.pdf

Files

etr149.pdf

Files (748.3 kB)

Name	Size	Download all
etr149.pdf md5:8b92f5d247a4b8174cca95a3e957ad52	748.3 kB	Preview Download

Additional details

	All versions	This version
Views	41	41
Downloads	13	13
Data volume	9.7 MB	9.7 MB