Abstract
Cross-validation under sample selection bias can, in principle, be done by importance-weighting the empirical risk. However, the importance-weighted risk estimator produces suboptimal hyperparameter estimates in problem settings where large weights arise with high probability. We study its sampling variance as a function of the training data distribution and introduce a control variate to increase its robustness to problematically large weights.
Original language | English |
---|---|
Title of host publication | 2019 IEEE 29th International Workshop on Machine Learning for Signal Processing, MLSP 2019 |
Place of Publication | Piscataway |
Publisher | Institute of Electrical and Electronics Engineers |
Number of pages | 6 |
ISBN (Electronic) | 978-1-7281-0824-7 |
DOIs | |
Publication status | Published - 5 Dec 2019 |
Event | 29th IEEE International Workshop on Machine Learning for Signal Processing, MLSP 2019 - University of Pittsburgh, Pittsburgh, United States Duration: 13 Oct 2019 → 16 Dec 2019 Conference number: 29 https://www.ieeemlsp.cc/ |
Conference
Conference | 29th IEEE International Workshop on Machine Learning for Signal Processing, MLSP 2019 |
---|---|
Abbreviated title | MLSP 2019 |
Country/Territory | United States |
City | Pittsburgh |
Period | 13/10/19 → 16/12/19 |
Internet address |
Keywords
- Sample selection bias
- cross-validation