Samenvatting
Automated Machine Learning (AutoML) systems have been shown to efficiently build good models for new datasets. However, it is often not clear how well they can adapt when the data evolves over time. The main goal of this study is to understand the effect of concept drift on the performance of AutoML methods, and which adaptation strategies can be employed to make them more robust to changes in the underlying data. To that end, we propose 6 concept drift adaptation strategies and evaluate their effectiveness on a variety of AutoML approaches for building machine learning pipelines, including Bayesian optimization, genetic programming, and random search with automated stacking. These are evaluated empirically on real-world and synthetic data streams with different types of concept drift. Based on this analysis, we propose ways to develop more sophisticated and robust AutoML techniques.
| Originele taal-2 | Engels |
|---|---|
| Artikelnummer | 9366792 |
| Pagina's (van-tot) | 3067-3078 |
| Aantal pagina's | 12 |
| Tijdschrift | IEEE Transactions on Pattern Analysis and Machine Intelligence |
| Volume | 43 |
| Nummer van het tijdschrift | 9 |
| DOI's | |
| Status | Gepubliceerd - 1 sep. 2021 |
Bibliografische nota
Publisher Copyright:© 1979-2012 IEEE.
Financiering
The authors would like to thank Erin Ledell, Matthias Feurer and Pieter Gijsbers for their advice on adapting their AutoML systems. This work was supported by the Dutch Science Foundation (NWO) Grant DACCOMPLI (nr. 628.011.022).
Vingerafdruk
Duik in de onderzoeksthema's van 'Adaptation Strategies for Automated Machine Learning on Evolving Data'. Samen vormen ze een unieke vingerafdruk.Citeer dit
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver