Abstract
With the demand for machine learning increasing, so does the demand for tools which make it easier to use. Automated machine learning (AutoML) tools have been developed to address this need, such as the Tree-Based Pipeline Optimization Tool (TPOT) which uses genetic programming to build optimal pipelines. We introduce Layered TPOT, a modification to TPOT which aims to create pipelines equally good as the original, but in significantly less time. This approach evaluates candidate pipelines on increasingly large subsets of the data according to their fitness, using a modified evolutionary algorithm to allow for separate competition between pipelines trained on different sample sizes. Empirical evaluation shows that, on sufficiently large datasets, Layered TPOT indeed finds better models faster.
Original language | English |
---|---|
Title of host publication | Proceedings of the International Workshop on Automatic Selection, Configuration and Composition of Machine Learning Algorithms (AutoML 2017), 10 August 2017, Sydney, Australia |
Subtitle of host publication | Collocated with ECMLPKDD 2017 |
Editors | Pavel Brazdil, Joaquin Vanschoren, Frank Hutter, Holger Hoos |
Publisher | CEUR-WS.org |
Pages | 49-68 |
Number of pages | 20 |
Publication status | Published - 22 Sept 2017 |
Event | 2017 European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases (ECML PKDD 2017) - Skopje, Macedonia, The Former Yugoslav Republic of Duration: 18 Sept 2017 → 22 Sept 2017 http://ecmlpkdd2017.ijs.si/index.html |
Publication series
Name | CEUR Workshop Proceedings |
---|---|
Volume | 1998 |
ISSN (Print) | 1613-0073 |
Conference
Conference | 2017 European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases (ECML PKDD 2017) |
---|---|
Abbreviated title | ECML PKDD 2017 |
Country/Territory | Macedonia, The Former Yugoslav Republic of |
City | Skopje |
Period | 18/09/17 → 22/09/17 |
Internet address |