To tune or not to tune : recommending when to adjust SVM hyper-parameters via Meta-learning

R. Gomes Mantovani, A.L.D. Rossi, J. Vanschoren, B. Bischl, A.C.P.L.F. Carvalho

Research output: Chapter in Book/Report/Conference proceedingConference contributionAcademicpeer-review

55 Citations (Scopus)

Abstract

Many classification algorithms, such as Neural Networks and Support Vector Machines, have a range of hyper-parameters that may strongly affect the predictive performance of the models induced by them. Hence, it is recommended to define the values of these hyper-parameters using optimization techniques. While these techniques usually converge to a good set of values, they typically have a high computational cost, because many candidate sets of values are evaluated during the optimization process. It is often not clear whether this will result in parameter settings that are significantly better than the default settings. When training time is limited, it may help to know when these parameters should definitely be tuned. In this study, we use meta-learning to predict when optimization techniques are expected to lead to models whose predictive performance is better than those obtained by using default parameter settings. Hence, we can choose to employ optimization techniques only when they are expected to improve performance, thus reducing the overall computational cost. We evaluate these meta-learning techniques on more than one hundred data sets. The experimental results show that it is possible to accurately predict when optimization techniques should be used instead of default values suggested by some machine learning libraries
Original languageEnglish
Title of host publication2015 International Joint Conference on Neural Networks (IJCNN), 12-17 July 2015, Killarney, Ireland
Place of PublicationPiscataway
PublisherInstitute of Electrical and Electronics Engineers
Pages1-8
Number of pages8
ISBN (Electronic)978-1-4799-1960-4
ISBN (Print)978-1-4799-1961-1
DOIs
Publication statusPublished - 2015

Fingerprint

Dive into the research topics of 'To tune or not to tune : recommending when to adjust SVM hyper-parameters via Meta-learning'. Together they form a unique fingerprint.

Cite this