Metadata Repositories

Pavel Brazdil (Corresponding author), Jan N. van Rijn, Carlos Soares, Joaquin Vanschoren

Research output: Chapter in Book/Report/Conference proceedingChapterAcademicpeer-review

1 Citation (Scopus)

Abstract

This chapter presents a review of online repositories where researchers can share data, code, and experiments. In particular, it covers OpenML, an online platform for sharing and organizing machine learning data automatically. OpenML contains thousands of datasets and algorithms, and millions of experimental results. We describe the basic philosophy involved, and its basic components: datasets, tasks, flows, setups, runs, and benchmark suites. OpenML has API bindings in various programming languages, making it easy for users to interact with the API in their native language. One important feature of OpenML is the integration into various machine learning toolboxes, such as Scikit-learn, Weka, and mlR. Users of these toolboxes can automatically upload all their results, leading to a large repository of experimental results.

Original languageEnglish
Title of host publicationMetalearning
Subtitle of host publicationApplications to Automated Machine Learning and Data Mining
PublisherSpringer
Chapter16
Pages297-310
Number of pages14
ISBN (Electronic)978-3-030-67024-5
ISBN (Print)978-3-030-67023-8
DOIs
Publication statusPublished - 2022

Publication series

NameCognitive Technologies
ISSN (Print)1611-2482
ISSN (Electronic)2197-6635

Bibliographical note

Publisher Copyright:
© 2022, The Author(s).

Fingerprint

Dive into the research topics of 'Metadata Repositories'. Together they form a unique fingerprint.

Cite this