Abstract
Knowledge present in a domain is well expressed as relationships between corresponding concepts. For example, in zoology, animal species form complex hierarchies; in genomics, the different (parts of) molecules are organized in groups and subgroups based on their functions; plants, molecules, and astronomical objects all form complex taxonomies. Nevertheless, when applying supervised machine learning (ML) in such domains, we commonly reduce the complex and rich knowledge to a fixed set of labels, and induce a model shows good generalization performance with respect to these labels. The main reason for such a reductionist approach is the difficulty in eliciting the domain knowledge from the experts. Developing a label structure with sufficient fidelity and providing comprehensive multi-label annotation can be exceedingly labor-intensive in many real-world applications. In this paper, we provide a method for efficient hierarchical knowledge elicitation (HKE) from experts working with high-dimensional data such as images or videos. Our method is based on psychometric testing and active deep metric learning. The developed models embed the high-dimensional data in a metric space where distances are semantically meaningful, and the data can be organized in a hierarchical structure. We provide empirical evidence with a series of experiments on a synthetically generated dataset of simple shapes, and Cifar 10 and Fashion-MNIST benchmarks that our method is indeed successful in uncovering hierarchical structures.
Original language | English |
---|---|
Title of host publication | Machine Learning and Knowledge Discovery in Databases - European Conference, ECML PKDD 2020, Proceedings |
Editors | Frank Hutter, Kristian Kersting, Jefrey Lijffijt, Isabel Valera |
Publisher | Springer |
Pages | 154-169 |
Number of pages | 16 |
ISBN (Print) | 9783030676605 |
DOIs | |
Publication status | Published - 2021 |
Event | 2020 European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases (ECML PKDD 2020) - Virtual, Online, Ghent, Belgium Duration: 14 Sept 2020 → 18 Sept 2020 https://ecmlpkdd2020.net/ |
Publication series
Name | Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) |
---|---|
Volume | 12458 LNAI |
ISSN (Print) | 0302-9743 |
ISSN (Electronic) | 1611-3349 |
Conference
Conference | 2020 European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases (ECML PKDD 2020) |
---|---|
Abbreviated title | ECML PKDD 2020 |
Country/Territory | Belgium |
City | Ghent |
Period | 14/09/20 → 18/09/20 |
Internet address |
Bibliographical note
Publisher Copyright:© 2021, Springer Nature Switzerland AG.
Keywords
- Active learning
- Deep metric learning
- Hierarchical knowledge elicitation
- Psychometric testing