Organisation profile
Introduction / mission
The chair studies data mining (DM) techniques and knowledge discovery approaches that are at the core of data science. The group is known for its contributions to predictive analytics, automation of machine learning and networked science, subgroup discovery and exceptional model mining, and similarity computations on complex data. Its research is inspired by theoretical computer science, systems development and real-world applications of (big) data-driven discovery in healthcare, banking, energy, retail, telecom, and education, among others.
Organisation profile
We develop generic approaches and specialized techniques that cover a wide range of descriptive, predictive and prescriptive analytics and work effectively with text, image, transactional, graph and time-series data in a responsible manner. For example, we use deep learning methods to develop models for high-dimensional, heterogeneous, unstructured and evolving data, and apply these models to areas such as medical imaging, genomics, anomaly detection and sentiment analysis. We also work on methods for analyzing and explaining a model's decisions and performance, and we facilitate effective DM with domain experts in the loop.
Success stories
We have created OpenML, an online collaborative platform for studying machine learning techniques. OpenML is used by almost 2,000 researchers, students, and practitioners worldwide, and contains around 20,000 datasets, 3,000 machine learning workflows, and 1.7 million shared experiments. It has won the Dutch Data Prize and received backing from Microsoft Research. It is crucial for the development of automated machine learning, which is adopted by companies such as Philips.
Further information at OpenML.org
- NWO RATE-Analytics (with Tilburg University, Rabobank and Achmea) "Next generation predictive analytics for data-driven banking and insurance".
- ImpulseKYC-Analytics (with Rabobank) "Know your customer predictive analytics" aims to develop approaches for effective DM on heterogeneous and evolving data sources with experts in the loop.
- STW CAPA (with Adversitement and StudyPortals) "Context-aware predictive analytics" advanced the state of the art in Web analytics.
- NWO Veni "Detection methods for similarity structures in time-dependent data" develops foundations for advanced clustering of time series and trajectories.
- H2020 SODA (ICT-2016-1; Big Data PPP) "Scalable Oblivious Data Analytics" facilitates secure DM; together with the Crypto group, we develop practical approaches for DM with multi-party computation.
Profiles
-
Adam Arafan
- Mathematics and Computer Science, Data Mining - Doctoral Candidate
Person: Prom. : doctoral candidate (PhD)
-
Elahe Arani
- Mathematics and Computer Science, Data Mining - Assistant Professor
Person: UD : Assistant Professor
-
Aurélien Boland
- Mathematics and Computer Science, Data Mining - Doctoral Candidate-TA
Person: OWP : University Teacher / Researcher
Projects
-
Smart One W&I TKI KPN Flagship
Pechenizkiy, M. (Project Manager) & d'Hondt, T. (Project member)
1/08/18 → 31/07/22
Project: Research direct
-
Interoperability of Heterogeneous IoT Platforms
Mocanu, D. C. (Project member) & Exarchakos, G. (Project Manager)
1/01/16 → 31/12/18
Project: Research direct
Research output
-
Advances and Challenges in Meta-Learning: A Technical Review
Vettoruzzo, A. (Corresponding author), Bouguelia, M. R., Vanschoren, J., Rognvaldsson, T. & Santosh, K. C., 1 Jul 2024, In: IEEE Transactions on Pattern Analysis and Machine Intelligence. 46, 7, p. 4763-4779 17 p.
Research output: Contribution to journal › Article › Academic › peer-review
Open Access, 12 Citations (Scopus), 3 Downloads (Pure)
-
Better trees: an empirical study on hyperparameter tuning of classification decision tree induction algorithms
Gomes Mantovani, R. (Corresponding author), Horváth, T., Rossi, A. L. D., Cerri, R., Barbon Junior, S., Vanschoren, J. & De Carvalho, A. C. P. L. F., May 2024, In: Data Mining and Knowledge Discovery. 38, 3, p. 1364-1416 53 p.
Research output: Contribution to journal › Article › Academic › peer-review
4 Citations (Scopus)
-
Can Fairness be Automated? Guidelines and Opportunities for Fairness-aware AutoML
Weerts, H., Pfisterer, F., Feurer, M., Eggensperger, K., Bergman, E., Awad, N., Vanschoren, J., Pechenizkiy, M., Bischl, B. & Hutter, F., 2024, In: Journal of Artificial Intelligence Research. 79, p. 640-677 39 p.
Research output: Contribution to journal › Article › Academic › peer-review
Open Access, 28 Downloads (Pure)
Datasets
-
Microscope images of human cancer cell lines (U2OS and HL-60)
Lavitt, F. (Creator), Rijlaarsdam, D. J. (Creator), van der Linden, D. (Creator), Weglarz-Tomczak, E. (Contributor) & Tomczak, J. M. (Creator), Zenodo, 8 Jan 2021
Dataset
-
Random forest models for gene expression experiments in Transformational Machine Learning
Soldatova, L. N. (Creator), King, R. D. (Creator), Davis, A. M. (Creator), Dash, T. (Creator), Vanschoren, J. (Creator), Olier, I. (Creator) & Orhobor, O. I. (Creator), SciLifeLab, 10 Jan 2022
DOI: 10.17044/scilifelab.16837084
Dataset
-
Histopathology data of bone marrow biopsies (HistBMP or HistMNIST)
Tomczak, J. (Contributor), Zenodo, 18 Mar 2018
Dataset
Prizes
-
Best Demo Paper Award of IEEE ICDE 2023
Halstead, B. (Recipient), Koh, Y. S. (Recipient), Riddle, P. (Recipient), Pechenizkiy, M. (Recipient) & Bifet, A. (Recipient), 2023
Prize: Other › Career, activity or publication related prizes (lifetime, best paper, poster etc.) › Scientific
-
Best Paper Award ICPM 2021
Menkovski, V. (Recipient), Sommers, D. (Recipient) & Fahland, D. (Recipient), 4 Nov 2021
Prize: Other › Career, activity or publication related prizes (lifetime, best paper, poster etc.) › Scientific
-
Best Paper Award of ALA 2022
Sokar, G. (Recipient), Mocanu, E. (Recipient), Mocanu, D. C. (Recipient), Pechenizkiy, M. (Recipient) & Stone, P. (Recipient), 2022
Prize: Other › Career, activity or publication related prizes (lifetime, best paper, poster etc.) › Scientific
Activities
-
2020 European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases (ECML PKDD 2020)
Liu, S. (Organiser)
13 Sept 2020 → 18 Sept 2020
Activity: Participating in or organising an event types › Conference › Scientific
-
Machine Learning, better, together.
Vanschoren, J. (Speaker)
8 Dec 2018
Activity: Talk or presentation types › Invited talk › Scientific
-
Tutorial on Automatic Machine Learning
Hutter, F. (Speaker) & Vanschoren, J. (Speaker)
3 Dec 2018
Activity: Talk or presentation types › Keynote talk › Scientific
Press/Media
-
Reports from University of Technology Sydney Add New Data to Findings in Technology (Reinforcement Learning With Multiple Relational Attention for Solving Vehicle Routing Problems)
1/09/23
1 item of Media coverage
Press/Media: Expert Comment
-
New Gels Research Study Findings Recently Were Reported by a Researcher at Huaqiao University (Drying Process of HPMC-Based Hard Capsules: Visual Experiment and Mathematical Modeling)
Yang, Y. & Yang, Y.-C.
16/06/23
1 item of Media coverage
Press/Media: Expert Comment
-
Huaqiao University Researcher Describes Findings in Plasticizers (Enhancing Pullulan Soft Capsules with a Mixture of Glycerol and Sorbitol Plasticizers: A Multi-Dimensional Study)
Yang, Y. & Yang, Y.-C.
30/05/23
1 item of Media coverage
Press/Media: Expert Comment
Student theses
-
3D Face Reconstruction Using Deep Learning
Jawahar, P. (Author), Medeiros de Carvalho, R. (Supervisor 1), Gallucci, A. (Supervisor 2) & Vanschoren, J. (Supervisor 2), 20 Jan 2020
Student thesis: Master
-
Achieving Long Term Fairness through Curiosity Driven Reinforcement Learning: How intrinsic motivation influences fairness in algorithmic decision making
van der Wee, W. J. (Author), Pechenizkiy, M. (Supervisor 1), Gajane, P. (Supervisor 2) & Kapodistria, S. (Supervisor 2), 28 Aug 2023
Student thesis: Master
-
Activity Recognition Using Deep Learning in Videos under Clinical Setting
Srinivasan, V. (Author), Duivesteijn, W. (Supervisor 1), Papapetrou, O. (Supervisor 2), Zhang, L. (External coach) & Vasu, J. D. (External coach), 28 Jan 2020
Student thesis: Master