Binary disease prediction using tail quantiles of the distribution of continuous biomarkers

Michiel H.J. Paus (Corresponding author), Edwin R. van den Heuvel, Marc J.M. Meddens

Onderzoeksoutput: Bijdrage aan tijdschriftTijdschriftartikelAcademicpeer review


In the analysis of binary disease classification, numerous techniques exist, but they merely work well for mean differences in biomarkers between cases and controls. Biological processes are, however, much more heterogeneous, and differences could also occur in other distributional characteristics (e.g. variances, skewness). Many machine learning techniques are better capable of utilizing these higher-order distributional differences, sometimes at cost of explainability. In this study, we propose quantile based prediction (QBP), a binary classification method based on the selection of multiple continuous biomarkers and using the tail differences between biomarker distributions of cases and controls. The performance of QBP is compared to supervised learning methods using extensive simulation studies, and two case studies: major depression disorder (MDD) and trisomy. QBP outperformed alternative methods when biomarkers predominantly show variance differences between cases and controls, especially in the MDD case study. More research is needed to further optimise QBP.

Originele taal-2Engels
Pagina's (van-tot)56-87
Aantal pagina's32
TijdschriftJournal of Nonparametric Statistics
Nummer van het tijdschrift1
Vroegere onlinedatum28 nov. 2022
StatusGepubliceerd - jan. 2023

Bibliografische nota

Publisher Copyright:
© 2022 American Statistical Association and Taylor & Francis.


Duik in de onderzoeksthema's van 'Binary disease prediction using tail quantiles of the distribution of continuous biomarkers'. Samen vormen ze een unieke vingerafdruk.

Citeer dit