TY - JOUR
T1 - Unsupervised representation learning and anomaly detection in ECG sequences
AU - Pereira, João
AU - Silveira, Margarida
PY - 2019
Y1 - 2019
N2 - While the big data revolution takes place, large amounts of electronic health records, such as electrocardiograms (ECGs) and vital signs data, have become available. These signals are often recorded as time series of observations and are now easier to obtain. In particular, with the arise of smart devices that can perform ECG, there is the quest for developing novel approaches that allow to monitor these signals efficiently, and quickly detect anomalies. However, since most data generated remains unlabelled, the task of anomaly detection is still very challenging. Unsupervised representation learning using deep generative models (e.g., variational autoencoders) has been used to learn expressive feature representations of sequences that can make downstream tasks, such as anomaly detection, easier to execute and more accurate. We propose an approach for unsupervised representation learning of ECG sequences using a variational autoencoder parameterised by recurrent neural networks, and use the learned representations for anomaly detection using multiple detection strategies. We tested our approach on the ECG5000 electrocardiogram dataset of the UCR time series classification archive. Our results show that the proposed approach is able to learn expressive representations of ECG sequences, and to detect anomalies with scores that outperform other both supervised and unsupervised methods.
AB - While the big data revolution takes place, large amounts of electronic health records, such as electrocardiograms (ECGs) and vital signs data, have become available. These signals are often recorded as time series of observations and are now easier to obtain. In particular, with the arise of smart devices that can perform ECG, there is the quest for developing novel approaches that allow to monitor these signals efficiently, and quickly detect anomalies. However, since most data generated remains unlabelled, the task of anomaly detection is still very challenging. Unsupervised representation learning using deep generative models (e.g., variational autoencoders) has been used to learn expressive feature representations of sequences that can make downstream tasks, such as anomaly detection, easier to execute and more accurate. We propose an approach for unsupervised representation learning of ECG sequences using a variational autoencoder parameterised by recurrent neural networks, and use the learned representations for anomaly detection using multiple detection strategies. We tested our approach on the ECG5000 electrocardiogram dataset of the UCR time series classification archive. Our results show that the proposed approach is able to learn expressive representations of ECG sequences, and to detect anomalies with scores that outperform other both supervised and unsupervised methods.
KW - Anomaly detection
KW - Bioinformatics
KW - Clustering
KW - Data mining
KW - Deep learning
KW - Electrocardiogram
KW - Healthcare
KW - Recurrent neural networks
KW - Representation learning
KW - Time series
KW - Unsupervised learning
KW - Variational autoencoders
UR - http://www.scopus.com/inward/record.url?scp=85070471941&partnerID=8YFLogxK
U2 - 10.1504/IJDMB.2019.101395
DO - 10.1504/IJDMB.2019.101395
M3 - Article
AN - SCOPUS:85070471941
VL - 22
SP - 389
EP - 407
JO - International Journal of Data Mining and Bioinformatics
JF - International Journal of Data Mining and Bioinformatics
SN - 1748-5673
IS - 4
ER -