Abstract
Data streams are related to large amounts of data that can continuously arrive with a probability distribution that may change over time. Depending on the changes in the data distribution, different phenomena can occur, like new classes can appear or concept drift can occur in existing classes. Machine Learning algorithms have been often used to model this data. New classes are patterns that were not seen during the training of the current classification model, but appear after some time. Concept drift occurs when the concepts associated with a dataset change as new data arrive. This paper proposes a new algorithm based on kNN that uses micro-clusters as prototypes and incrementally updates the micro-clusters or creates new micro-clusters when novelties are detected. In the online phase, each instance close to a micro-cluster is considered an extension of the micro-cluster, being used to adapt the model to concept drift. The proposed algorithm is experimentally compared with a state-of-the-art classifier from the data stream literature and one baseline. According to the experimental results, the proposed algorithm increases the predictive performance over time by incrementally learning changes in the data distribution.
Original language | English |
---|---|
Title of host publication | Progress in Artificial Intelligence - 19th EPIA Conference on Artificial Intelligence, EPIA 2019, Proceedings |
Editors | Paulo Moura Oliveira, Paulo Novais, Luís Paulo Reis |
Publisher | Springer |
Pages | 448-459 |
Number of pages | 12 |
ISBN (Print) | 9783030302436 |
DOIs | |
Publication status | Published - 2019 |
Externally published | Yes |
Event | 19th EPIA Conference on Artificial Intelligence, EPIA 2019 - Vila Real, Portugal Duration: 3 Sept 2019 → 6 Sept 2019 |
Publication series
Name | Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) |
---|---|
Volume | 11805 LNAI |
ISSN (Print) | 0302-9743 |
ISSN (Electronic) | 1611-3349 |
Conference
Conference | 19th EPIA Conference on Artificial Intelligence, EPIA 2019 |
---|---|
Country/Territory | Portugal |
City | Vila Real |
Period | 3/09/19 → 6/09/19 |
Bibliographical note
Publisher Copyright:© 2019, Springer Nature Switzerland AG.
Keywords
- Concept drift
- Data stream
- Novelty detection
- Online learning