Activities per year
Abstract
Signature extraction is a key part of forensic log analysis. It involves recognizing patterns in log lines such that log lines that originated from the same line of code are grouped together. A log signature consists of immutable parts and mutable parts. The immutable parts define the signature, and the mutable parts are typically variable parameter values. In practice, the number of log lines and signatures can be quite large, and the task of detecting and aligning immutable parts of the logs to extract the signatures becomes a significant challenge. We propose a novel method based on a neural language model that outperforms the current state-of-the-art on signature extraction. We use an RNN auto-encoder to create an embedding of the log lines. Log lines embedded in such a way can be clustered to extract the signatures in an unsupervised manner.
Original language | English |
---|---|
Title of host publication | Machine Learning and Knowledge Discovery in Databases |
Subtitle of host publication | European Conference, ECML PKDD 2017, Skopje, Macedonia, September 18–22, 2017, Proceedings, Part III |
Editors | Michelangelo Ceci, Saso Dzeroski, Donato Malerba, Yasemin Altun, Kamalika Das, Jesse Read, Marinka Zitnik, Jerzy Stefanowski, Taneli Mielikäinen |
Place of Publication | Dordrecht |
Publisher | Springer |
Pages | 305-316 |
Number of pages | 12 |
ISBN (Electronic) | 978-3-319-71273-4 |
ISBN (Print) | 978-3-319-71272-7 |
DOIs | |
Publication status | Published - 2017 |
Event | 2017 European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases (ECML PKDD 2017) - Skopje, Macedonia, The Former Yugoslav Republic of Duration: 18 Sept 2017 → 22 Sept 2017 http://ecmlpkdd2017.ijs.si/index.html |
Publication series
Name | Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) |
---|---|
Volume | 10536 LNAI |
ISSN (Print) | 0302-9743 |
ISSN (Electronic) | 1611-3349 |
Conference
Conference | 2017 European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases (ECML PKDD 2017) |
---|---|
Abbreviated title | ECML PKDD 2017 |
Country/Territory | Macedonia, The Former Yugoslav Republic of |
City | Skopje |
Period | 18/09/17 → 22/09/17 |
Internet address |
Keywords
- Information forensic
- Log clustering
- Neural language model
- RNN auto-encoder
- Signature extraction
Fingerprint
Dive into the research topics of 'Unsupervised signature extraction from forensic logs'. Together they form a unique fingerprint.Activities
- 1 Contributed talk
-
European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases (ECML/PKDD)
Stefan Thaler (Speaker)
18 Sept 2017 → 22 Sept 2017Activity: Talk or presentation types › Contributed talk › Scientific