MTFL: Multi-Timescale Feature Learning for Weakly-supervised Anomaly Detection in Surveillance Videos

Yiling Zhang, Erkut Akdag (Corresponding author), Egor Bondarev, Peter H.N. de With

Research output: Chapter in Book/Report/Conference proceedingConference contributionAcademicpeer-review

Abstract

Detection of anomaly events is relevant for public safety and requires a combination of fine-grained motion information and contextual events at variable time-scales. To this end, we propose a Multi-Timescale Feature Learning (MTFL) method to enhance the representation of anomaly features. Short, medium, and long temporal tubelets are employed to extract spatio-temporal video features using a Video Swin Transformer. Experimental results demonstrate that MTFL outperforms state-of-the-art methods on the UCF-Crime dataset, achieving an anomaly detection performance 89.78% AUC. Moreover, it performs complementary to SotA with 95.32% AUC on the ShanghaiTech and 84.57% AP on the XD-Violence dataset. Furthermore, we generate an extended dataset of the UCF-Crime for development and evaluation on a wider range of anomalies, namely Video Anomaly Detection Dataset (VADD), involving 2,591 videos in 18 classes with extensive coverage of realistic anomalies.

Original languageEnglish
Title of host publicationSeventeenth International Conference on Machine Vision, ICMV 2024
EditorsWolfgang Osten
PublisherSPIE
Number of pages8
ISBN (Electronic)9781510688285
ISBN (Print)9781510688278
DOIs
Publication statusPublished - 24 Feb 2025
Event17th International Conference on Machine Vision, ICMV 2024 - Edinburg, United Kingdom
Duration: 10 Oct 202413 Oct 2024

Publication series

NameProceedings of SPIE - The International Society for Optical Engineering
Volume13517
ISSN (Print)0277-786X
ISSN (Electronic)1996-756X

Conference

Conference17th International Conference on Machine Vision, ICMV 2024
Country/TerritoryUnited Kingdom
CityEdinburg
Period10/10/2413/10/24

Keywords

  • Anomaly detection
  • Surveillance videos
  • Video understanding

Fingerprint

Dive into the research topics of 'MTFL: Multi-Timescale Feature Learning for Weakly-supervised Anomaly Detection in Surveillance Videos'. Together they form a unique fingerprint.

Cite this