MT-CMVAD: A Multi-Modal Transformer Framework for Cross-Modal Video Anomaly Detection
Video anomaly detection (VAD) faces significant challenges in multimodal semantic alignment and long-term temporal modeling within open surveillance scenarios. Existing methods are often plagued by modality discrepancies and fragmented temporal reasoning. To address these issues, we introduce MT-CMV...
Main Authors: | Hantao Ding, Shengfeng Lou, Hairong Ye, Yanbing Chen |
---|---|
Format: | Article |
Language: | English |
Published: | MDPI AG, 2025-06-01 |
Series: | Applied Sciences |
Online Access: | https://www.mdpi.com/2076-3417/15/12/6773 |
Similar Items
- Video anomaly detection via cross-modal fusion and hyperbolic graph attention mechanism
  by: JIANG Di, et al.
  Published: (2025-06-01)
- Elderly Location Monitoring System with LoRa (iLocation)
  by: Nurhaziqah Izzati Hamzah, et al.
  Published: (2024-05-01)
- LoRa: A Proposed Connectivity Technology for Internet of Things Applications in the Kurdistan Region of Iraq
  by: Sarko Salahadin Ahmad, et al.
  Published: (2021-12-01)
- DCLMA: Deep correlation learning with multi-modal attention for visual-audio retrieval
  by: Jiwei Zhang, et al.
  Published: (2025-09-01)
- LoRa Propagation and Coverage Measurements in Underground Potash Salt Room-and-Pillar Mines
  by: Marius Theissen, et al.
  Published: (2025-06-01)