Xd-Violence

2023

AVadCLIP: Audio-Visual Collaboration for Robust Video Anomaly Detection

1 October 2023·8733 words·41 mins

Peng Wu , Wanshun Su , Guansong Pang , Yujia Sun , Qingsen Yan , Peng Wang , Yanning Zhang

Xd-Violence Ucf-Crime Shanghaitech Weakly Supervised Hybrid Method

A novel weakly supervised framework leveraging audio-visual collaboration to improve the robustness and accuracy of video anomaly detection.

Anomize: Better Open Vocabulary Video Anomaly Detection

1 October 2023·6692 words·32 mins

Fei Li , Wenxuan Liu , Jingjing Chen , Ruixu Zhang , Yuran Wang , Xian Zhong , Zheng Wang

Ucf-Crime Xd-Violence Hybrid Method

The paper introduces the Anomize framework that addresses detection ambiguity and categorization confusion in open vocabulary video anomaly detection (OVVAD) by leveraging visual and textual data augmentation, dual-stream mechanisms, and label relation guidance, achieving superior performance on multiple datasets.

Aligning Effective Tokens with Video Anomaly in Large Language Models

1 October 2023·8317 words·40 mins

Yingxian Chen , Jiahui Liu , Ruidi Fan , Yanwei Li , Chirui Chang , Shizhen Zhao , Wilton W.T.Fok , Xiaojuan Qi , Yik-Chung Wu

Xd-Violence Hybrid Other

Proposes VA-GPT, a multimodal Large Language Model for video anomaly detection and understanding, utilizing effective token selection and generation modules (SETS and TETG) to improve spatial and temporal localization of anomalies. Introduces instruct-following fine-tuning data and cross-domain benchmarks for robustness evaluation.

A Survey on Video Anomaly Detection via Deep Learning: Human, Vehicle, and Environment

1 October 2023·18514 words·87 mins

Ghazal Alinezhad Noghre , Armin Danesh Pazho , Hamed Tabkhi

Cuhk-Avenue Shanghaitech Xd-Violence Ucf-Crime Ucsd-Ped Other Semi Supervised Unsupervised Instruction Tuning Hybrid Survey

This survey provides a comprehensive overview of deep learning-based Video Anomaly Detection (VAD), covering challenges, methodologies, domain-specific applications, and future research directions across human-centric, vehicle-centric, and environment-centric contexts. It introduces a taxonomy of supervision levels, adaptive learning strategies, and explores diverse application areas including healthcare, public safety, road surveillance, and disaster detection, emphasizing the latest advancements and open challenges.

SlowFastVAD: Video Anomaly Detection via Integrating Simple Detector and RAG-Enhanced Vision-Language Model

1 May 2023·9715 words·46 mins

Zongcan Ding , Guansong Pang , Haodong Zhang , Zhiwei Yang , Yanning Zhang , Peng Wu , Peng Wang , Jing Liu , Fang Shen , Changkang Li

Ucsd-Ped Shanghaitech Xd-Violence Ubnormal Semi Supervised Hybrid Method

Proposes a hybrid framework that integrates a fast anomaly detector with a slow, RAG-enhanced vision-language model to improve efficiency and interpretability in video anomaly detection. It employs a retrieval-augmented reasoning module for better scene-specific adaptation, uses an entropy-based intervention strategy to select ambiguous segments for slow detector analysis, and fuses outputs for robust detection.

TEVAD: Improved video anomaly detection with captions

1 January 2023·7563 words·36 mins

Weiling Chen , Keng Teck Ma , Zi Jian Yew , Minhoe Hur , David Aik-Aun Khoo

Shanghaitech Ucf-Crime Xd-Violence Ucsd-Ped Weakly Supervised Method

Proposes a framework that utilizes both visual and text features, generated through dense video captions, to enhance anomaly detection performance and explainability in videos.

Generating Anomalies for Video Anomaly Detection with Prompt-based Feature Mapping

1 January 2023·8035 words·38 mins

Zuhao Liu , Xiao-Ming Wu , Dian Zheng , Kun-Yu Lin , Wei-Shi Zheng

Shanghaitech Xd-Violence Ucf-Crime Hybrid Method

The paper proposes a prompt-based feature mapping framework (PFMF) to generate unseen anomalies with unbounded types and narrow the scene gap for video anomaly detection, outperforming state-of-the-art methods on multiple datasets.

Delving into CLIP latent space for Video Anomaly Recognition

1 January 2023·11434 words·54 mins

Luca Zanella , Benedetta Liberatori , Willi Menapace , Fabio Poiesi , Yiming Wang , Elisa Riccia

Shanghaitech Ucf-Crime Xd-Violence Semi Supervised Other

Proposes AnomalyCLIP, a novel method leveraging Large Language and Vision (LLV) models like CLIP, combined with multiple instance learning and a re-centring transformation of the CLIP feature space, to detect and classify video anomalies and recognize anomaly types. Introduces a Selector model with prompt learning and a Temporal Transformer-based model for temporal dependency modeling; demonstrates state-of-the-art performance on multiple benchmarks.

↑