Ucsd-Ped

2025

Networking Systems for Video Anomaly Detection: A Tutorial and Survey

1 April 2025·21983 words·104 mins

Jing Liu , Yang Liu , Jieyu Lin , Jielin Li , Liang Cao , Peng Sun , Bo Hu , Liang Song , Azzedine Boukerche , Victor C.M. Leung

Cuhk-Avenue Shanghaitech Xd-Violence Ubnormal Ucf-Crime Ucsd-Ped Hybrid Survey

A comprehensive survey and tutorial exploring the assumptions, frameworks, recent advances, applications, and future trends of Networking Systems for Video Anomaly Detection (NSVAD), emphasizing the integration of AI, IoVT, and computing for real-world deployable systems.

Personalizing Vision-Language Models With Hybrid Prompts for Zero-Shot Anomaly Detection

13 February 2025·8885 words·42 mins

Yunkang Cao , Xiaohao Xu , Yuqi Cheng , Chen Sun , Zongwei Du , Liang Gao , Weiming Shen

Cuhk-Avenue Shanghaitech Xd-Violence Ubnormal Ucf-Crime Ucsd-Ped Other Weakly Supervised Semi Supervised Training Free Instruction Tuning Unsupervised Hybrid Other

Introduces AnomalyVLM, a framework that personalizes vision-language models for zero-shot anomaly detection using hybrid prompts derived from prior knowledge. It incorporates an anomaly region generator and refiner, and the hybrid prompts provide category-specific customization that improves detection performance.

2024

Text-Driven Traffic Anomaly Detection With Temporal High-Frequency Modeling in Driving Videos

17 April 2024·10204 words·48 mins

Rongqin Liang , Yuanman Li , Jiantao Zhou , Xia Li

Cuhk-Avenue Shanghaitech Xd-Violence Ubnormal Ucf-Crime Ucsd-Ped Other Hybrid Other

The paper introduces TTHF, a single-stage method that aligns video clips with text prompts for traffic anomaly detection. It models high frequency in the temporal domain to capture dynamic changes in driving scenes, and proposes an attentive anomaly focusing mechanism to improve detection of various traffic anomalies. The approach reports superior performance on benchmark datasets.

2023

Video Anomaly Detection in 10 Years: A Survey and Outlook

1 October 2023·18854 words·89 mins

Moshira Abdalla , Sajid Javed , Muaz Al Radi , Anwaar Ulhaq , Naoufel Werghi

Shanghaitech Xd-Violence Ucf-Crime Ucsd-Ped Other Hybrid Survey

A comprehensive survey exploring deep learning-based video anomaly detection, including emerging paradigms such as weakly supervised, self-supervised, and unsupervised approaches, with a focus on core challenges, feature extraction, supervision schemes, loss functions, regularization techniques, and the potential of vision-language models (VLMs) for enhanced anomaly detection.

VADSK: Video Anomaly Detection With Structured Keywords

1 October 2023·6806 words·32 mins

Thomas Foltz

Ucsd-Ped Shanghaitech Cuhk-Avenue Semi Supervised Instruction Tuning Method

A lightweight, interpretable, two-stage video anomaly detection pipeline that uses foundation models for frame description generation and keyword-based classification, achieving performance comparable to state-of-the-art methods with real-time inference and enhanced interpretability.

Vad-R1: Towards Video Anomaly Reasoning via Perception-to-Cognition Chain-of-Thought

1 October 2023·11169 words·53 mins

Chao Huang , Benfeng Wang , Jie Wen , Chengliang Liu , Wei Wang , Li Shen , Xiaochun Cao

Shanghaitech Xd-Violence Ubnormal Ucf-Crime Ucsd-Ped Other Hybrid Method

Proposes a structured Perception-to-Cognition Chain-of-Thought and introduces the Vad-Reasoning dataset, along with an improved reinforcement learning algorithm, AVA-GRPO, to enhance the deep reasoning capabilities of Multimodal Large Language Models in video anomaly detection and understanding tasks.

SUVAD: Semantic Understanding Based Video Anomaly Detection Using MLLM

1 October 2023·4313 words·21 mins

Shibo Gao , Peipei Yang , Linlin Huang

Ucf-Crime Xd-Violence Shanghaitech Ucsd-Ped Other Semi Supervised Training Free Method

Proposes a training-free video anomaly detection method leveraging multi-modal large language models for semantic understanding of videos, enabling scene generalization, interpretability, and flexible anomaly definition without retraining.

Learning to Understand Open-World Video Anomalies

1 October 2023·11409 words·54 mins

Jiaqi Tang , Hao Lu , Ruizheng Wu , Xiaogang Xu , Ke Ma , Cheng Fang , Bin Guo , Jiangbo Lu , Qifeng Chen , Ying-Cong Chen

Shanghaitech Cuhk-Avenue Xd-Violence Ubnormal Ucf-Crime Ucsd-Ped Other Hybrid Other

Introduces HAWK, a novel framework leveraging interactive large Visual Language Models with explicit and implicit motion modality integration, auxiliary consistency loss, and detailed language annotations for diverse video anomaly scenarios. Demonstrates state-of-the-art performance in video description and question-answering tasks across multiple open-world datasets, with extensive annotated data and generation pipelines to enhance practical anomaly understanding and interaction capabilities.

Language-guided Open-world Video Anomaly Detection

1 October 2023·6686 words·32 mins

Zihao Liu , Xiaoyu Wu , Jianqin Wu , Xuxu Wang , Linlin Yang

Ucf-Crime Xd-Violence Ubnormal Ucsd-Ped Other Semi Supervised Unsupervised Hybrid Application

Proposes a novel open-world VAD paradigm guided by natural language, with a dynamic anomaly definition, regularization strategies, and a large-scale dataset (PreVAD) with multi-level annotations and descriptions. Achieves state-of-the-art zero-shot performance on seven datasets.

Follow the Rules: Reasoning for Video Anomaly Detection with Large Language Models

1 October 2023·13548 words·64 mins

Yuchen Yang , Kwonjoon Lee , Behzad Dariush , Yinzhi Cao , Shao-Yuan Lo

Shanghaitech Ucf-Crime Ucsd-Ped Other Hybrid Method

Proposes a rule-based reasoning framework, AnomalyRuler, for video anomaly detection using large language models, enabling fast scenario adaptation with few-normal-shot prompting and enhanced robustness through strategic modules.

A Survey on Video Anomaly Detection via Deep Learning: Human, Vehicle, and Environment

1 October 2023·18514 words·87 mins

Ghazal Alinezhad Noghre , Armin Danesh Pazho , Hamed Tabkhi

Cuhk-Avenue Shanghaitech Xd-Violence Ucf-Crime Ucsd-Ped Other Semi Supervised Unsupervised Instruction Tuning Hybrid Survey

This survey provides a comprehensive overview of deep learning-based Video Anomaly Detection (VAD), covering challenges, methodologies, domain-specific applications, and future research directions across human-centric, vehicle-centric, and environment-centric contexts. It introduces a taxonomy of supervision levels, covers adaptive learning strategies, and explores diverse application areas including healthcare, public safety, road surveillance, and disaster detection, emphasizing the latest advancements and open challenges.

SlowFastVAD: Video Anomaly Detection via Integrating Simple Detector and RAG-Enhanced Vision-Language Model

1 May 2023·9715 words·46 mins

Zongcan Ding , Guansong Pang , Haodong Zhang , Zhiwei Yang , Yanning Zhang , Peng Wu , Peng Wang , Jing Liu , Fang Shen , Changkang Li

Ucsd-Ped Shanghaitech Xd-Violence Ubnormal Semi Supervised Hybrid Method

Proposes a hybrid framework that integrates a fast anomaly detector with a slow, RAG-enhanced vision-language model to improve efficiency and interpretability in video anomaly detection. It employs a retrieval-augmented reasoning module for better scene-specific adaptation, uses an entropy-based intervention strategy to select ambiguous segments for slow detector analysis, and fuses outputs for robust detection.

TEVAD: Improved video anomaly detection with captions

1 January 2023·7563 words·36 mins

Weiling Chen , Keng Teck Ma , Zi Jian Yew , Minhoe Hur , David Aik-Aun Khoo

Shanghaitech Ucf-Crime Xd-Violence Ucsd-Ped Weakly Supervised Method

Proposes a framework that utilizes both visual and text features, generated through dense video captions, to enhance anomaly detection performance and explainability in videos.

Hierarchical Semantic Contrast for Scene-aware Video Anomaly Detection

1 January 2023·7920 words·38 mins

Shengyang Sun , Xiaojin Gong

Ucsd-Ped Shanghaitech Other Semi Supervised Other

The paper proposes a hierarchical semantic contrast (HSC) method that leverages scene-aware autoencoders, semantic contrastive learning, and motion augmentation for improved scene-dependent and scene-independent video anomaly detection. It incorporates pre-trained video parsing models, hierarchical contrastive learning at scene and object levels, and skeleton-based motion augmentation to make the normal feature representations more compact and discriminative, thereby enhancing anomaly detection performance.
