Yushu Shi on sis-arxiv-vad-papers

https://phuchoang2603.github.io/sis-arxiv-vad-papers/authors/yushu-shi/

Ex-VAD: Explainable Fine-grained Video Anomaly Detection Based on Visual-Language Models
https://phuchoang2603.github.io/sis-arxiv-vad-papers/papers/3552_ex_vad_explainable_fine_g/
Wed, 01 Jan 2025 00:00:00 +0000

The paper introduces Ex-VAD, a comprehensive framework for fine-grained and explainable video anomaly detection that leverages visual-language models (VLMs) and large language models (LLMs). It features modules for generating anomaly explanations, fusing multimodal features for coarse detection, and expanding/aligning labels for fine-grained classification, with improved interpretability and accuracy demonstrated on UCF-Crime and XD-Violence datasets.