Liyun Zhu on sis-arxiv-vad-papers

Liyun Zhu on sis-arxiv-vad-papershttps://phuchoang2603.github.io/sis-arxiv-vad-papers/authors/liyun-zhu/Recent content in Liyun Zhu on sis-arxiv-vad-papersHugo -- gohugo.ioenSun, 01 Oct 2023 00:00:00 +0000VAU-R1: Advancing Video Anomaly Understanding via Reinforcement Fine-Tuninghttps://phuchoang2603.github.io/sis-arxiv-vad-papers/papers/vau-r1-advancing-video-anomaly-understanding-via-reinforcement-fine-tuning/Sun, 01 Oct 2023 00:00:00 +0000https://phuchoang2603.github.io/sis-arxiv-vad-papers/papers/vau-r1-advancing-video-anomaly-understanding-via-reinforcement-fine-tuning/Introduces VAU-R1, a reinforcement fine-tuning framework leveraging Group Relative Policy Optimization (GRPO) to enhance multimodal large language models’ (MLLMs) reasoning capabilities in video anomaly understanding (VAU). Develops VAUBench, a comprehensive Chain-of-Thought benchmark with rich annotations across perception, grounding, reasoning, and classification tasks, supported by multiple evaluation metrics including VAU-Eval, QA accuracy, temporal IoU, and Factual Consistency. Demonstrates significant improvements over supervised fine-tuning in question answering accuracy, temporal localization, and interpretability, thereby establishing a scalable, interpretable, and reasoning-aware VAU framework.