Yiming Wang on sis-arxiv-vad-papers

Yiming Wang on sis-arxiv-vad-papershttps://phuchoang2603.github.io/sis-arxiv-vad-papers/authors/yiming-wang/Recent content in Yiming Wang on sis-arxiv-vad-papersHugo -- gohugo.ioenSun, 01 Oct 2023 00:00:00 +0000Harnessing Large Language Models for Training-free Video Anomaly Detectionhttps://phuchoang2603.github.io/sis-arxiv-vad-papers/papers/zanella_harnessing_large_language_models_for_training-free_video_anomaly_detection_cvpr_2024_paper/Sun, 01 Oct 2023 00:00:00 +0000https://phuchoang2603.github.io/sis-arxiv-vad-papers/papers/zanella_harnessing_large_language_models_for_training-free_video_anomaly_detection_cvpr_2024_paper/Introduces a training-free method for video anomaly detection (VAD) leveraging pre-trained large language models (LLMs) and vision-language models (VLMs). Proposes techniques for caption cleaning, scene description, and anomaly scoring without additional training, demonstrating superior performance on surveillance datasets.Delving into CLIP latent space for Video Anomaly Recognitionhttps://phuchoang2603.github.io/sis-arxiv-vad-papers/papers/delving-into-clip-latent-space-for-video-anomaly-recognition/Sun, 01 Jan 2023 00:00:00 +0000https://phuchoang2603.github.io/sis-arxiv-vad-papers/papers/delving-into-clip-latent-space-for-video-anomaly-recognition/Proposes AnomalyCLIP, a novel method leveraging Large Language and Vision (LLV) models like CLIP, combined with multiple instance learning and a re-centring transformation of the CLIP feature space, to detect and classify video anomalies and recognize anomaly types. Introduces a Selector model with prompt learning and a Temporal Transformer-based model for temporal dependency modeling; demonstrates state-of-the-art performance on multiple benchmarks.