Weakly Supervised on sis-arxiv-vad-papers

Weakly Supervised on sis-arxiv-vad-papershttps://phuchoang2603.github.io/sis-arxiv-vad-papers/categories/weakly-supervised/Recent content in Weakly Supervised on sis-arxiv-vad-papersHugo -- gohugo.ioenThu, 13 Feb 2025 00:00:00 +0000Personalizing Vision-Language Models With Hybrid Prompts for Zero-Shot Anomaly Detectionhttps://phuchoang2603.github.io/sis-arxiv-vad-papers/papers/personalizing_vision-language_models_with_hybrid_prompts_for_zero-shot_anomaly_detection/Thu, 13 Feb 2025 00:00:00 +0000https://phuchoang2603.github.io/sis-arxiv-vad-papers/papers/personalizing_vision-language_models_with_hybrid_prompts_for_zero-shot_anomaly_detection/Introduces AnomalyVLM, a framework leveraging hybrid prompts derived from prior knowledge to enhance zero-shot anomaly detection by personalizing vision-language models, incorporating an anomaly region generator and refiner, and utilizing hybrid prompts for category-specific customization and improved detection performance.PLOVAD: Prompting Vision-Language Models for Open Vocabulary Video Anomaly Detectionhttps://phuchoang2603.github.io/sis-arxiv-vad-papers/papers/plovad_prompting_vision-language_models_for_open_vocabulary_video_anomaly_detection/Fri, 10 Jan 2025 00:00:00 +0000https://phuchoang2603.github.io/sis-arxiv-vad-papers/papers/plovad_prompting_vision-language_models_for_open_vocabulary_video_anomaly_detection/A novel framework (PLOVAD) leveraging prompt tuning on large-scale pretrained image-based vision-language models for open vocabulary video anomaly detection, incorporating domain-specific and anomaly-specific prompts, and a temporal module to detect and categorize both seen and unseen anomalies with limited parameters.CALLM: Cascading Autoencoder and Large Language Model for Video Anomaly Detectionhttps://phuchoang2603.github.io/sis-arxiv-vad-papers/papers/callm_cascading_autoencoder_and_large_language_model_for_video_anomaly_detection/Mon, 01 Jan 2024 00:00:00 +0000https://phuchoang2603.github.io/sis-arxiv-vad-papers/papers/callm_cascading_autoencoder_and_large_language_model_for_video_anomaly_detection/This paper introduces a novel cascade system combining a 3D Autoencoder with a Large Visual Language Model (LVLM) for video anomaly detection, leveraging weak supervision and multimodal capabilities to improve detection and explanation of abnormalities.An Attribute-based Method for Video Anomaly Detectionhttps://phuchoang2603.github.io/sis-arxiv-vad-papers/papers/an-attribute-based-method-for-video-anomaly-detection/Sun, 01 Oct 2023 00:00:00 +0000https://phuchoang2603.github.io/sis-arxiv-vad-papers/papers/an-attribute-based-method-for-video-anomaly-detection/A simple attribute-based approach that represents each object by velocity and pose attributes, combining these with deep representations, and uses density estimation for anomaly scoring, achieving state-of-the-art performance.AVadCLIP: Audio-Visual Collaboration for Robust Video Anomaly Detectionhttps://phuchoang2603.github.io/sis-arxiv-vad-papers/papers/avadclip-audio-visual-collaboration-for-robust-video-anomaly-detection/Sun, 01 Oct 2023 00:00:00 +0000https://phuchoang2603.github.io/sis-arxiv-vad-papers/papers/avadclip-audio-visual-collaboration-for-robust-video-anomaly-detection/A novel weakly supervised framework leveraging audio-visual collaboration to improve the robustness and accuracy of video anomaly detection.Cross-Domain Learning for Video Anomaly Detection with Limited Supervisionhttps://phuchoang2603.github.io/sis-arxiv-vad-papers/papers/cross-domain-learning-for-vad-with-limited-supervision/Sun, 01 Oct 2023 00:00:00 +0000https://phuchoang2603.github.io/sis-arxiv-vad-papers/papers/cross-domain-learning-for-vad-with-limited-supervision/A proposed weakly-supervised framework that incorporates external unlabeled data during training by estimating prediction bias and adaptively minimizing it using predicted uncertainty, to enhance cross-domain generalization in video anomaly detection.TEVAD: Improved video anomaly detection with captionshttps://phuchoang2603.github.io/sis-arxiv-vad-papers/papers/chen_tevad_improved_video_anomaly_detection_with_captions_cvprw_2023_paper/Sun, 01 Jan 2023 00:00:00 +0000https://phuchoang2603.github.io/sis-arxiv-vad-papers/papers/chen_tevad_improved_video_anomaly_detection_with_captions_cvprw_2023_paper/Proposes a framework that utilizes both visual and text features, generated through dense video captions, to enhance anomaly detection performance and explainability in videos.