Jialong Zuo on sis-arxiv-vad-papers

https://phuchoang2603.github.io/sis-arxiv-vad-papers/authors/jialong-zuo/
Recent content in Jialong Zuo on sis-arxiv-vad-papers
Sun, 01 Oct 2023 00:00:00 +0000

Holmes-VAD: Towards Unbiased and Explainable Video Anomaly Detection via Multi-modal LLM
https://phuchoang2603.github.io/sis-arxiv-vad-papers/papers/holmes-vad-towards-unbiased-and-explainable-video-anomaly-detection-via-multi-modal-llm/
Sun, 01 Oct 2023 00:00:00 +0000
A novel framework leveraging multimodal instructions and large-scale datasets to enable unbiased, interpretable, and accurate video anomaly detection with large language models, including a new dataset VAD-Instruct50k with single-frame annotations and explanatory instruction data.

Holmes-VAU: Towards Long-term Video Anomaly Understanding at Any Granularity
https://phuchoang2603.github.io/sis-arxiv-vad-papers/papers/holmes-vau-towards-long-term-video-anomaly-understanding-at-any-granularity/
Sun, 01 Oct 2023 00:00:00 +0000
A semi-automated hierarchical video annotation framework combined with a novel Anomaly-focused Temporal Sampler and a multimodal large language model, aimed at comprehensive understanding of complex and long-term video anomalies across multiple temporal scales.