Skip to main content

Chenting Xu

2025

PLOVAD: Prompting Vision-Language Models for Open Vocabulary Video Anomaly Detection

A novel framework (PLOVAD) leveraging prompt tuning on large-scale pretrained image-based vision-language models for open vocabulary video anomaly detection, incorporating domain-specific and anomaly-specific prompts, and a temporal module to detect and categorize both seen and unseen anomalies with limited parameters.