TEVAD: Improved video anomaly detection with captions
·7563 words·36 mins
Proposes a framework that utilizes both visual and text features, generated through dense video captions, to enhance anomaly detection performance and explainability in videos.
