Kamal Nasrollahi

2024

CALLM: Cascading Autoencoder and Large Language Model for Video Anomaly Detection

1 January 2024·3578 words·17 mins

Apostolos Ntelopoulos , Kamal Nasrollahi

Cuhk-Avenue Shanghaitech Ucf-Crime Ubnormal Weakly Supervised Method

This paper introduces a novel cascade system combining a 3D Autoencoder with a Large Visual Language Model (LVLM) for video anomaly detection, leveraging weak supervision and multimodal capabilities to improve detection and explanation of abnormalities.

↑