Kun Qian

2023

Exploring Large Vision-Language Models for Robust and Efficient Industrial Anomaly Detection

1 October 2023·4850 words·23 mins

Proposes a novel approach (CLAD) leveraging large vision-language models with contrastive cross-modal training for improved industrial anomaly detection and localization, enhancing interpretability and robustness.

↑