<?xml version="1.0" encoding="utf-8" standalone="yes"?><rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom"><channel><title>Chirui Chang on sis-arxiv-vad-papers</title><link>https://phuchoang2603.github.io/sis-arxiv-vad-papers/authors/chirui-chang/</link><description>Recent content in Chirui Chang on sis-arxiv-vad-papers</description><generator>Hugo -- gohugo.io</generator><language>en</language><lastBuildDate>Sun, 01 Oct 2023 00:00:00 +0000</lastBuildDate><atom:link href="https://phuchoang2603.github.io/sis-arxiv-vad-papers/authors/chirui-chang/index.xml" rel="self" type="application/rss+xml"/><item><title>Aligning Effective Tokens with Video Anomaly in Large Language Models</title><link>https://phuchoang2603.github.io/sis-arxiv-vad-papers/papers/aligning-effective-tokens-with-video-anomaly-in-large-language-models/</link><pubDate>Sun, 01 Oct 2023 00:00:00 +0000</pubDate><guid>https://phuchoang2603.github.io/sis-arxiv-vad-papers/papers/aligning-effective-tokens-with-video-anomaly-in-large-language-models/</guid><description>Proposes VA-GPT, a multimodal large language model for video anomaly detection and understanding that uses effective token selection and generation modules (SETS and TETG) to improve spatial and temporal localization of anomalies. Introduces instruction-following fine-tuning data and cross-domain benchmarks for robustness evaluation.</description></item></channel></rss>