Benchmarking Multi-dimensional AIGC Video Quality Assessment: A Dataset and Unified Model

计算机科学标杆管理质量（理念）质量评定人工智能数据挖掘数据科学评价方法可靠性工程认识论工程类哲学业务营销

作者

Zhichao Zhang,Wei Sun,Xinyue Li,Jun Jia,Xiongkuo Min,Zicheng Zhang,Chunyi Li,Zijian Chen,Puyi Wang,Fengyu Sun,Shangling Jui,Guangtao Zhai

出处

期刊：ACM Transactions on Multimedia Computing, Communications, and Applications [Association for Computing Machinery]
日期：2025-07-22 卷期号：21 (9): 1-24 被引量：7

标识

DOI：10.1145/3749844

摘要

In recent years, AI-driven video generation has gained significant attention due to great advancements in visual and language generative techniques. Consequently, there is a growing need for accurate Video Quality Assessment (VQA) metrics to evaluate the perceptual quality of AI-generated content (AIGC) videos and optimize video generation models. However, assessing the quality of AIGC videos remains a significant challenge because these videos often exhibit highly complex distortions, such as unnatural actions and irrational objects. To address this challenge, we systematically investigate the AIGC-VQA problem in this article, considering both subjective and objective quality assessment perspectives. For the subjective perspective, we construct the L arge-scale G enerated V ideo Q uality Assessment (LGVQ) dataset, consisting of \(2,\!808\) AIGC videos generated by six video generation models using 468 carefully curated text prompts. Unlike previous subjective VQA experiments, we evaluate the perceptual quality of AIGC videos from three critical dimensions: spatial quality, temporal quality, and text-video alignment, which hold utmost importance for current video generation techniques. For the objective perspective, we establish a benchmark for evaluating existing quality assessment metrics on the LGVQ dataset. Our findings show that current metrics perform poorly on this dataset, highlighting a gap in effective evaluation tools. To bridge this gap, we propose the U nify G enerated V ideo Q uality Assessment (UGVQ) model, designed to accurately evaluate the multi-dimensional quality of AIGC videos. The UGVQ model integrates the visual and motion features of videos with the textual features of their corresponding prompts, forming a unified quality-aware feature representation tailored to AIGC videos. Experimental results demonstrate that UGVQ achieves state-of-the-art performance on the LGVQ dataset across all three quality dimensions, validating its effectiveness as an accurate quality metric for AIGC videos. We hope that our benchmark can promote the development of AIGC-VQA studies. Both the LGVQ dataset and the UGVQ model are publicly available on https://github.com/zczhang-sjtu/UGVQ.git .

求助该文献

最长约 10秒，即可获得该文献文件

Benchmarking Multi-dimensional AIGC Video Quality Assessment: A Dataset and Unified Model

今日热心研友