ChartX and ChartVLM: A Versatile Benchmark and Foundation Model for Complicated Chart Reasoning

Authors
Renqiu Xia, Hancheng Ye, Xiangchao Yan, Qi Liu, Hongbin Zhou, Zijun Chen, Botian Shi, Junchi Yan, Bo Zhang
Source
Journal: IEEE Transactions on Image Processing [Institute of Electrical and Electronics Engineers]
Volume: 34, pages 7436-7447. Citations: 1
Identifier
DOI: 10.1109/tip.2025.3607618
Abstract

Recently, many versatile Multi-modal Large Language Models (MLLMs) have emerged continuously. However, their capacity to query information depicted in visual charts and engage in reasoning based on the queried contents remains under-explored. In this paper, to comprehensively and rigorously benchmark the ability of the off-the-shelf MLLMs in the chart domain, we construct ChartX, a multi-modal evaluation set covering 18 chart types, 7 chart tasks, 22 disciplinary topics, and high-quality chart data. Besides, we develop ChartVLM to offer a new perspective on handling multi-modal tasks that strongly depend on interpretable patterns, such as reasoning tasks in the field of charts or geometric images. We evaluate the chart-related ability of mainstream MLLMs and our ChartVLM on the proposed ChartX evaluation set. Extensive experiments demonstrate that ChartVLM surpasses both versatile and chart-related large models, including GPT-4V. We believe that our study can pave the way for further exploration in creating a more comprehensive chart evaluation set and developing more interpretable multi-modal models. Both ChartX and ChartVLM are available at: https://github.com/Alpha-Innovator/ChartVLM.