ACIGS: An automated large-scale crops image generation system based on large visual language multi-modal models

Computer science, Artificial intelligence, Image quality, Computer vision, Image (mathematics)
Authors
Bolong Liu,Hao Zhang,Jie Liu,Qiang Wang
Identifier
DOI:10.1109/secon58729.2023.10287530
Abstract

Smart agriculture requires a deep convergence of information technology and agriculture, and achieving intelligence requires an enormous amount of data to train models. However, crop image data are difficult to acquire in large quantities, which limits the application and growth of computer vision technology in agriculture. To address this problem, we designed a crop image generation system that combines a large language model with large visual language multi-modal models to augment the scale, variety, and resolution of crop image data. First, the system inputs existing real crop images into the visual language multi-modal model to extract features and represent the crop images in text form. Then, the system passes the crop text representation to the language model for cleaning and processing, which generates prompts for creating crop images. The prompts are input into the visual language multi-modal model to generate crop images based on the text representation of the crops. The resulting crop images undergo image quality evaluation in the visual language multi-modal model, and high-quality crop images are saved to the crop image dataset based on the quality evaluation. These steps produce the final generated crop image dataset. The experimental results indicate that the crop images generated by the proposed system are similar to, but not identical to, the example images. This characteristic enables the expansion of crop data while avoiding redundancy and allowing for resolution control, which is crucial for dense segmentation tasks. Using this method, the existing data can be enlarged up to 7.5 times.
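
The four stages described in the abstract (describe, clean, generate, filter) can be sketched in Python as follows. This is only an illustrative outline, not the authors' implementation: the abstract does not name the models used, so each stage is passed in as a plain callable, and every name here (run_pipeline, describe, refine, generate, assess, variants_per_image, min_score) is a hypothetical placeholder. The stand-in functions in demo() are toy stubs so the control flow can be run without any model.

```python
from dataclasses import dataclass
from typing import Callable, List


@dataclass
class GeneratedImage:
    prompt: str    # cleaned text prompt used for generation
    pixels: bytes  # placeholder for image data
    score: float   # quality score assigned by the evaluator


def run_pipeline(
    real_images: List[bytes],
    describe: Callable[[bytes], str],   # assumed: vision-language model, image -> text
    refine: Callable[[str], str],       # assumed: language model, raw text -> prompt
    generate: Callable[[str], bytes],   # assumed: text-to-image model, prompt -> image
    assess: Callable[[bytes], float],   # assumed: quality evaluator, image -> score
    variants_per_image: int = 3,        # hypothetical parameter
    min_score: float = 0.5,             # hypothetical quality threshold
) -> List[GeneratedImage]:
    """Describe each real image, clean the text, generate candidates, keep good ones."""
    dataset: List[GeneratedImage] = []
    for image in real_images:
        prompt = refine(describe(image))
        for _ in range(variants_per_image):
            candidate = generate(prompt)
            score = assess(candidate)
            if score >= min_score:
                dataset.append(GeneratedImage(prompt, candidate, score))
    return dataset


def demo() -> List[GeneratedImage]:
    """Run the pipeline with toy stand-ins in place of real models."""
    counter = {"n": 0}

    def describe(image: bytes) -> str:
        return "  A photo of " + image.decode() + " plants  "

    def refine(text: str) -> str:
        # Toy cleaning step: collapse whitespace and lowercase.
        return " ".join(text.split()).lower()

    def generate(prompt: str) -> bytes:
        counter["n"] += 1
        return f"{prompt}#{counter['n']}".encode()

    def assess(image: bytes) -> float:
        # Toy rule: even-numbered candidates count as high quality.
        index = int(image.decode().rsplit("#", 1)[1])
        return 0.9 if index % 2 == 0 else 0.1

    return run_pipeline(
        [b"wheat", b"maize"], describe, refine, generate, assess,
        variants_per_image=4,
    )
```

Passing the stages in as callables keeps the loop independent of any particular model, so a real captioning, language, or image-generation backend could be substituted without changing the filtering logic.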