发布文献求助

Integrating image-based LLMs on edge-devices for underwater robotics

人工智能机器人学水下 GSM演进的增强数据速率计算机视觉图像（数学）计算机科学机器人地质学海洋学

作者

Prabha Sundaravadivel,Preetha J. Roselyn,N. Vedachalam,Vincent I. Jeyaraj,Aparna Ramesh,Aaditya Khanal

标识

DOI：10.1117/12.3014446

摘要

Image-based Large Language Models (LLMs) are AI models that can understand the captured images and generate textual content based on the analysis of images or visual data. Incorporating the LLMs for assessing water quality, pressure, and environmental conditions can help analyze historical data and predict potential risks and threats in underwater environments. This can improve the intervention of autonomous underwater vehicles ( AUV) and remotely operated vehicles ( ROV) during emergencies where the visual data must be interpreted to make informed decisions. While LLMs are primarily associated with processing and generating text, they can be integrated with images through a process known as multimodal learning, where text and images are combined for tasks that involve both modalities. Implementing such frameworks is challenging when deployed in low-power microcontrollers primarily used in monitoring systems. This research proposes evaluating multimodal tokens to enable edge computing in bio-inspired robots to monitor the underwater environment. This can help break down large real-time videos into tokens of text-based instructions associated with the description of images. The mini-robots will transmit the collected "tokens" to the nearest AUV or ROV, where the image-based LLM will be deployed. We propose to evaluate this image-based LLM in our NVIDIA Jetson Nano-based AUV. In the proposed architecture, the mini-robots can move along the length of the water column to capture images of the underwater environment. Our proposed model is evaluated to generate texts for boat and fish images. This proposed framework with integrated image-based tokens can significantly reduce the response time and data traffic in underwater real-time monitoring systems.

求助该文献

最长约 10秒，即可获得该文献文件

科研通智能强力驱动
Strongly Powered by AbleSci AI

我的文献求助列表浏览历史

一分钟了解求助规则 | 捐赠本站 | 历史今天

活动

『应助活动周』获奖名单已公布 🔥 (2025-4-2)

更新

『中科院2025期刊分区』已更新 (2025-3-23)

更新

『即时热点』模块已上线 (2025-2-28)

科研通是完全免费的文献互助平台，具备全网最快的应助速度，最高的求助完成率。对每一个文献求助，科研通都将尽心尽力，给求助人一个满意的交代。

实时播报: Aaron发布了新的文献求助10

1秒前; 奋斗的蜗牛的应助被激流勇进采纳，获得10

1秒前; 11发布了新的文献求助10

3秒前; 张泽崇发布了新的文献求助10

6秒前; CodeCraft的应助被xixihaha采纳，获得10

7秒前; 华仔的应助被Ab采纳，获得10

8秒前; 小蘑菇上传了应助文件

9秒前; 香蕉觅云的应助被XMUh采纳，获得20

12秒前; javaxixi完成签到，获得积分20

13秒前; 情怀上传了应助文件

13秒前; 机灵雨发布了新的文献求助10

14秒前; jenningseastera上传了应助文件

17秒前; 烟花的应助被Aaron采纳，获得10

18秒前; 丘比特上传了应助文件

19秒前; 醒了没醒醒发布了新的文献求助10

19秒前; 英俊的铭的应助被水灯霖采纳，获得10

22秒前; 王婧萱萱萱完成签到，获得积分10

22秒前; 孙策完成签到，获得积分10

24秒前; 科研通AI2S上传了应助文件

24秒前; 丘比特的应助被英勇的寒蕾采纳，获得10

25秒前; 吧啦吧啦流金岁月发布了新的文献求助10

25秒前; 醒了没醒醒完成签到，获得积分10

26秒前; hhh完成签到，获得积分10

27秒前; 斯文败类上传了应助文件

29秒前; 冰魂的应助被shangguanyilin采纳，获得50

29秒前; 烟花上传了应助文件

30秒前; 科研通AI5上传了应助文件

30秒前; 完美世界上传了应助文件

31秒前; 英俊的铭上传了应助文件

31秒前; 思源上传了应助文件

33秒前; 学术白菜完成签到，获得积分10

33秒前; 陈豆豆发布了新的文献求助10

34秒前; Aaron发布了新的文献求助10

35秒前; Ahiterin完成签到，获得积分10

35秒前; 学术白菜发布了新的文献求助10

36秒前; nns发布了新的文献求助10

37秒前; 水灯霖发布了新的文献求助10

37秒前; NexusExplorer上传了应助文件

38秒前; 冰魂上传了应助文件

38秒前; 研友_LkKrmL发布了新的文献求助10

39秒前

高分求助中: 【此为提示信息，请勿应助】请按要求发布求助，避免被关 20000; Les Mantodea de Guyane Insecta, Polyneoptera 2500; Computational Atomic Physics for Kilonova Ejecta and Astrophysical Plasmas 500; Technologies supporting mass customization of apparel: A pilot project 450; Cybersecurity Blueprint – Transitioning to Tech 400; Mixing the elements of mass customisation 360; Периодизация спортивной тренировки. Общая теория и её практическое применение 310

热门求助领域（近24小时）

热门帖子: 关注科研通微信公众号，转发送积分 3782142; 求助须知：如何正确求助？哪些是违规求助？ 3327581; 关于积分的说明 10232377; 捐赠科研通 3042529; 什么是DOI，文献DOI怎么找？ 1670040; 邀请新用户注册赠送积分活动 799600; 科研通“疑难数据库（出版商）”最低求助积分说明 758842

今日热心研友

jenningseastera

平常的毛豆

忐忑的黑猫

科研小民工

注：热心度 = 本日应助数 + 本日被采纳获取积分÷10

Copyright © 2020-2025 AbleSci.COM, 科研通, All Right Reserved

科研通是非营利科研互助平台，不忘初心，为科研助力

本站互助的所有文件仅供个人学习研究用，禁止任何人把求助的所得文献进行盈利或传播

皖ICP备2024041134号-1

皖公网安备34019202002308

科研通【文献互助QQ群】：如果您有特殊求助，或发布求助超过24小时未得到应助，可加群求助，群号：941272744【点击一键加群】

科研通【志愿服务QQ群】：如果您热爱文献互助，有热心愿意为更多人服务，请加入小伙伴群，点击申请加入

关注微信服务号

科研通