强化学习
计算机科学
钢筋
人工智能
人机交互
心理学
社会心理学
作者
Yifan Hu,HU Bin-bin,Bowen Yuan,Hai‐Tao Zhang
标识
DOI:10.1109/icaisisas64483.2025.11051677
摘要
Despite the widespread deployment of Large Language Models (LLMs) in embodied intelligent agents for their promising human-like reasoning capabilities, they remain hindered by local infeasibility and unpredictable cloud latency. To address this challenge, we propose LLM-Coach, a reinforcement learning framework that utilizes LLM-driven reward shaping for navigation on water surface during the training phase. Our approach enables models to integrate the common-sense knowledge embedded in LLMs while eliminating runtime dependency on these resource-intensive models. We implement the proposed framework in the European Ship Simulator environment, introducing two specialized LLM agents: an Observation Agent for real-time state extraction and an Evaluation Agent for policy assessment. Through synergistic collaboration, these agents facilitate the training of a deep neural network-based navigation model (ESSNet) that operates efficiently during execution. Finally, experimental results demonstrate that our approach outperforms traditional reinforcement learning methods while ensuring operational robustness.
科研通智能强力驱动
Strongly Powered by AbleSci AI