Meta Reinforcement Learning for Multi-task Offloading in Vehicular Edge Computing

计算机科学 强化学习 任务(项目管理) 架空(工程) 分布式计算 元编程 移动边缘计算 GSM演进的增强数据速率 边缘计算 人工智能 操作系统 经济 管理 程序设计语言
作者
Penglin Dai,Yaorong Huang,Kai‐Wen Hu,Xiao Wu,Huanlai Xing,Zhaofei Yu
出处
期刊:IEEE Transactions on Mobile Computing [Institute of Electrical and Electronics Engineers]
卷期号:: 1-16 被引量:3
标识
DOI:10.1109/tmc.2023.3247579
摘要

Mobile edge computing has been a promising solution to enable real-time service in vehicular networks. However, due to high dynamics of mobile environment and heterogeneous features of vehicular services, traditional expert-based or learning-based strategies has to update handcrafted parameters or retrain learning model, which leads to intolerant overhead. Therefore, this paper investigates the problem of multi-task offloading (MTO), where there exist multiple offloading scenarios with varying parameters, such as task topology, resource requirement and transmission/computation capability. The objective is to design a unified solution to minimize task execution time under different MTO scenarios. Accordingly, we develop a Seq2seq-based Meta Reinforcement Learning algorithm for MTO (SMRL-MTO). Specifically, a bidirectional gated recurrent units integrated with attention mechanism is designed to determine offloading action by encoding sequential offloading actions and showing different preferences to different parts of input sequence. Particularly, a meta reinforcement learning framework is designed based on model-agnostic meta learning, which trains a meta policy offline and fast adapts to new MTO scenario within a few training steps. Finally, we conduct performance evaluation based on task generator DAGGEN and realistic vehicular traces, which shows that the SMRL-MTO reduces task execution time by 11.36% on average compared with greedy algorithm.
最长约 10秒,即可获得该文献文件

科研通智能强力驱动
Strongly Powered by AbleSci AI
更新
大幅提高文件上传限制,最高150M (2024-4-1)

科研通是完全免费的文献互助平台,具备全网最快的应助速度,最高的求助完成率。 对每一个文献求助,科研通都将尽心尽力,给求助人一个满意的交代。
实时播报
1秒前
2秒前
Thomas完成签到,获得积分10
2秒前
彭于晏应助甜甜盼夏采纳,获得10
2秒前
星辰大海应助科研通管家采纳,获得10
4秒前
思源应助科研通管家采纳,获得10
4秒前
SciGPT应助科研通管家采纳,获得10
4秒前
rocky15应助科研通管家采纳,获得10
4秒前
英俊的铭应助科研通管家采纳,获得10
4秒前
曹文鹏发布了新的文献求助10
4秒前
5秒前
7秒前
jj完成签到,获得积分10
7秒前
9秒前
10秒前
12秒前
13秒前
聪慧凡松发布了新的文献求助10
13秒前
14秒前
俊逸沛山发布了新的文献求助10
14秒前
爱草莓发布了新的文献求助10
15秒前
16秒前
16秒前
17秒前
17秒前
18秒前
18秒前
Ningxin发布了新的文献求助10
18秒前
甜甜盼夏发布了新的文献求助10
19秒前
小盆呐发布了新的文献求助10
20秒前
JamesPei应助科研痛采纳,获得30
20秒前
高万发布了新的文献求助10
20秒前
21秒前
22秒前
传奇3应助俊逸沛山采纳,获得10
22秒前
22秒前
ZZXX发布了新的文献求助10
22秒前
研友_LMy6kL发布了新的文献求助10
23秒前
李爱国应助爱草莓采纳,获得10
25秒前
25秒前
高分求助中
Sustainable Land Management: Strategies to Cope with the Marginalisation of Agriculture 1000
Corrosion and Oxygen Control 600
Yaws' Handbook of Antoine coefficients for vapor pressure 500
Python Programming for Linguistics and Digital Humanities: Applications for Text-Focused Fields 500
行動データの計算論モデリング 強化学習モデルを例として 500
Johann Gottlieb Fichte: Die späten wissenschaftlichen Vorlesungen / IV,1: ›Transzendentale Logik I (1812)‹ 400
The role of families in providing long term care to the frail and chronically ill elderly living in the community 380
热门求助领域 (近24小时)
化学 材料科学 医学 生物 有机化学 工程类 生物化学 纳米技术 物理 内科学 计算机科学 化学工程 复合材料 遗传学 基因 物理化学 催化作用 电极 光电子学 量子力学
热门帖子
关注 科研通微信公众号,转发送积分 2555557
求助须知:如何正确求助?哪些是违规求助? 2179748
关于积分的说明 5621007
捐赠科研通 1901058
什么是DOI,文献DOI怎么找? 949551
版权声明 565592
科研通“疑难数据库(出版商)”最低求助积分说明 504748