计算机科学
RSS
强化学习
无线
马尔可夫决策过程
蓝牙
无线网络
无监督学习
位置感知
人工智能
过程(计算)
机器学习
无线传感器网络
马尔可夫过程
计算机网络
电信
统计
数学
操作系统
作者
You Li,Xin Hu,Yuan Zhuang,Zhouzheng Gao,Peng Zhang,Naser El‐Sheimy
标识
DOI:10.1109/jiot.2019.2957778
摘要
Location is key to spatialize internet-of-things (IoT) data. However, it is challenging to use low-cost IoT devices for robust unsupervised localization (i.e., localization without training data that have known location labels). Thus, this paper proposes a deep reinforcement learning (DRL) based unsupervised wireless-localization method. The main contributions are as follows. (1) This paper proposes an approach to model a continuous wireless-localization process as a Markov decision process (MDP) and process it within a DRL framework. (2) To alleviate the challenge of obtaining rewards when using unlabeled data (e.g., daily-life crowdsourced data), this paper presents a reward-setting mechanism, which extracts robust landmark data from unlabeled wireless received signal strengths (RSS). (3) To ease requirements for model re-training when using DRL for localization, this paper uses RSS measurements together with agent location to construct DRL inputs. The proposed method was tested by using field testing data from multiple Bluetooth 5 smart ear tags in a pasture. Meanwhile, the experimental verification process reflected the advantages and challenges for using DRL in wireless localization.
科研通智能强力驱动
Strongly Powered by AbleSci AI