Back to Search Start Over

RLProph: a dynamic programming based reinforcement learning approach for optimal routing in opportunistic IoT networks.

Authors :
Sharma, Deepak Kumar
Rodrigues, Joel J. P. C.
Vashishth, Vidushi
Khanna, Anirudh
Chhabra, Anshuman
Source :
Wireless Networks (10220038). Aug2020, Vol. 26 Issue 6, p4319-4338. 20p.
Publication Year :
2020

Abstract

Routing in Opportunistic Internet of Things networks (OppIoTs) is a challenging task because of intermittent connectivity between devices and the lack of a fixed path between the source and destination of messages. Recently, machine learning (ML) and reinforcement learning (RL) have been used with great success to automate processes in a number of different problem domains. In this paper, we seek to fully automate the OppIoT routing process by using the Policy Iteration algorithm to maximize the possibility of message delivery. Moreover, we model the OppIoT environment as a Markov decision process (MDP) replete with states, actions, rewards, and transition probabilities. The proposed routing protocol, RLProph, is able to optimize the routing process via the optimal policy obtained by solving the MDP using Policy Iteration. Through extensive simulations, we show that RLProph outperforms a number of ML-based and context-aware routing protocols on a multitude of performance criteria. [ABSTRACT FROM AUTHOR]

Details

Language :
English
ISSN :
10220038
Volume :
26
Issue :
6
Database :
Academic Search Index
Journal :
Wireless Networks (10220038)
Publication Type :
Academic Journal
Accession number :
143759827
Full Text :
https://doi.org/10.1007/s11276-020-02331-1