Runtime Adaptation in Wireless Sensor Nodes Using Structured Learning

Adrian Sapio, Shuvra S. Bhattacharyya, Marilyn Wolf

Research output: Contribution to journalArticleScientificpeer-review

3 Citations (Scopus)

Abstract

Markov Decision Processes (MDPs) provide important capabilities for facilitating the dynamic adaptation and self-optimization of cyber physical systems at runtime. In recent years, this has primarily taken the form of Reinforcement Learning (RL) techniques that eliminate some MDP components for the purpose of reducing computational requirements. In this work, we show that recent advancements in Compact MDP Models (CMMs) provide sufficient cause to question this trend when designing wireless sensor network nodes. In this work, a novel CMM-based approach to designing self-aware wireless sensor nodes is presented and compared to Q-Learning, a popular RL technique. We show that a certain class of CPS nodes is not well served by RL methods and contrast RL versus CMM methods in this context. Through both simulation and a prototype implementation, we demonstrate that CMM methods can provide significantly better runtime adaptation performance relative to Q-Learning, with comparable resource requirements.

Original languageEnglish
Article number40
Number of pages28
JournalACM Transactions on Cyber-Physical Systems
Volume4
Issue number4
DOIs
Publication statusPublished - Aug 2020
Publication typeA1 Journal article-refereed

Keywords

  • adaptation
  • LTE-M
  • Markov decision processes
  • reinforcement learning
  • self-awareness

Publication forum classification

  • Publication forum level 1

ASJC Scopus subject areas

  • Artificial Intelligence
  • Computer Networks and Communications
  • Hardware and Architecture
  • Human-Computer Interaction
  • Control and Optimization

Fingerprint

Dive into the research topics of 'Runtime Adaptation in Wireless Sensor Nodes Using Structured Learning'. Together they form a unique fingerprint.

Cite this