ICAPS 2021

Blind Decision Making: Reinforcement Learning with Delayed Observations

Reinforcement Learning, Delayed Reinforcement Learning, Planning and Control

PaperID: 312

Reinforcement learning typically assumes that the state update resulting from previous actions is available instantaneously and can therefore be used for future decisions. However, this assumption does not always hold. When the state update is not available, the decision is made partly blind, since it cannot rely on current state information. This paper proposes an approach that accounts for the delay in state knowledge and makes decisions from the available information, which may not include the current state. One approach could be to include the actions taken after the last-known state as part of the state information; however, this enlarges the state space, making the problem more complex and slowing convergence. The proposed algorithm offers an alternative in which the state space is no larger than in the case with no delay in the state update. Evaluations on basic RL environments further illustrate the improved performance of the proposed algorithm.
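
To make the state-space growth mentioned above concrete, the following is a minimal sketch of the augmented-state baseline only, not of the proposed algorithm. It assumes a constant delay of `delay` steps and finite state and action sets; the function names `augmented_state_count` and `augmented_states` are hypothetical and chosen for illustration.

```python
from itertools import product


def augmented_state_count(num_states: int, num_actions: int, delay: int) -> int:
    """Size of the augmented state space under a constant delay (an assumption).

    Each augmented state pairs the last-known state with the `delay` actions
    taken since that state was observed.
    """
    return num_states * num_actions ** delay


def augmented_states(states, actions, delay):
    """Enumerate every (last-known state, pending actions) pair explicitly."""
    return [
        (s, pending)
        for s in states
        for pending in product(actions, repeat=delay)
    ]


# Illustrative numbers only: 16 states, 4 actions, delays of 0 to 3 steps.
for d in range(4):
    print(d, augmented_state_count(16, 4, d))
```

Under these assumptions the augmented state space grows by a factor equal to the number of actions for every additional step of delay, which is the enlargement the abstract says the proposed algorithm avoids.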

Session 13: Temporal Planning | Numeric Planning | Reinforcement Learning

RePReL: Integrating Relational Planning and Reinforcement Learning for Effective Abstraction
Authors: Harsha Kokel, Arjun Manoharan, Sriraam Natarajan, Balaraman Ravindran and Prasad Tadepalli
Keywords: Reinforcement Learning, Planning, Relational MDP, Hierarchical

LM-cut and Operator Counting Heuristics for Optimal Numeric Planning with Simple Conditions
Authors: Ryo Kuroiwa, Alexander Shleyfman, Chiara Piacentini, Margarita Castro and J. Christopher Beck
Keywords: numeric planning, heuristic search, optimal planning, LM-cut, planning with resources, operator-counting

Privacy-Preserving Algorithm for Decoupling of Multi-Agent Plans with Uncertainty
Authors: Yuening Zhang and Brian Williams
Keywords: temporal network, temporal decoupling, multi-agent systems, multi-agent scheduling and execution, privacy

Blind Decision Making: Reinforcement Learning with Delayed Observations
Authors: Mridul Agarwal and Vaneet Aggarwal
Keywords: Reinforcement Learning, Delayed Reinforcement Learning, Planning and Control