ICAPS 2021

LM-cut and Operator Counting Heuristics for Optimal Numeric Planning with Simple Conditions

numeric planning heuristic search optimal planning LM-cut planning with resources operator-counting

PaperID: 120

We consider optimal numeric planning with numeric conditions consisting of linear expressions of numeric state variables and actions that increase or decrease numeric state variables by constant quantities. We build on previous research to introduce a new variant of the numeric h^{max} heuristic based on the delete-relaxed version of such planning tasks. Although our h^{max} heuristic is inadmissible, it yields a numeric version of the classical LM-cut heuristic which is admissible. Further, we prove that our LM-cut heuristic neither dominates nor is dominated by the existing numeric heuristic h^{max}_{hbd}. We show that admissibility also holds when integrating the numeric cuts into the operator-counting (OC) heuristic producing an admissible numeric version of the OC heuristic. Through experiments, we demonstrate that both these heuristics compete favorably with the state-of-the-art heuristics: in particular, while sometimes expanding more nodes than other heuristics, numeric OC solves 19 more problem instances than the next closest heuristic.

Session 13: Temporal Planning | Numeric Planning | Reinforcement Learning

RePReL: Integrating Relational Planning and Reinforcement Learning for Effective Abstraction
Authors: Harsha Kokel, Arjun Manoharan, Sriraam Natarajan, Balaraman Ravindran and Prasad Tadepalli
Keywords: Reinforcement LearningPlanningRelational MDPHierarchical

LM-cut and Operator Counting Heuristics for Optimal Numeric Planning with Simple Conditions
Authors: Ryo Kuroiwa, Alexander Shleyfman, Chiara Piacentini, Margarita Castro and J. Christopher Beck
Keywords: numeric planningheuristic searchoptimal planningLM-cutplanning with resourcesoperator-counting

Privacy-Preserving Algorithm for Decoupling of Multi-Agent Plans with Uncertainty
Authors: Yuening Zhang and Brian Williams
Keywords: temporal networktemporal decouplingmulti-agent systemsmulti-agent scheduling and executionprivacy

Blind Decision Making: Reinforcement Learning with Delayed Observations
Authors: Mridul Agarwal and Vaneet Aggarwal
Keywords: Reinforcement LearningDelayed Reinforcement LearningPlanning and Control