Scalable Probabilistic Planning

MDP Solvers That Scale

Overview

Being PSPACE-complete, even MDPs that describe relatively simple planning problems often defeat modern solvers by forcing them to run out of physical memory or take unreasonably long to produce a solution. Increasing scalability of techniques for solving MDPs is thus a central research topic in planning under uncertainty, and we address it in multiple ways.

One approach is based on the insight that, although an MDP's state space may be too small to fit in main memory, we can spill parts of the state-value table to disk without large sacrifices in speed. Namely, to efficiently perform value iteration, we partition the state space so that the number of state transitions between the partition cells is small. The cells are then loaded into main memory from disc one-by-one in a specially chosen order and have their states' values iteratively updated. The resulting method, called PEMVI, finds optimal policies and does only a few disk accesses in the process.

Unfortunately, some MDPs are too large to be solved optimally by any of the existing methods; the best we can hope for is an approximation. This motivates our other MDP solver, ReTrASE. ReTrASE employs a deterministic planner to automatically extract helpful structural features of a probabilistic planning problem and analyzes them to construct a policy. Experiments show ReTrASE to outperform state-of-the-art algorithms in terms of both efficiency and policy quality on many large MDPs. Ideas realized in ReTrASE also form the basis of heuristic functions GOTH and SixthSense.

This project is supported by National Science Foundation grants IIS-1016713 and IIS-1016465.

Publications

Artificial Intelligence for Artificial Artificial Intelligence Artificial Intelligence for Artificial Artificial Intelligence
Peng Dai, Mausam and Daniel S. Weld
AAAI Conference on Artificial Intelligence, 2011. Full Paper (PDF)
Artificial Intelligence for Artificial Artificial Intelligence Artificial Intelligence for Artificial Artificial Intelligence
Peng Dai, Mausam and Daniel S. Weld
Human Computation Workshop, 2011. Full Paper (PDF)
Human Intelligence Needs Artificial Intelligence Human Intelligence Needs Artificial Intelligence
Daniel S. Weld, Mausam and Peng Dai
Human Computation Workshop, 2011. Full Paper (PDF)
Heuristic Search for Generalized Stochastic Shortest Path MDPs Heuristic Search for Generalized Stochastic Shortest Path MDPs
Andrey Kolobov, Mausam, Daniel S. Weld and Hector Geffner
International Conference on Automated Planning and Scheduling, 2011. Full Paper (PDF)
Decision-Theoretic Control for Crowdsourced Workflows Decision-Theoretic Control for Crowdsourced Workflows
Peng Dai, Mausam and Daniel S. Weld
AAAI Conference on Artificial Intelligence, 2010. Full Paper (PDF)
SixthSense: Fast and Reliable Recognition of Dead Ends in MDPs SixthSense: Fast and Reliable Recognition of Dead Ends in MDPs
Andrey Kolobov, Mausam and Daniel S. Weld
AAAI Conference on Artificial Intelligence, 2010. Full Paper (PDF)
Classical Planning in MDP Heuristics: With a Little Help from Generalization Classical Planning in MDP Heuristics: With a Little Help from Generalization
Andrey Kolobov, Mausam and Daniel S. Weld
International Conference on Automated Planning and Scheduling, 2010. Full Paper (PDF)
Focused Topological Value Iteration Focused Topological Value Iteration
Peng Dai, Mausam and Daniel S. Weld
International Conference on Automated Planning and Scheduling, 2009. Full Paper (PDF)
Domain-Independent, Automatic Partitioning for Probabilistic Planning Domain-Independent, Automatic Partitioning for Probabilistic Planning
Peng Dai, Mausam and Daniel S. Weld
International Joint Conference on Artificial Intelligence, 2009. Full Paper (PDF)
ReTrASE: Intergating Paradigms for Approximate Probabilistic Planning
Andrey Kolobov, Mausam and Daniel S. Weld
International Joint Conference on Artificial Intelligence, 2009. Full Paper (PDF)
A Heuristic Search Approach to Planning with Continuous Resources in Stochastic Domains
Nicolas Meuleau, Emmanuel Benazera, Ronen Brafman, Eric Hansen and Mausam
Journal of Artificial Intelligence Research, 2009. Journal Article (PDF)
Partitioned External Memory Value Iteration Partitioned External Memory Value Iteration
Peng Dai, Mausam and Daniel S. Weld
AAAI Conference on Artificial Intelligence, 2008. Full Paper (PDF)
A Hybridized Planner for Stochastic Domains A Hybridized Planner for Stochastic Domains
Mausam, Piergiorgio Bertoli and Daniel S. Weld
International Joint Conference on Artificial Intelligence, 2007. Full Paper (PDF)
Planning with Continuous Resources in Stochastic Domains
Mausam, Emmanuel Benazera, Ronen Brafman, Nicolas Meuleau and Eric Hansen
International Joint Conference on Artificial Intelligence, 2005. Full Paper (PDF)