pomdp_py Documentation

Overview

pomdp_py is a general-purpose POMDP library written in Python and Cython. It features simple and comprehensive interfaces for describing POMDP or MDP problems. Although it was originally written to support POMDP planning research, its interfaces also allow extensions to model-free or model-based learning in (PO)MDPs, multi-agent POMDP planning/learning, and task transfer or transfer learning.

Why pomdp_py? It provides a POMDP framework in Python with clean and intuitive interfaces, which makes POMDP-related research and projects accessible to more people. It also makes it easier to share code and build a community.

POMDP stands for Partially Observable Markov Decision Process [6].
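
To make the term concrete, the following is a minimal sketch of the belief update that partial observability requires, written in plain Python rather than with pomdp_py's own interfaces. The two-state "tiger" setup, the 0.85 listening accuracy, and every name in the block are illustrative assumptions, not part of the library.

```python
# Plain-Python sketch of a POMDP belief update (does NOT use pomdp_py).
# Assumed toy problem: a tiger hides behind the left or right door, and the
# agent can only listen, which gives a noisy hint about the tiger's location.

STATES = ("tiger-left", "tiger-right")

# Which hidden state each observation points to (hypothetical names).
HINTS = {"hear-left": "tiger-left", "hear-right": "tiger-right"}


def observation_prob(observation, state, accuracy=0.85):
    """Return P(observation | state); the accuracy value is an assumption."""
    return accuracy if HINTS[observation] == state else 1.0 - accuracy


def update_belief(belief, observation):
    """Bayes update of the belief after listening.

    Listening is assumed not to move the tiger, so no transition step is
    needed: posterior(s) is proportional to P(o | s) * prior(s).
    """
    unnormalized = {
        s: observation_prob(observation, s) * belief[s] for s in STATES
    }
    total = sum(unnormalized.values())
    return {s: p / total for s, p in unnormalized.items()}


# Start from a uniform belief and incorporate one noisy observation.
belief = {"tiger-left": 0.5, "tiger-right": 0.5}
belief = update_belief(belief, "hear-left")
print(belief)
```

The agent never sees the state directly. It only maintains a probability distribution over states and revises it with each observation, which is what distinguishes a POMDP from a fully observable MDP.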

The code is available on GitHub. We welcome contributions to this library in the following areas:

  1. Implementation of additional POMDP solvers (see Existing POMDP Solvers)

  2. Implementation of additional POMDP domains (see Examples)

  3. Interfacing with existing POMDP libraries (most of which are written in other languages)

  4. Extension of pomdp_py beyond planning (see Use Case Extensions).

Getting Started

Tools

[6] Leslie Pack Kaelbling, Michael L. Littman, and Anthony R. Cassandra. Planning and acting in partially observable stochastic domains. Artificial Intelligence, 101(1-2):99–134, 1998.

Brown University AI