My home page
Biography
Research
Publications
My group
Courses
Professional activities
FAQ
Personal
Papers

Daphne Koller Publications

Reinforcement Learning Using Approximate Belief States (2000)

by A. Rodríguez, R. Parr, and D. Koller


Abstract: The problem of developing good policies for partially observation Markov decision processes (POMDPs) remains one of the most challenging areas of research in stochastic planning. One line of research in this area involves the use of reinforcement learning with belief states, probability distributions over the underlying model states. This is a promising method for small problems, but its application is limited by the intractability of computing or representing a full belief state for large problems. Recent work shows that, in many settings, we can maintain an approximate belief state, which is fairly close to the true belief state. In particular, great success has been shown wiht approximate belief states that marginalize out correlations between state variables. In this paper, we investigate two methods of ull belief state reinforcement learning and one novel mmethod for reinforcement learning using factored approximate belief states. We compare the performance of thse algorithms on several well-known problems from the literature. Our results demonstrate the importance of approximate belief state representations for large problems.


Download Information

A. Rodríguez, R. Parr, and D. Koller (2000). "Reinforcement Learning Using Approximate Belief States." Advances in Neural Information Processing Systems (NIPS '99) (pp. 1036-1042). pdf ps.gz

Bibtex citation

@inproceedings{Rodriguez+al:NIPS99,
  author =       "A. Rodr{\'{\i}}guez and R. Parr and D. Koller",
  editor =       "S. A. Solla and T. K. Leen and K.-R. M{\"u}ller",
  booktitle =    "Advances in Neural Information Processing Systems
                 (NIPS '99)",
  title =        "Reinforcement Learning Using Approximate Belief
                 States",
  address =      "Denver, Colorado",
  volume =       "12",
  publisher =    "The {MIT} Press",
  pages =        "1036--1042",
  year =         "2000",
}

full list
Click to go to robotics Click to go to theory Click to go to CS Stanford Click to go to Stanford's Webpage
home | biography | research | papers | my group
courses | professional activities | FAQ | personal