Acting under uncertainty
Overall utility = sum of momentary rewards.
Allows a rich preference model, e.g., rewards corresponding to “get to goal asap”.
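A minimal sketch of the additive-utility idea, assuming one common way to encode “get to goal asap”: every step taken before reaching the goal earns a fixed negative reward. The function names and the step cost of -1 are illustrative assumptions, not taken from the slide.

```python
def utility(rewards):
    """Overall utility of a trajectory: the plain sum of its momentary rewards."""
    return sum(rewards)


def step_rewards(num_steps, step_cost=-1.0):
    """Momentary rewards for a path that reaches the goal in num_steps moves.

    Assumption (hypothetical encoding): each move costs step_cost, and
    nothing more is received once the goal is reached.
    """
    return [step_cost] * num_steps


short_path = step_rewards(3)  # reaches the goal in 3 moves
long_path = step_rewards(7)   # reaches the goal in 7 moves

# The shorter path has the higher summed utility, so an agent that
# maximizes utility prefers to reach the goal sooner.
prefers_short = utility(short_path) > utility(long_path)
```

Under this encoding the preference for speed comes entirely from the sum: no separate "be fast" rule is needed.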
Markov Decision Problem (MDP)