Partially observable MDPs (POMDPs)
In general, the optimal action at time t depends on the entire history of previous observations (and actions).
However, the full history need not be stored: a probability distribution over State(t), the belief state, is a sufficient statistic for acting optimally.
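A minimal sketch of the belief-state update this implies, assuming a discrete state space with a known transition model T and observation model O; the names belief_update, T, and O are illustrative, not from the slides:

```python
import numpy as np

def belief_update(belief, action, obs, T, O):
    """One Bayes-filter step on the belief state.

    belief: length-|S| probability vector over states at time t
    T:      T[action][s, s'] = P(s' | s, action)  (transition model)
    O:      O[s', obs]       = P(obs | s')        (observation model)
    """
    # Predict: push the current belief through the transition model.
    predicted = belief @ T[action]
    # Correct: reweight each state by the likelihood of the observation.
    unnormalized = predicted * O[:, obs]
    # Normalize so the result is again a distribution over State(t+1).
    return unnormalized / unnormalized.sum()
```

Applying this update after every action/observation pair maintains the belief incrementally, so a POMDP policy can map the current belief to an action instead of conditioning on the whole history.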