Value Iteration and Its Variants
Value Iteration is a straightforward implementation of the recursive optimality equation.
- Initialize v0 to some nominal value.
- Compute vi+1 from vi
- Terminate when || vi+1 – vi || is close
Several variants of value iteration try to get faster convergence by using new values of vi+1(s) as soon as they become available