**Value and Policy Iteration**
http://www.professeurs.polymtl.ca/jerome.le-ny/teaching/DP_fall09/notes/lec10_VI_PI.pdf

For infinite horizon problems, we need to replace our basic computational tool, the DP algorithm, which we used to compute the optimal cost and policy for finite horizon problems. We have already encountered in chapter 6 the value iteration (VI) algorithm, which is similar to the DP algorithm and computes

