Prove Corollary 1.3 (p. 9) from the script Theory of Reinforcement Learning ^{7}: Every policy for which satisfies the Bellman optimality equations