Prove Corollary 1.3 (p. 9) from the script Theory of Reinforcement Learning^{3}: Every policy for which satisfies the Bellman optimality equations.
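
For orientation, the Bellman optimality equations can be sketched in the common notation for a finite discounted Markov decision process. The symbols below are assumptions and are not taken from the script, whose notation may differ: states $s, s'$, actions $a$, rewards $r$, transition kernel $p(s', r \mid s, a)$, discount factor $\gamma \in [0, 1)$, state-value function $v$ and action-value function $q$.

```latex
% Bellman optimality equations in assumed standard notation (not the script's own).
\begin{align*}
  v(s)    &= \max_{a} \sum_{s', r} p(s', r \mid s, a)\,\bigl[\, r + \gamma\, v(s') \,\bigr]
          && \text{for all states } s, \\
  q(s, a) &= \sum_{s', r} p(s', r \mid s, a)\,\bigl[\, r + \gamma \max_{a'} q(s', a') \,\bigr]
          && \text{for all state-action pairs } (s, a).
\end{align*}
```

Under this reading, saying that a policy satisfies the equations means that its own value functions, substituted for $v$ and $q$, make both identities hold.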