Preview fragment. Get full access
Reinforcement Learning 2014 Sutton Barto