n-step Bootstrapping
Chapter 7 · Random Walk · TD(0) vs TD(n) vs MC
Controls
episodes: 0
RMSE TD(0): -
RMSE TD(n): -
RMSE MC: -
Cheat Sheet
The same 5-state random walk as in Sutton and Barto: each episode starts at C and moves left or right with equal probability until it terminates.
The reward is 1 only if the walk terminates at R; it is 0 if the walk terminates at L.
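A minimal Python sketch of the comparison, not the code behind this page. It assumes states A to E are indexed 0 to 4, all values start at 0.5, there is no discounting, and RMSE is measured against the known true values 1/6 through 5/6. The names `run_episode`, `n_step_update`, and `rmse`, and the step size, n, seed, and episode count are illustrative choices. Setting n = 1 gives TD(0), and an n longer than any episode gives constant-step-size Monte Carlo.

```python
import math
import random

TRUE_V = [i / 6 for i in range(1, 6)]  # true values of A..E


def run_episode(rng):
    """Walk from C (index 2) until stepping off either end.

    Returns (states, rewards), where rewards[t] is the reward received
    on the transition out of states[t].
    """
    s = 2
    states, rewards = [s], []
    while True:
        s += rng.choice((-1, 1))
        if s < 0:            # terminated on the left: reward 0
            rewards.append(0.0)
            return states, rewards
        if s > 4:            # terminated on the right: reward 1
            rewards.append(1.0)
            return states, rewards
        rewards.append(0.0)  # ordinary step: reward 0
        states.append(s)


def n_step_update(V, states, rewards, n, alpha, gamma=1.0):
    """Apply n-step TD updates for one episode, in time order, in place."""
    T = len(states)
    for t in range(T):
        end = min(t + n, T)
        # Discounted sum of up to n rewards following time t.
        G = sum(gamma ** (k - t) * rewards[k] for k in range(t, end))
        # Bootstrap from the current estimate if the episode has not ended
        # within n steps; the terminal state has value 0.
        if t + n < T:
            G += gamma ** n * V[states[t + n]]
        V[states[t]] += alpha * (G - V[states[t]])


def rmse(V):
    """Root-mean-square error of V against the true state values."""
    return math.sqrt(sum((v - t) ** 2 for v, t in zip(V, TRUE_V)) / len(V))


if __name__ == "__main__":
    rng = random.Random(0)  # assumed seed, for repeatability only
    methods = {"TD(0)": 1, "TD(n), n=3": 3, "MC": 10**9}
    values = {name: [0.5] * 5 for name in methods}
    for _ in range(100):
        states, rewards = run_episode(rng)  # same episode for every method
        for name, n in methods.items():
            n_step_update(values[name], states, rewards, n, alpha=0.05)
    for name in methods:
        print(f"RMSE {name}: {rmse(values[name]):.3f}")
```

Feeding every method the same episodes makes the RMSE readings differ only because of the update rule, not because of sampling luck.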