Chapter 01: Bandits
Chapter 02: Markov Decision Processes
Chapter 03: Dynamic Programming
Chapter 04: Monte Carlo Methods
Chapter 05: Temporal-Difference Learning
Chapter 07: n-step Bootstrapping
Chapter 08: Planning and Learning with Tabular Methods
Chapter 12: Eligibility Traces