Basic procedure

Generalized Policy Iteration (GPI)

Policy iteration, independent of the details of PE and PI processes

Monte Carlo Control

Untitled

Drawback of MC control

SARSA (State-Action-Reward-State-Action)