AI, ML, RL

RL Problem: A Formal Definition

RL deals with complex sequential decision-making under uncertainty

Agent and Environment

“Repeated” interactions

Goal: teach the agent how to behave by telling it how well it is doing

  1. the agent receives an observation from the environment
  2. the agent takes an action
  3. the environment’s state changes according to the agent’s action
  4. the environment reacts with a new observation and a reward
  5. repeat
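
The loop above can be sketched in a few lines of Python. This is only an illustrative sketch: the `LineWorld` environment, the `RandomAgent`, and the `run_episode` helper are hypothetical names invented for this example, and the reward scheme (1.0 on reaching the last cell, 0.0 otherwise) is an assumption rather than part of the definition.

```python
import random


class LineWorld:
    """Hypothetical toy environment: cells 0..size-1, reward at the last cell."""

    def __init__(self, size=5):
        self.size = size
        self.state = 0

    def reset(self):
        self.state = 0
        return self.state  # the observation is simply the current cell

    def step(self, action):
        # The state changes according to the agent's action (-1 left, +1 right),
        # clipped so the agent cannot leave the line.
        self.state = min(max(self.state + action, 0), self.size - 1)
        done = self.state == self.size - 1
        reward = 1.0 if done else 0.0  # assumed reward scheme
        return self.state, reward, done


class RandomAgent:
    """Hypothetical agent that ignores the observation and acts at random."""

    def __init__(self, seed=0):
        self.rng = random.Random(seed)

    def act(self, observation):
        return self.rng.choice([-1, 1])


def run_episode(env, agent, max_steps=100):
    observation = env.reset()                         # 1. observation
    total_reward = 0.0
    steps = 0
    while steps < max_steps:
        action = agent.act(observation)               # 2. take an action
        observation, reward, done = env.step(action)  # 3-4. new state, observation, reward
        total_reward += reward
        steps += 1
        if done:
            break                                     # otherwise: 5. repeat
    return total_reward, steps


if __name__ == "__main__":
    print(run_episode(LineWorld(), RandomAgent(seed=0)))
```

The random agent here never uses the reward. A learning agent would use it to adjust how it chooses actions, which is the "telling it how well it is doing" part of the goal.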

Terminology