Reinforcement Learning

Learn:
- Transition probabilities
- Reward function (the discount factor is usually fixed in advance rather than learned)
- Optimal policy

Two main approaches:
- Model-based: learn the model (transition probabilities and rewards), then infer the policy
- Model-free: learn the policy directly, without learning explicit model parameters
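
As a rough illustration of the first approach (learn the model, then infer the policy), the sketch below estimates transition probabilities and rewards by counting observed transitions, then runs value iteration on the estimated model. The function names `estimate_model` and `value_iteration`, the tiny 2-state example, and the value `gamma = 0.9` are assumptions made for illustration; they are not taken from these notes.

```python
import numpy as np

def estimate_model(transitions, n_states, n_actions):
    """Estimate P[s, a, s'] and R[s, a] from (s, a, r, s') samples by counting."""
    counts = np.zeros((n_states, n_actions, n_states))
    reward_sum = np.zeros((n_states, n_actions))
    for s, a, r, s_next in transitions:
        counts[s, a, s_next] += 1
        reward_sum[s, a] += r
    visits = counts.sum(axis=2)
    safe_visits = np.maximum(visits, 1)
    # Unvisited (s, a) pairs fall back to uniform transitions and zero reward.
    P = np.where(visits[:, :, None] > 0,
                 counts / safe_visits[:, :, None],
                 1.0 / n_states)
    R = reward_sum / safe_visits
    return P, R

def value_iteration(P, R, gamma=0.9, tol=1e-8):
    """Infer a greedy policy from the estimated model."""
    V = np.zeros(P.shape[0])
    while True:
        Q = R + gamma * (P @ V)      # shape (n_states, n_actions)
        V_new = Q.max(axis=1)
        if np.max(np.abs(V_new - V)) < tol:
            return Q.argmax(axis=1), V_new
        V = V_new

# Hypothetical experience: action 0 leads to state 0, action 1 leads to state 1.
transitions = [
    (0, 0, 0.0, 0), (0, 1, 1.0, 1),
    (1, 0, 0.0, 0), (1, 1, 2.0, 1),
]
P, R = estimate_model(transitions, n_states=2, n_actions=2)
policy, V = value_iteration(P, R)
```

The quality of the resulting policy depends entirely on how well the counts approximate the true model, which is the cost the second approach tries to avoid.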

Avoid the curse of modeling: learn directly from experience (sampled transitions) instead of estimating an explicit model!
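
One common way to learn from experience without a model is tabular Q-learning, sketched below. The notes do not name a specific algorithm, so this is only one possible choice. The 2-state environment, the `step` helper, and all hyperparameters (`alpha`, `gamma`, `epsilon`, `n_steps`) are hypothetical values picked for the example.

```python
import numpy as np

# Hypothetical 2-state, 2-action environment. The learner never reads these
# tables directly; it only observes sampled (reward, next_state) pairs.
NEXT_STATE = np.array([[0, 1], [0, 1]])
REWARD = np.array([[0.0, 1.0], [0.0, 2.0]])

def step(state, action):
    """Return the reward and next state for one interaction."""
    return REWARD[state, action], int(NEXT_STATE[state, action])

def q_learning(n_states, n_actions, n_steps=20000,
               alpha=0.1, gamma=0.9, epsilon=0.3, seed=0):
    rng = np.random.default_rng(seed)
    Q = np.zeros((n_states, n_actions))
    state = 0
    for _ in range(n_steps):
        # Epsilon-greedy: explore with probability epsilon, otherwise act greedily.
        if rng.random() < epsilon:
            action = int(rng.integers(n_actions))
        else:
            action = int(np.argmax(Q[state]))
        reward, next_state = step(state, action)
        # Move Q(s, a) toward the sampled one-step target.
        td_target = reward + gamma * Q[next_state].max()
        Q[state, action] += alpha * (td_target - Q[state, action])
        state = next_state
    return Q

Q = q_learning(n_states=2, n_actions=2)
policy = Q.argmax(axis=1)   # greedy policy read off the learned Q-table
```

No transition probabilities or reward table are ever estimated here; the Q-table is updated from individual experienced transitions only.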