Reinforcement Learning
Avoid curse of modeling - Use experience instead!
Given only observed state and reward information,
Learn:
- Transition probabilities
- Reward function and discount factor
- Optimal policy
Two main approaches:
- learn the model then infer the policy
- learn the policy without learning the explicit model parameters