Summary of General MDP Model
Input parameters:
- A countable (finite) set of states, S = {s1, …, sn}
- A countable (finite) set of actions, A = {a1, …, am}
- Action transitions: n^2 m transition probabilities of the form Prob(sj | si, ak), one for each pair of states (si, sj) and each action ak
- A value function of the form v(·) → ℝ
- mapping from system trajectories or histories into the real numbers
- A fixed or infinite horizon N
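
As a rough illustration, the input parameters above could be collected into a single container. This is only a sketch under stated assumptions: the class name MDP, the field names, the dictionary layout for transition probabilities, the encoding of a history as an alternating state/action sequence, and the toy numbers are all hypothetical and are not part of the model definition itself.

```python
from dataclasses import dataclass
from typing import Callable, Dict, List, Optional, Sequence, Tuple
import math

# Assumption: a history is an alternating sequence s0, a0, s1, a1, ..., sT.
History = Sequence[str]


@dataclass
class MDP:
    states: List[str]                    # S = {s1, ..., sn}
    actions: List[str]                   # A = {a1, ..., am}
    # P[(si, ak)][sj] = Prob(sj | si, ak)
    P: Dict[Tuple[str, str], Dict[str, float]]
    value: Callable[[History], float]    # maps a history to a real number
    horizon: Optional[int]               # N; None stands for an infinite horizon

    def validate(self) -> None:
        """Check that each (state, action) pair has a proper distribution."""
        for s in self.states:
            for a in self.actions:
                row = self.P[(s, a)]
                total = sum(row.get(t, 0.0) for t in self.states)
                if not math.isclose(total, 1.0, abs_tol=1e-9):
                    raise ValueError(f"Prob(. | {s}, {a}) sums to {total}")

    def num_transition_probabilities(self) -> int:
        """n^2 * m entries: one per (si, sj, ak) triple."""
        return len(self.states) ** 2 * len(self.actions)


def count_s2_visits(history: History) -> float:
    # Hypothetical value function: number of times s2 appears in the history.
    return float(sum(1 for x in history if x == "s2"))


# Toy instance with n = 2 states and m = 2 actions (numbers are made up).
example = MDP(
    states=["s1", "s2"],
    actions=["a1", "a2"],
    P={
        ("s1", "a1"): {"s1": 0.9, "s2": 0.1},
        ("s1", "a2"): {"s1": 0.2, "s2": 0.8},
        ("s2", "a1"): {"s1": 0.0, "s2": 1.0},
        ("s2", "a2"): {"s1": 0.5, "s2": 0.5},
    },
    value=count_s2_visits,
    horizon=3,
)
example.validate()
```

Note that the value function here takes a whole history rather than a single state, matching the general form above; more specialized reward structures would be particular choices of this mapping.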