## Lecture 2: Markov Decision Processes | Reinforcement Learning | David Silver | Course

1. Markov Process / Markov chain

1.1. Markov process

A Markov process, or Markov chain, is a tuple $\langle \mathcal{S}, \mathcal{P} \rangle$ such that $\mathcal{S}$ is a finite set of states and $\mathcal{P}$ is a transition probability matrix. The initial state must be given: how it is chosen is not part of the Markov process itself.

1.2. State […]
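
To make the definition in 1.1 concrete, here is a minimal Python sketch of a Markov chain as a state set plus a transition matrix. The three state names, the probabilities, and the helper names `validate` and `sample_chain` are all hypothetical and chosen only for illustration; they do not come from the lecture. Note that the initial state is passed in by the caller, reflecting the point that choosing it is outside the Markov process.

```python
import random

# Hypothetical 3-state chain; names and probabilities are illustrative only.
STATES = ["sunny", "cloudy", "rainy"]

# Transition matrix P: P[i][j] is the probability of moving from state i to state j.
P = [
    [0.7, 0.2, 0.1],
    [0.3, 0.4, 0.3],
    [0.2, 0.3, 0.5],
]


def validate(P, tol=1e-9):
    """Return True if P is square, non-negative, and every row sums to 1."""
    n = len(P)
    return all(
        len(row) == n
        and all(p >= 0 for p in row)
        and abs(sum(row) - 1.0) <= tol
        for row in P
    )


def sample_chain(P, initial_state, steps, rng):
    """Sample a trajectory of state indices.

    The initial state is supplied by the caller; the chain itself only
    defines how to move from the current state to the next one.
    """
    state = initial_state
    trajectory = [state]
    for _ in range(steps):
        # The next state depends only on the current state (row P[state]).
        state = rng.choices(range(len(P)), weights=P[state])[0]
        trajectory.append(state)
    return trajectory


if __name__ == "__main__":
    rng = random.Random(0)  # fixed seed so repeated runs give the same path
    path = sample_chain(P, initial_state=0, steps=5, rng=rng)
    print([STATES[s] for s in path])
```

Each row of `P` is a probability distribution over next states, which is why `validate` checks that the rows, not the columns, sum to 1.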