## Reinforcement Learning | Study

### 1. Markov Process / Markov chain

#### 1.1. Markov process

A Markov process, or Markov chain, is a tuple $\langle S,P \rangle$ where $S$ is a finite set of states and $P$ is a transition probability matrix. In a Markov process, the initial state must be given. How we choose the initial state is not a role of […]
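
To make the tuple $\langle S,P \rangle$ concrete, here is a minimal sketch of sampling a trajectory from a small chain. The two weather states, the probabilities, and the helper names `check_transition_matrix` and `sample_chain` are illustrative assumptions, not part of the original text. The sketch also assumes the common convention that row $i$ of $P$ holds the probabilities of moving from state $i$ to each state, so every row sums to 1. The initial state is passed in explicitly, reflecting that it has to be given from outside the chain.

```python
import random

# Hypothetical two-state chain; states and probabilities are illustrative only.
states = ["sunny", "rainy"]

# Assumed convention: P[i][j] is the probability of moving from states[i] to states[j].
P = [
    [0.9, 0.1],
    [0.5, 0.5],
]


def check_transition_matrix(P, tol=1e-9):
    """Return True if P is square, non-negative, and every row sums to 1."""
    n = len(P)
    return all(
        len(row) == n
        and all(p >= 0 for p in row)
        and abs(sum(row) - 1.0) <= tol
        for row in P
    )


def sample_chain(states, P, initial_state, n_steps, rng):
    """Sample a path of n_steps transitions starting from the given initial state."""
    idx = states.index(initial_state)
    path = [states[idx]]
    for _ in range(n_steps):
        # The next state depends only on the current state (the Markov property).
        idx = rng.choices(range(len(states)), weights=P[idx])[0]
        path.append(states[idx])
    return path


rng = random.Random(0)  # fixed seed so repeated runs give the same path
path = sample_chain(states, P, "sunny", 5, rng)
print(path)
```

The returned path has `n_steps + 1` entries because it includes the initial state. Swapping in a different `initial_state` changes where the path starts but leaves $S$ and $P$ untouched.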