## Reinforcement Learning | Study

References

- Reinforcement Learning – Algorithms [LINK]: about Q-learning and SARSA
- REINFORCEMENT LEARNING PART 2: SARSA VS Q-LEARNING [LINK]: about Q-learning and SARSA

1. Markov Process / Markov chain

1.1. Markov process

A Markov process, or Markov chain, is a tuple $\langle S,P \rangle$ where $S$ is a finite set of states and $P$ is a transition probability matrix. In a Markov process, the initial state must be given. How the initial state is chosen is not a role of […]
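To make the tuple $\langle S,P \rangle$ concrete, here is a minimal sketch in Python that samples a trajectory from a Markov chain. The state names, the matrix values, and the helper `sample_chain` are all hypothetical and chosen only for illustration. The sketch assumes row `i` of `P` holds the probabilities of moving from state `i` to every other state, and that the initial state is supplied by the caller, as the definition requires.

```python
import random

def sample_chain(states, P, start, n_steps, seed=0):
    """Sample a trajectory of n_steps transitions from the chain <states, P>.

    states : list of state labels (the finite set S)
    P      : row-stochastic matrix as a list of lists; P[i][j] is the
             probability of moving from states[i] to states[j]
    start  : the given initial state (must be an element of states)
    """
    # Each row of a transition matrix must sum to 1.
    for row in P:
        assert abs(sum(row) - 1.0) < 1e-9, "each row of P must sum to 1"

    rng = random.Random(seed)      # local generator, so runs are repeatable
    idx = states.index(start)
    path = [start]
    for _ in range(n_steps):
        # The next state depends only on the current state (Markov property).
        idx = rng.choices(range(len(states)), weights=P[idx])[0]
        path.append(states[idx])
    return path

# Hypothetical two-state example.
states = ["sunny", "rainy"]
P = [
    [0.9, 0.1],   # transitions out of "sunny"
    [0.5, 0.5],   # transitions out of "rainy"
]
path = sample_chain(states, P, start="sunny", n_steps=10)
print(path)
```

The returned list has `n_steps + 1` entries because it includes the initial state. Note that the chain itself says nothing about where `start` comes from; it is an input to the sampler.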

Brief information

- Instructor: David Silver
- Course homepage: [LINK]
- Video lecture list: [LINK]

Lecture schedule

- Lecture 1: Introduction to Reinforcement Learning
- Lecture 2: Markov Decision Processes
- Lecture 3: Planning by Dynamic Programming
- Lecture 4: Model-Free Prediction
- Lecture 5: Model-Free Control
- Lecture 6: Value Function Approximation
- Lecture 7: Policy Gradient Methods
- Lecture 8: Integrating Learning and Planning

[…]
