Proximal Policy Optimization
Dive into the Unknown
Dec 22, 2022 · 9 min read

Series
This series compiles a comparative study of different Reinforcement Learning algorithms, done in collaboration with my B.Sc. students @JoBo, @tobias-huerten, @LucaLiberto and @FlorianSpelter.

Understanding the Core Concepts

Temporal-Difference Learning and On vs Off-Policy Learning

Deterministic and Stochastic Policies Explained

Mastering the Game of Go without Human Knowledge

Getting started with Reinforcement Learning!
