#machine-learning
Read more stories on Hashnode
Articles with this tag
Dive into the Unknown · Schulman et al. suggest a new policy gradient-based reinforcement learning approach that maintains some of the advantages of...
Understanding the Core Concepts · This post will introduce the core concepts underlying various policy gradient algorithms. As opposed to previously...
Temporal-Difference Learning and On vs Off-Policy Learning · Today, we'll look at two RL algorithms that appear to be identical on the surface but have a...
Deterministic and Stochastic Policies Explained · In Reinforcement Learning (RL), a policy is a description of how an agent behaves given its current...
Mastering the Game of Go without Human Knowledge · In October 2015 DeepMind's AlphaGo Fan beat the European Go champion Fan Hui in the game of Go...
Why should you be concerned about the future of technology and your life in general? · The technological singularity is a hypothesis that the creation of...