Pieter Abbeel

13 posts

Evolved Policy Gradients

Interpretable Machine Learning through Teaching

Learning a Hierarchy

Generalizing from Simulation

Meta-Learning for Wrestling

Learning to Model Other Minds

Better Exploration with Parameter Noise

Better Exploration with Parameter Noise

We've found that adding adaptive noise to the parameters of reinforcement learning algorithms frequently boosts performance. This exploration method is simple to implement and very rarely decreases performance, so it's worth trying on any problem.


4 minute read

Learning to Cooperate, Compete, and Communicate

Robots that Learn

Robots that Learn

We've created a robotics system, trained entirely in simulation and deployed on a physical robot, which can learn a new task after seeing it done once.


3 minute read

Spam Detection in the Physical World

Learning to Communicate

Attacking Machine Learning with Adversarial Examples

Generative Models

Generative Models

This post describes four projects that share a common theme of enhancing or using generative models, a branch of unsupervised learning techniques in machine learning.


12 minute read