Oleg Klimov

5 posts

Retro Contest: Results

Gym Retro

Retro Contest

Retro Contest

We're launching a transfer learning contest that measures a reinforcement learning algorithm's ability to generalize from previous experience.


4 minute read

Proximal Policy Optimization

Proximal Policy Optimization

We’re releasing a new class of reinforcement learning algorithms, Proximal Policy Optimization (PPO), which perform comparably or better than state-of-the-art approaches while being much simpler to implement and tune.


3 minute read

Roboschool