Back to News Hub
🤖OpenAI
July 20, 2017
General AI

Proximal Policy Optimization

Overview

We're releasing a new class of reinforcement learning algorithms, Proximal Policy Optimization (PPO), which perform comparably or better than state-of-the-art approaches while being much simpler to implement and tune. PPO has become the default reinforcement learning algorithm at OpenAI because of its ease of use and good performance.

Read the full story at OpenAI

This publisher only syndicates a short excerpt by RSS. The full article — with all the detail, quotes, and context — lives on their site.

Open original article

Continue Learning

Originally published by OpenAI
Read the original

Comments

Sign in to join the conversation