OpenAI
April 18, 2018
General AI

Evolved Policy Gradients

Overview

We're releasing an experimental metalearning approach called Evolved Policy Gradients (EPG), which evolves the loss function of learning agents so that they can train quickly on novel tasks. Agents trained with EPG can succeed at basic test-time tasks that were outside their training regime, such as learning to navigate to an object on a different side of the room from where it was placed during training.
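
The idea of evolving a loss function can be illustrated with a deliberately tiny sketch. It is not the released EPG code, and every name in it is hypothetical: the agent is a single number `theta` (its position on a line), the evolved loss has only two parameters `phi`, its gradient is written out by hand, and the outer loop is a simple elitist random search standing in for whatever evolutionary method the full article describes. The inner loop trains the agent on the evolved loss alone; the outer loop keeps loss parameters that lead to a higher final return.

```python
import random


def inner_train(phi, target, steps=20, lr=0.1):
    """Train a one-parameter agent by gradient descent on the evolved loss.

    Hypothetical toy loss (an assumption, not the EPG loss):
        L_phi(theta, s) = phi[0] * theta * s + phi[1] * theta**2
    where s = target - theta is the sensed offset to the object, treated
    as a constant observation when differentiating.
    Returns the final return: the negative squared distance to the target.
    """
    theta = 0.0  # the agent always starts at the origin
    for _ in range(steps):
        s = target - theta
        grad = phi[0] * s + 2.0 * phi[1] * theta  # dL/dtheta
        theta -= lr * grad
    return -(theta - target) ** 2


def fitness(phi, targets):
    """Average final return over a set of training targets."""
    return sum(inner_train(phi, g) for g in targets) / len(targets)


def evolve_loss(train_targets, generations=60, children=8, sigma=0.3, seed=0):
    """Elitist random search over the loss parameters phi.

    Each candidate is a Gaussian perturbation of the current best phi and
    replaces it only if it yields a strictly higher training fitness.
    """
    rng = random.Random(seed)
    best = [0.0, 0.0]
    best_fit = fitness(best, train_targets)
    for _ in range(generations):
        for _ in range(children):
            cand = [p + sigma * rng.gauss(0.0, 1.0) for p in best]
            cand_fit = fitness(cand, train_targets)
            if cand_fit > best_fit:
                best, best_fit = cand, cand_fit
    return best


# Training targets all lie on the positive side of the origin.
train_targets = [0.5, 1.0, 1.5]
phi = evolve_loss(train_targets)

# Test-time target on the opposite side, never seen while evolving phi.
baseline = inner_train([0.0, 0.0], -1.0)  # zero loss: the agent never moves
evolved = inner_train(phi, -1.0)
print("return with zero loss:   ", baseline)
print("return with evolved loss:", evolved)
```

In this toy the agent's update is linear in the target, so a loss that helps on the training side also helps on the opposite side. That transfer is a property of the simplification and says nothing about how well the real method generalizes.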

Originally published by OpenAI