The Wayback Machine - https://web.archive.org/web/20201108121902/https://github.com/topics/proximal-policy-optimization
Skip to content
#

proximal-policy-optimization

Here are 79 public repositories matching this topic...

PyTorch implementation of Advantage Actor Critic (A2C), Proximal Policy Optimization (PPO), Scalable trust-region method for deep reinforcement learning using Kronecker-factored approximation (ACKTR) and Generative Adversarial Imitation Learning (GAIL).
  • Updated Aug 29, 2020
  • Python

This repository contains most of pytorch implementation based classic deep reinforcement learning algorithms, including - DQN, DDQN, Dueling Network, DDPG, SAC, A2C, PPO, TRPO. (More algorithms are still in progress)
  • Updated Nov 15, 2019
  • Python

PyTorch implementation of some reinforcement learning algorithms: Advantage Actor Critic (A2C), Proximal Policy Optimization (PPO), V-MPO, Behavior Cloning (BC). More algorithms will be added.
  • Updated Oct 16, 2020
  • Python

Improve this page

Add a description, image, and links to the proximal-policy-optimization topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the proximal-policy-optimization topic, visit your repo's landing page and select "manage topics."

Learn more

You can’t perform that action at this time.