RPC2 / PPOLinks
A concise PyTorch implementation of Proximal Policy Optimization(PPO) solving CartPole-v0
☆16Updated 5 years ago
Alternatives and similar repositories for PPO
Users that are interested in PPO are comparing it to the libraries listed below
Sorting:
- A simple implementation of Generative Adversarial Imitation Learning with PyTorch☆174Updated 3 years ago
- Assignments for Berkeley CS 285: Deep Reinforcement Learning (Fall 2020)☆250Updated 4 years ago
- ☆1,311Updated last year
- Tools for accelerating safe exploration research.☆581Updated 2 years ago
- Code for the paper "When to Trust Your Model: Model-Based Policy Optimization"☆529Updated 3 years ago
- PyTorch implementation of Soft Actor-Critic (SAC)☆586Updated 4 years ago
- Imitation learning algorithms☆559Updated 10 months ago
- Author's PyTorch implementation of TD3+BC, a simple variant of TD3 for offline RL☆391Updated 4 years ago
- A gym environment for a miniature racecar using the pybullet physics engine.☆218Updated 2 years ago
- Mirror of Stable-Baselines: a fork of OpenAI Baselines, implementations of reinforcement learning algorithms☆306Updated 2 years ago
- Assignments for Berkeley CS 285: Deep Reinforcement Learning (Fall 2021)☆145Updated 3 years ago
- Basic constrained RL agents used in experiments for the "Benchmarking Safe Exploration in Deep Reinforcement Learning" paper.☆454Updated 2 years ago
- A repository of high-performing hierarchical reinforcement learning models and algorithms.☆334Updated 2 years ago
- VMAS is a vectorized differentiable simulator designed for efficient Multi-Agent Reinforcement Learning benchmarking. It is comprised of …☆516Updated 2 months ago
- RL starter files in order to immediately train, visualize and evaluate an agent without writing any line of code☆714Updated last year
- Lightweight multi-agent gridworld Gym environment☆213Updated 2 years ago
- Contrib package for Stable-Baselines3 - Experimental reinforcement learning (RL) code☆690Updated last week
- Inverse RL algorithms (APP, MaxEnt, GAIL, VAIL)☆772Updated 2 years ago
- Stable-Baselines tutorial for Journées Nationales de la Recherche en Robotique 2019☆727Updated 2 years ago
- Prioritized Experience Replay (PER) implementation in PyTorch☆359Updated 6 years ago
- Benchmark for Continuous Multi-Agent Robotic Control, based on OpenAI's Mujoco Gym environments.☆368Updated 2 years ago
- The RL discord wiki☆258Updated 5 years ago
- Code for conservative Q-learning☆471Updated 4 years ago
- A standard format for offline reinforcement learning datasets, with popular reference datasets and related utilities☆487Updated 3 weeks ago
- PyTorch implementation of Hierarchical Actor Critic (HAC) for OpenAI gym environments☆335Updated 4 years ago
- PyTorch implementation of Soft Actor-Critic + Autoencoder(SAC+AE)☆253Updated 5 years ago
- ☆315Updated 4 years ago
- This is the pytorch implementation of Hindsight Experience Replay (HER) - Experiment on all fetch robotic environments.☆441Updated 4 years ago
- Simple and easily configurable 3D FPS-game-like environments for reinforcement learning☆754Updated last month
- A basic 2D maze environment where an agent start from the top left corner and try to find its way to the bottom left corner.☆375Updated 2 years ago