Bick95 / PPOLinks
Comprehensive Implementation of Proximal Policy Optimization
☆12Updated 4 years ago
Alternatives and similar repositories for PPO
Users that are interested in PPO are comparing it to the libraries listed below
Sorting:
- Code for the NeurIPS 2021 paper "Deep Bandits Show-Off: Simple and Efficient Exploration with Deep Networkst"☆14Updated 3 years ago
- ☆29Updated last year
- PyTorch implementation of "Model-based Reinforcement Learning for Semi-Markov Decision Processes with Neural ODEs", NeurIPS 2020☆46Updated 5 years ago
- A novel parallel UCT algorithm with linear speedup and negligible performance loss.☆122Updated 4 years ago
- An implementation of MuZero in JAX.☆57Updated 3 years ago
- Code to reproduce the experiments in The Mirage of Action-Dependent Baselines in Reinforcement Learning.☆17Updated 7 years ago
- This code implements Prioritized Level Replay, a method for sampling training levels for reinforcement learning agents that exploits the …☆92Updated 4 years ago
- QuaRL is an open-source framework for systematically studying the effect of applying quantization to reinforcement learning algorithms.☆79Updated 2 years ago
- ☆18Updated 6 years ago
- Explorer is a PyTorch reinforcement learning framework for exploring new ideas.☆97Updated 7 months ago
- Pytorch implementation of Soft Actor-Critic☆20Updated 5 years ago
- Code and links for over 25,000 trained Atari agents☆98Updated last year
- ☆135Updated last year
- A Jax/Stax implementation of the general meta learning paper: Oh, J., Hessel, M., Czarnecki, W.M., Xu, Z., van Hasselt, H.P., Singh, S. a…☆23Updated 5 years ago
- Code release for Learning with Opponent-Learning Awareness and variations.☆151Updated 2 years ago
- Implementation of VariBAD: A Very Good Method for Bayes-Adaptive Deep RL via Meta-Learning - Zintgraf et al. (ICLR 2020)☆198Updated 2 years ago
- CleanRL's implementation of DeepMind's Podracer Sebulba Architecture for Distributed DRL☆122Updated last year
- Minimal implementation of the network layers of the paper "Noisy Networks for Exploration" using Pytorch.☆13Updated 10 months ago
- A Multi-agent Learning Framework☆62Updated 4 years ago
- Pytorch implementation of LOLA (https://arxiv.org/abs/1709.04326) using DiCE (https://arxiv.org/abs/1802.05098)☆96Updated 7 years ago
- Assignments for Berkeley CS 285: Deep Reinforcement Learning (Fall 2021)☆145Updated 3 years ago
- Plug-and-play hydra sweepers for the EA-based multifidelity method DEHB and several population-based training variations, all proven to e…☆85Updated 2 years ago
- ☆66Updated last year
- Code for the paper "Batch size invariance for policy optimization"☆56Updated 2 years ago
- Learning to Incentivize Other Learning Agents☆35Updated 3 years ago
- Open source demo for the paper Learning to Score Behaviors for Guided Policy Optimization☆24Updated 5 years ago
- Controlgym: Large-Scale Control Environments for Benchmarking Reinforcement Learning Algorithms☆49Updated last year
- 🧶 Minimal PyTorch Soft Actor Critic (SAC) implementation☆38Updated 3 years ago
- Awesome RL: Papers, Books, Codes, Benchmarks☆119Updated 2 years ago
- Implementation of the Model-Based Meta-Policy-Optimization (MB-MPO) algorithm☆44Updated 7 years ago