maitchison / PPOLinks
Example implemention of the Proximal Policy Optimization algorithm
☆17Updated last year
Alternatives and similar repositories for PPO
Users that are interested in PPO are comparing it to the libraries listed below
Sorting:
- PyTorch Implementation of Implicit Quantile Networks (IQN) for Distributional Reinforcement Learning with additional extensions like PER,…☆90Updated 2 years ago
- Datasets for data-driven deep reinforcement learning with Atari (wrapper for datasets released by Google)☆126Updated last year
- Code accompanying the paper "TiZero: Mastering Multi-Agent Football with Curriculum Learning and Self-Play" (AAMAS 2023) 足球游戏智能体☆67Updated 2 years ago
- Drop-in environment replacements that make your RL algorithm train faster.☆21Updated last year
- Code for NeurIPS2023 accepted paper: Counterfactual Conservative Q Learning for Offline Multi-agent Reinforcement Learning.☆40Updated 11 months ago
- Contains implementation of the FILTER algorithm for exponentially faster inverse reinforcement learning.☆53Updated 2 years ago
- Implementations of Multi-Task and Meta-Learning baselines for the Metaworld benchmark☆32Updated 5 months ago
- Benchmarked implementations of Offline RL Algorithms.☆76Updated 10 months ago
- This code accompanies the paper "Scalable Multi-Agent Model-Based Reinforcement Learning".☆62Updated 9 months ago
- Maximum Entropy Population Based Training for Zero-Shot Human-AI Coordination☆26Updated 3 years ago
- TeachMyAgent is a testbed platform for Automatic Curriculum Learning methods in Deep RL.☆76Updated 2 years ago
- AGAC: Adversarially Guided Actor-Critic☆47Updated 4 years ago
- When Do Transformers Shine in RL? Decoupling Memory from Credit Assignment, NeurIPS 2023 (oral)☆67Updated 2 years ago
- Reproduction of Dreamerv1 and v2 in pytorch for deepmind control suite☆47Updated 3 years ago
- Generalized Decision Transformer for Offline Hindsight Information Matching (ICLR2022)☆70Updated 3 years ago
- Simple single-file baselines for Q-Learning in pure-GPU setting☆233Updated 2 months ago
- Scalable Opponent Shaping Experiments in JAX☆25Updated last year
- Challenging Memory-based Deep Reinforcement Learning Agents☆109Updated last year
- Code for "Data-Efficient Reinforcement Learning with Self-Predictive Representations"☆163Updated 4 years ago
- MR.Q is a general-purpose model-free reinforcement learning algorithm.☆132Updated 7 months ago
- MARS is shortened for Multi-Agent Research Studio, a library for mulit-agent reinforcement learning research.☆50Updated last year
- Pytorch Implementation of MuZero Unplugged for gym environment. This algorithm is capable of supporting a wide range of action and observ…☆35Updated 7 months ago
- Source code for Interpretable Reward Redistribution in Reinforcement Learning: A Causal Approach (NeurIPS 2023)☆10Updated 2 years ago
- [NeurIPS 2022] Open source code for reusing prior computational work in RL.☆100Updated 2 years ago
- Code for TRANSDREAMER: REINFORCEMENT LEARNING WITH TRANSFORMER WORLD MODELS☆29Updated 2 years ago
- ExORL: Exploratory Data for Offline Reinforcement Learning☆125Updated 3 years ago
- Object Centric Atari games☆98Updated last month
- Codes for "Efficient Offline Policy Optimization with a Learned Model", ICLR2023☆30Updated 2 years ago
- ☆43Updated 2 years ago
- Implementation of Trajectory Transformer with attention caching and batched beam search☆116Updated 2 years ago