maitchison / PPOLinks
Example implemention of the Proximal Policy Optimization algorithm
☆17Updated last year
Alternatives and similar repositories for PPO
Users that are interested in PPO are comparing it to the libraries listed below
Sorting:
- Datasets for data-driven deep reinforcement learning with Atari (wrapper for datasets released by Google)☆126Updated last year
- Implementation of Trajectory Transformer with attention caching and batched beam search☆116Updated 2 years ago
- ☆14Updated 4 months ago
- The Arcade Learning Environment (ALE) -- a platform for AI research.☆24Updated last year
- ☆116Updated 2 years ago
- PyTorch Implementation of Implicit Quantile Networks (IQN) for Distributional Reinforcement Learning with additional extensions like PER,…☆90Updated 2 years ago
- Official code repository for Prompt-DT.☆121Updated 3 years ago
- Generalized Decision Transformer for Offline Hindsight Information Matching (ICLR2022)☆70Updated 3 years ago
- Experiments with transformer based RL algorithms☆22Updated 6 years ago
- Code for NeurIPS2023 accepted paper: Counterfactual Conservative Q Learning for Offline Multi-agent Reinforcement Learning.☆40Updated 11 months ago
- ☆19Updated 2 years ago
- LAMBDA is a model-based reinforcement learning agent that uses Bayesian world models for safe policy optimization☆38Updated 3 years ago
- This code accompanies the paper "Scalable Multi-Agent Model-Based Reinforcement Learning".☆63Updated 10 months ago
- Offline Multi-Agent Reinforcement Learning Implementations: Solving Overcooked Game with Data-Driven Method☆46Updated last year
- Clean baseline implementation of PPO using an episodic TransformerXL memory☆204Updated last year
- Drop-in environment replacements that make your RL algorithm train faster.☆21Updated last year
- When Do Transformers Shine in RL? Decoupling Memory from Credit Assignment, NeurIPS 2023 (oral)☆67Updated 2 years ago
- Code for MOPO: Model-based Offline Policy Optimization☆191Updated 3 years ago
- Benchmarked implementations of Offline RL Algorithms.☆76Updated 11 months ago
- on-policy optimization baselines for deep reinforcement learning☆32Updated 5 years ago
- Official code for "Can Wikipedia Help Offline Reinforcement Learning?" by Machel Reid, Yutaro Yamada and Shixiang Shane Gu☆106Updated 3 years ago
- POMDP wrappers for OpenAI Gym☆15Updated 6 years ago
- Adaptive Attention Span for Reinforcement Learning☆136Updated 5 years ago
- Object Centric Atari games☆99Updated 2 months ago
- Implementation of Multi-Game Decision Transformers in PyTorch☆49Updated 3 years ago
- PantheonRL is a package for training and testing multi-agent reinforcement learning environments. PantheonRL supports cross-play, fine-tu…☆157Updated 2 years ago
- AGAC: Adversarially Guided Actor-Critic☆47Updated 4 years ago
- Implementations of Multi-Task and Meta-Learning baselines for the Metaworld benchmark☆32Updated 5 months ago
- Reproduction of Dreamerv1 and v2 in pytorch for deepmind control suite☆48Updated 3 years ago
- Code for the paper: "Causal Influence Detection for Improving Efficiency in Reinforcement Learning", by Seitzer, M., Schölkopf, B., Marti…☆47Updated 3 years ago