maitchison / PPO
Example implementation of the Proximal Policy Optimization algorithm
☆16Updated 9 months ago
Alternatives and similar repositories for PPO
Users who are interested in PPO compare it to the libraries listed below
- MARS (Multi-Agent Research Studio) is a library for multi-agent reinforcement learning research.☆48Updated last year
- Implementation of the Off Belief Learning algorithm.☆47Updated 2 years ago
- [NeurIPS 2023] Implementation of Elastic Decision Transformer☆35Updated last year
- Reproduction of OpenAI and DeepMind's "Deep Reinforcement Learning from Human Preferences"☆29Updated 3 years ago
- Code for TRANSDREAMER: REINFORCEMENT LEARNING WITH TRANSFORMER WORLD MODELS☆25Updated last year
- ☆111Updated 2 years ago
- ☆86Updated 11 months ago
- Official code repository for Prompt-DT.☆109Updated 2 years ago
- Code for the NeurIPS 2023 paper: Counterfactual Conservative Q Learning for Offline Multi-agent Reinforcement Learning.☆36Updated 3 months ago
- Implementation of Trajectory Transformer with attention caching and batched beam search☆112Updated 2 years ago
- Object Centric Atari games☆78Updated this week
- GitHub repo for HIDIO: Hierarchical Reinforcement Learning by Discovering Intrinsic Options☆46Updated 3 years ago
- [ICLR 2025 Oral] OptionZero: A method for autonomously discovering and utilizing options in the MuZero algorithm☆14Updated 3 months ago
- MR.Q is a general-purpose model-free reinforcement learning algorithm.☆91Updated last month
- ☆24Updated 7 months ago
- Official code for "Can Wikipedia Help Offline Reinforcement Learning?" by Machel Reid, Yutaro Yamada and Shixiang Shane Gu☆104Updated 2 years ago
- Drop-in environment replacements that make your RL algorithm train faster.☆20Updated 10 months ago
- TeachMyAgent is a testbed platform for Automatic Curriculum Learning methods in Deep RL.☆74Updated last year
- Implementation of "Machine Theory of Mind", ICML 2018☆24Updated 3 years ago
- PyTorch implementation of MuZero Unplugged for gym environments. This algorithm is capable of supporting a wide range of action and observ…☆27Updated 2 years ago
- AdaRefiner: Refining Decisions of Language Models with Adaptive Feedback (NAACL 2024)☆15Updated 9 months ago
- Code for Model-Free Opponent Shaping (ICML 2022)☆18Updated 2 years ago
- ☆13Updated last year
- Learning to Modulate pre-trained Models in RL (Decision Transformer, LoRA, Fine-tuning)☆57Updated 7 months ago
- An unofficial implementation for online decision transformer☆40Updated 2 years ago
- AGAC: Adversarially Guided Actor-Critic☆49Updated 3 years ago
- Implementation of Multi-Game Decision Transformers in PyTorch☆46Updated 2 years ago
- Official code for Nash-DQN, an algorithm for two-player zero-sum Markov games. For details, see the paper: A Deep Reinforcement…☆20Updated 2 years ago
- Reproduction of Dreamer v1 and v2 in PyTorch for the DeepMind Control Suite☆39Updated 2 years ago
- This code implements Prioritized Level Replay, a method for sampling training levels for reinforcement learning agents that exploits the …☆86Updated 3 years ago