lucidrains / ppo
An implementation of PPO in Pytorch
☆79Updated last week
Alternatives and similar repositories for ppo
Users that are interested in ppo are comparing it to the libraries listed below
Sorting:
- CleanRL's implementation of DeepMind's Podracer Sebulba Architecture for Distributed DRL☆112Updated 8 months ago
- MR.Q is a general-purpose model-free reinforcement learning algorithm.☆91Updated last month
- ☆78Updated 6 months ago
- Implementation of Soft Actor Critic and some of its improvements in Pytorch☆56Updated 3 months ago
- Challenging Memory-based Deep Reinforcement Learning Agents☆97Updated 6 months ago
- Synthetic Experience Replay☆92Updated 11 months ago
- Implementation of the new SOTA for model based RL, from the paper "Improving Transformer World Models for Data-Efficient RL", in Pytorch☆117Updated last week
- Pytorch Implementation of MuZero Unplugged for gym environment. This algorithm is capable of supporting a wide range of action and observ…☆27Updated 2 years ago
- Benchmarking RL for POMDPs in Pure JAX [Code for "Structured State Space Models for In-Context Reinforcement Learning" (NeurIPS 2023)]☆100Updated last year
- Official Implementation of "Can Learned Optimization Make Reinforcement Learning Less Difficult"☆22Updated 3 weeks ago
- Preference Transformer: Modeling Human Preferences using Transformers for RL (ICLR2023 Accepted)☆164Updated last year
- Official code repository for Prompt-DT.☆109Updated 2 years ago
- Recall to Imagine, a model-based RL algorithm with superhuman memory. Oral (1.2%) @ ICLR 2024☆66Updated 11 months ago
- Generalized Decision Transformer for Offline Hindsight Information Matching (ICLR2022)☆67Updated 2 years ago
- Pytorch implementation of DreamerV2: Mastering Atari with Discrete World Models, based on the original implementation☆20Updated 2 years ago
- Learning to Modulate pre-trained Models in RL (Decision Transformer, LoRA, Fine-tuning)☆56Updated 7 months ago
- ☆27Updated last year
- Simple single-file baselines for Q-Learning in pure-GPU setting☆161Updated last month
- Reproduction of Dreamerv1 and v2 in pytorch for deepmind control suite☆38Updated 2 years ago
- Code for "SimbaV2: Hyperspherical Normalization for Scalable Deep Reinforcement Learning"☆45Updated last week
- Rainbow DQN implementation accompanying the paper "Fast and Data-Efficient Training of Rainbow" which reaches 205.7 median HNS after 10M …☆45Updated 3 years ago
- Transformer-based World Models☆81Updated 2 years ago
- MARS is shortened for Multi-Agent Research Studio, a library for mulit-agent reinforcement learning research.☆48Updated last year
- Extreme Q-Learning: Max Entropy RL without Entropy☆87Updated 2 years ago
- Code for TRANSDREAMER: REINFORCEMENT LEARNING WITH TRANSFORMER WORLD MODELS☆25Updated last year
- JAX implementation of deep RL agents with resets from the paper "The Primacy Bias in Deep Reinforcement Learning"☆100Updated 2 years ago
- Implementation of Trajectory Transformer with attention caching and batched beam search☆111Updated 2 years ago
- Baselines for Neural MMO -- new users should treat this repo as a starter project☆46Updated 9 months ago
- General Modules for JAX☆64Updated last month
- Official implementation of "Direct Preference-based Policy Optimization without Reward Modeling" (NeurIPS 2023)☆42Updated 9 months ago