datvodinh / ppo-transformerLinks
A Reinforcement Learning Project using PPO + Transformer
☆76Updated 2 years ago
Alternatives and similar repositories for ppo-transformer
Users that are interested in ppo-transformer are comparing it to the libraries listed below
Sorting:
- Clean baseline implementation of PPO using an episodic TransformerXL memory☆195Updated last year
- Author's PyTorch implementation of TD7 for online and offline RL☆154Updated 2 years ago
- ☆90Updated 4 months ago
- A simple implementation of Generative Adversarial Imitation Learning with PyTorch☆170Updated 3 years ago
- ☆115Updated 2 years ago
- Baseline implementation of recurrent PPO using truncated BPTT☆156Updated last year
- 🤖 Elegant implementations of offline safe RL algorithms in PyTorch☆220Updated last year
- ☆54Updated 5 months ago
- Author's Pytorch implementation of ICLR2023 paper Behavior Proximal Policy Optimization (BPPO).☆92Updated last year
- Diversity is All You Need: Learning Skills without a Reward Function in PyTorch.☆80Updated 2 years ago
- PPO, DDPG, SAC implementation on mujoco environment☆121Updated 3 years ago
- [ICLR 2025] Robust Gymnasium: A Unified Modular Benchmark for Robust Reinforcement Learning.☆75Updated 3 months ago
- Official code of the paper "Multi-Task Reinforcement Learning with Mixture of Orthogonal Experts" at ICLR2024☆36Updated last year
- 🚀 A fast safe reinforcement learning library in PyTorch☆224Updated last year
- A Reinforcement Learning Project using PPO + LSTM☆99Updated 2 years ago
- 🔥 Datasets and env wrappers for offline safe reinforcement learning☆105Updated last week
- Author's PyTorch implementation of Randomized Ensembled Double Q-Learning (REDQ) algorithm.☆174Updated last year
- An elegant PyTorch offline reinforcement learning library for researchers.☆367Updated 4 months ago
- Deep Transformer Q-Networks for Partially Observable Reinforcement Learning☆171Updated last year
- Repo for Implicit Diffusion Q-Learning☆116Updated last year
- DSAC; Distributional Soft Actor-Critic☆133Updated 9 months ago
- Author's PyTorch implementation of TD3+BC, a simple variant of TD3 for offline RL☆376Updated 3 years ago
- ☆66Updated 4 months ago
- ILSwiss is an Easy-to-run Imitation Learning (IL, or Learning from Demonstration, LfD) and also Reinforcement Learning (RL) framework (te…☆175Updated 2 years ago
- Official code release for "CrossQ: Batch Normalization in Deep Reinforcement Learning for Greater Sample Efficiency and Simplicity"☆80Updated last year
- Code for "Temporal Difference Learning for Model Predictive Control"☆475Updated last year
- A pytorch reprelication of the model-based reinforcement learning algorithm MBPO☆181Updated 3 years ago
- ☆295Updated 3 years ago
- A collection of offline reinforcement learning algorithms.☆205Updated 11 months ago
- Implementation of "MADiff: Offline Multi-agent Learning with Diffusion Models"☆91Updated 4 months ago