datvodinh / ppo-transformerLinks
A Reinforcement Learning Project using PPO + Transformer
☆80Updated 2 years ago
Alternatives and similar repositories for ppo-transformer
Users that are interested in ppo-transformer are comparing it to the libraries listed below
Sorting:
- Clean baseline implementation of PPO using an episodic TransformerXL memory☆198Updated last year
- Author's PyTorch implementation of TD7 for online and offline RL☆156Updated 2 years ago
- ☆117Updated 2 years ago
- Author's Pytorch implementation of ICLR2023 paper Behavior Proximal Policy Optimization (BPPO).☆93Updated 2 years ago
- 🤖 Elegant implementations of offline safe RL algorithms in PyTorch☆223Updated last year
- A Reinforcement Learning Project using PPO + LSTM☆101Updated 2 years ago
- Baseline implementation of recurrent PPO using truncated BPTT☆156Updated last year
- ☆98Updated last week
- A simple implementation of Generative Adversarial Imitation Learning with PyTorch☆171Updated 3 years ago
- Implementation of "MADiff: Offline Multi-agent Learning with Diffusion Models"☆97Updated 5 months ago
- ☆55Updated 6 months ago
- Official code of the paper "Multi-Task Reinforcement Learning with Mixture of Orthogonal Experts" at ICLR2024☆36Updated last year
- PPO, DDPG, SAC implementation on mujoco environment☆122Updated 3 years ago
- 🚀 A fast safe reinforcement learning library in PyTorch☆226Updated last year
- [NeurIPS 2024] Maximum Entropy Reinforcement Learning via Energy-Based Normalizing Flow☆40Updated last year
- Diversity is All You Need: Learning Skills without a Reward Function in PyTorch.☆82Updated 2 years ago
- 🔥 Datasets and env wrappers for offline safe reinforcement learning☆110Updated last month
- Author's PyTorch implementation of Randomized Ensembled Double Q-Learning (REDQ) algorithm.☆176Updated last year
- DSAC; Distributional Soft Actor-Critic☆134Updated 10 months ago
- Implementation of Soft Actor-Critic: Off-Policy Maximum Entropy Deep Reinforcement Learning with a Stochastic Actor.☆29Updated 7 months ago
- ☆69Updated 5 months ago
- Implementation of HIRO (Data-Efficient Hierarchical Reinforcement Learning)☆112Updated 4 years ago
- Safe Multi-Agent MuJoCo benchmark for safe multi-agent reinforcement learning research.☆70Updated last year
- Code for "Temporal Difference Learning for Model Predictive Control"☆484Updated 2 years ago
- Transformer in RL for decision-making☆103Updated 2 years ago
- Official code release for "CrossQ: Batch Normalization in Deep Reinforcement Learning for Greater Sample Efficiency and Simplicity"☆81Updated last year
- ILSwiss is an Easy-to-run Imitation Learning (IL, or Learning from Demonstration, LfD) and also Reinforcement Learning (RL) framework (te…☆175Updated 2 years ago
- Repo for Implicit Diffusion Q-Learning☆119Updated 2 years ago
- A general model-free off-policy actor-critic implementation. Continuous and Discrete Soft Actor-Critic with multimodal observations, data…☆40Updated last year
- Deep Transformer Q-Networks for Partially Observable Reinforcement Learning☆171Updated last year