datvodinh / ppo-transformerLinks
A Reinforcement Learning Project using PPO + Transformer
☆70Updated 2 years ago
Alternatives and similar repositories for ppo-transformer
Users that are interested in ppo-transformer are comparing it to the libraries listed below
Sorting:
- Clean baseline implementation of PPO using an episodic TransformerXL memory☆191Updated last year
- Baseline implementation of recurrent PPO using truncated BPTT☆153Updated last year
- A simple implementation of Generative Adversarial Imitation Learning with PyTorch☆168Updated 3 years ago
- Author's PyTorch implementation of TD7 for online and offline RL☆149Updated 2 years ago
- A Reinforcement Learning Project using PPO + LSTM☆92Updated 2 years ago
- ☆114Updated 2 years ago
- ☆87Updated 3 months ago
- Implementation of Soft Actor-Critic: Off-Policy Maximum Entropy Deep Reinforcement Learning with a Stochastic Actor.☆29Updated 5 months ago
- PPO, DDPG, SAC implementation on mujoco environment☆119Updated 3 years ago
- [NeurIPS 2024] Maximum Entropy Reinforcement Learning via Energy-Based Normalizing Flow☆38Updated 11 months ago
- 🤖 Elegant implementations of offline safe RL algorithms in PyTorch☆214Updated last year
- Author's PyTorch implementation of TD3+BC, a simple variant of TD3 for offline RL☆375Updated 3 years ago
- Deep Transformer Q-Networks for Partially Observable Reinforcement Learning☆169Updated last year
- 🚀 A fast safe reinforcement learning library in PyTorch☆216Updated last year
- A general model-free off-policy actor-critic implementation. Continuous and Discrete Soft Actor-Critic with multimodal observations, data…☆39Updated last year
- Author's Pytorch implementation of ICLR2023 paper Behavior Proximal Policy Optimization (BPPO).☆89Updated last year
- Diversity is All You Need: Learning Skills without a Reward Function in PyTorch.☆76Updated last year
- Official code of the paper "Multi-Task Reinforcement Learning with Mixture of Orthogonal Experts" at ICLR2024☆31Updated 10 months ago
- PyTorch implementation of GAIL and AIRL based on PPO.☆226Updated 4 years ago
- ILSwiss is an Easy-to-run Imitation Learning (IL, or Learning from Demonstration, LfD) and also Reinforcement Learning (RL) framework (te…☆175Updated 2 years ago
- 🔥 Datasets and env wrappers for offline safe reinforcement learning☆105Updated last year
- ☆60Updated 3 months ago
- Transformer in RL for decision-making☆100Updated 2 years ago
- Author's PyTorch implementation of Randomized Ensembled Double Q-Learning (REDQ) algorithm.☆172Updated 10 months ago
- Official PyTorch code for "Recurrent Off-policy Baselines for Memory-based Continuous Control" (DeepRL Workshop, NeurIPS 21)☆89Updated last year
- Implementation of "MADiff: Offline Multi-agent Learning with Diffusion Models"☆81Updated 3 months ago
- ☆286Updated 3 years ago
- DSAC; Distributional Soft Actor-Critic☆132Updated 8 months ago
- PyTorch implementation of the Offline Reinforcement Learning algorithm CQL. Includes the versions DQN-CQL and SAC-CQL for discrete and co…☆142Updated last year
- PyTorch implementation of some reinforcement learning algorithms: A2C, PPO, Behavioral Cloning from Observation (BCO), GAIL.☆146Updated 3 years ago