datvodinh / ppo-transformer
A Reinforcement Learning Project using PPO + Transformer
☆86 Updated 2 years ago
Alternatives and similar repositories for ppo-transformer
Users who are interested in ppo-transformer are comparing it to the libraries listed below.
- Clean baseline implementation of PPO using an episodic TransformerXL memory ☆204 Updated last year
- ☆106 Updated 2 months ago
- Author's PyTorch implementation of TD7 for online and offline RL ☆161 Updated 2 years ago
- A simple implementation of Generative Adversarial Imitation Learning with PyTorch ☆174 Updated 3 years ago
- Baseline implementation of recurrent PPO using truncated BPTT ☆160 Updated last year
- PPO, DDPG, and SAC implementations on MuJoCo environments ☆125 Updated 3 years ago
- ☆121 Updated 2 years ago
- Author's PyTorch implementation of the ICLR 2023 paper Behavior Proximal Policy Optimization (BPPO). ☆94 Updated 2 years ago
- 🚀 A fast safe reinforcement learning library in PyTorch ☆237 Updated last year
- 🤖 Elegant implementations of offline safe RL algorithms in PyTorch ☆231 Updated last year
- A Reinforcement Learning Project using PPO + LSTM ☆111 Updated 2 years ago
- ☆55 Updated 8 months ago
- Transformer in RL for decision-making ☆104 Updated 3 years ago
- Official code of the paper "Multi-Task Reinforcement Learning with Mixture of Orthogonal Experts" at ICLR 2024 ☆41 Updated last year
- DSAC: Distributional Soft Actor-Critic ☆137 Updated last year
- Implementation of "MADiff: Offline Multi-agent Learning with Diffusion Models" ☆103 Updated 7 months ago
- Implementation of Soft Actor-Critic: Off-Policy Maximum Entropy Deep Reinforcement Learning with a Stochastic Actor. ☆29 Updated 9 months ago
- 🔥 Datasets and env wrappers for offline safe reinforcement learning ☆122 Updated 3 months ago
- Author's PyTorch implementation of TD3+BC, a simple variant of TD3 for offline RL ☆391 Updated 4 years ago
- Diversity is All You Need: Learning Skills without a Reward Function in PyTorch. ☆85 Updated last month
- NeurIPS 2024 DACER ☆166 Updated 3 weeks ago
- Author's PyTorch implementation of Randomized Ensembled Double Q-Learning (REDQ) algorithm. ☆176 Updated last year
- Deep Transformer Q-Networks for Partially Observable Reinforcement Learning ☆171 Updated last year
- An elegant PyTorch offline reinforcement learning library for researchers. ☆382 Updated 7 months ago
- ☆316 Updated 4 years ago
- A general model-free off-policy actor-critic implementation. Continuous and Discrete Soft Actor-Critic with multimodal observations, data… ☆40 Updated last year
- ☆106 Updated 6 months ago
- ILSwiss is an Easy-to-run Imitation Learning (IL, or Learning from Demonstration, LfD) and also Reinforcement Learning (RL) framework (te… ☆176 Updated 2 years ago
- Implementation of HIRO (Data-Efficient Hierarchical Reinforcement Learning) ☆112 Updated 4 years ago
- PyTorch implementation of the Offline Reinforcement Learning algorithm CQL. Includes the versions DQN-CQL and SAC-CQL for discrete and co… ☆147 Updated last year