datvodinh / ppo-transformerLinks
A Reinforcement Learning Project using PPO + Transformer
☆68Updated 2 years ago
Alternatives and similar repositories for ppo-transformer
Users that are interested in ppo-transformer are comparing it to the libraries listed below
Sorting:
- Clean baseline implementation of PPO using an episodic TransformerXL memory☆188Updated last year
- ☆87Updated 2 months ago
- Baseline implementation of recurrent PPO using truncated BPTT☆152Updated last year
- ☆112Updated 2 years ago
- Author's Pytorch implementation of ICLR2023 paper Behavior Proximal Policy Optimization (BPPO).☆88Updated last year
- A simple implementation of Generative Adversarial Imitation Learning with PyTorch☆167Updated 3 years ago
- A Reinforcement Learning Project using PPO + LSTM☆93Updated 2 years ago
- Author's PyTorch implementation of TD7 for online and offline RL☆148Updated 2 years ago
- PPO, DDPG, SAC implementation on mujoco environment☆117Updated 3 years ago
- ☆53Updated 3 months ago
- 🤖 Elegant implementations of offline safe RL algorithms in PyTorch☆211Updated last year
- Transformer in RL for decision-making☆100Updated 2 years ago
- Deep Transformer Q-Networks for Partially Observable Reinforcement Learning☆165Updated last year
- Implementation of "MADiff: Offline Multi-agent Learning with Diffusion Models"☆82Updated 2 months ago
- Benchmark for Continuous Multi-Agent Robotic Control, based on OpenAI's Mujoco Gym environments.☆358Updated 2 years ago
- 🚀 A fast safe reinforcement learning library in PyTorch☆213Updated 11 months ago
- ☆105Updated 2 months ago
- Implementation of Soft Actor-Critic: Off-Policy Maximum Entropy Deep Reinforcement Learning with a Stochastic Actor.☆28Updated 4 months ago
- ☆282Updated 3 years ago
- 🔥 Datasets and env wrappers for offline safe reinforcement learning☆104Updated last year
- A general model-free off-policy actor-critic implementation. Continuous and Discrete Soft Actor-Critic with multimodal observations, data…☆39Updated last year
- A collection of offline reinforcement learning algorithms.☆196Updated 9 months ago
- ☆59Updated 2 months ago
- Code for "Temporal Difference Learning for Model Predictive Control"☆462Updated last year
- NeurIPS 2024 DACER☆138Updated last month
- [NeurIPS 2024] Maximum Entropy Reinforcement Learning via Energy-Based Normalizing Flow☆39Updated 10 months ago
- A collection of recent MARL papers☆95Updated 10 months ago
- DSAC; Distributional Soft Actor-Critic☆131Updated 7 months ago
- Author's PyTorch implementation of TD3+BC, a simple variant of TD3 for offline RL☆373Updated 3 years ago
- Diversity is All You Need: Learning Skills without a Reward Function in PyTorch.☆74Updated last year