datvodinh / recurrent-ppo
A Reinforcement Learning Project using PPO + LSTM
☆63Updated last year
Alternatives and similar repositories for recurrent-ppo:
Users that are interested in recurrent-ppo are comparing it to the libraries listed below
- Multi-Agent Constrained Policy Optimisation (MACPO; MAPPO-L).☆163Updated 11 months ago
- MARLToolkit: The Multi-Agent Rainforcement Learning Toolkit. Include implementation of MAPPO, MADDPG, QMIX, VDN, COMA, IPPO, QTRAN, MAT..…☆121Updated 11 months ago
- DSAC; Distributional Soft Actor-Critic☆125Updated last month
- a clean and robust Pytorch implementation of SAC on continuous action space☆71Updated 9 months ago
- The implementation of LSTM-TD3.☆76Updated 2 years ago
- Implementation of Deep Deterministic Policy Gradient (DDPG) with Prioritized Experience Replay (PER)☆49Updated last month
- Deep recurrent Q learning on CartPole-v1 environment☆87Updated last year
- Baseline implementation of recurrent PPO using truncated BPTT☆137Updated 10 months ago
- PyTorch implementation of the discrete Soft-Actor-Critic algorithm.☆50Updated 3 years ago
- ☆10Updated 4 months ago
- This is the official implementation of Multi-Agent PPO.☆104Updated 2 years ago
- Collection of OpenAI parametrized action-space environments.☆64Updated last week
- Transformer in RL for decision-making☆96Updated 2 years ago
- The official code releasement of publications in MARL field of TJU RL lab.☆71Updated 2 years ago
- Jax and Torch Multi-Agent SAC on PettingZoo API☆72Updated 4 months ago
- Implementations of MAPPO and IPPO on SMAC, the multi-agent StarCraft environment.☆63Updated 3 years ago
- Solve BipedalWalkerHardcore-v2 with TD3☆85Updated last year
- Clean implementation of Multi-Agent Reinforcement Learning methods (MADDPG, MATD3, MASAC, MAD4PG) in TensorFlow 2.x☆143Updated last year
- ☆96Updated 3 years ago
- Implementation of HIRO (Data-Efficient Hierarchical Reinforcement Learning)☆105Updated 3 years ago
- ☆197Updated last year
- ☆102Updated last month
- Proximal Policy Optimization(PPO) with Intrinsic Curiosity Module(ICM) on Pyramid env, Unity ML☆15Updated last year
- Implementation of PPO Lagrangian in PyTorch☆38Updated 2 years ago
- A collection of recent MARL papers☆87Updated 4 months ago
- Deep Transformer Q-Networks for Partially Observable Reinforcement Learning☆160Updated 8 months ago
- Pytorch implementation of "Safe Exploration in Continuous Action Spaces" [Dalal et al.]☆70Updated 5 years ago
- I used this paper as inspiration https://arxiv.org/pdf/1904.03367.pdf☆31Updated 2 years ago
- Clean baseline implementation of PPO using an episodic TransformerXL memory☆170Updated 9 months ago
- A clean and robust Pytorch implementation of TD3 on continuous action space☆26Updated 9 months ago