datvodinh / recurrent-ppo
A Reinforcement Learning Project using PPO + LSTM
☆73 Updated last year
Alternatives and similar repositories for recurrent-ppo:
Users that are interested in recurrent-ppo are comparing it to the libraries listed below
- MARLToolkit: The Multi-Agent Reinforcement Learning Toolkit. Includes implementations of MAPPO, MADDPG, QMIX, VDN, COMA, IPPO, QTRAN, MAT..… ☆128 Updated last year
- Deep recurrent Q-learning on the CartPole-v1 environment ☆88 Updated last year
- Multi-Agent Constrained Policy Optimisation (MACPO; MAPPO-L). ☆169 Updated last year
- A clean and robust PyTorch implementation of SAC on continuous action space ☆73 Updated 3 weeks ago
- A clean and robust PyTorch implementation of PPO on continuous action space. ☆145 Updated 11 months ago
- Basic reinforcement learning algorithms, including: DQN, Double DQN, Dueling DQN, SARSA, REINFORCE, baseline-REINFORCE, Actor-Critic, DDPG, D… ☆92 Updated 4 years ago
- Transformer in RL for decision-making ☆97 Updated 2 years ago
- Implementation of Deep Deterministic Policy Gradient (DDPG) with Prioritized Experience Replay (PER) ☆51 Updated 2 months ago
- ☆201 Updated last year
- Solve BipedalWalkerHardcore-v2 with TD3 ☆88 Updated last year
- JAX and Torch Multi-Agent SAC on PettingZoo API ☆80 Updated 5 months ago
- DSAC: Distributional Soft Actor-Critic ☆125 Updated 2 months ago
- ☆102 Updated 2 months ago
- Implementation of MADDPG using PettingZoo and PyTorch ☆139 Updated last year
- PyTorch implementation of the discrete Soft-Actor-Critic algorithm. ☆51 Updated 3 years ago
- I used this paper as inspiration: https://arxiv.org/pdf/1904.03367.pdf ☆33 Updated 2 years ago
- The implementation of LSTM-TD3. ☆79 Updated 2 years ago
- The official code release for publications in the MARL field from the TJU RL lab. ☆75 Updated 2 years ago
- Baseline implementation of recurrent PPO using truncated BPTT ☆142 Updated last year
- Implementation of HIRO (Data-Efficient Hierarchical Reinforcement Learning) ☆107 Updated 3 years ago
- ☆96 Updated 3 years ago
- Deep Transformer Q-Networks for Partially Observable Reinforcement Learning ☆162 Updated 10 months ago
- Clean baseline implementation of PPO using an episodic TransformerXL memory ☆176 Updated 10 months ago
- This is the official implementation of Multi-Agent PPO. ☆105 Updated 2 years ago
- Code for our paper: Scalable Multi-Agent Reinforcement Learning through Intelligent Information Aggregation ☆109 Updated 4 months ago
- Proximal Policy Optimization (PPO) with Intrinsic Curiosity Module (ICM) on Pyramid env, Unity ML ☆17 Updated last year
- PyTorch implementation of Soft-Actor-Critic and Prioritized Experience Replay (PER) + Emphasizing Recent Experience (ERE) + Munchausen RL… ☆288 Updated 4 years ago
- ☆10 Updated 5 months ago
- A simple implementation of Generative Adversarial Imitation Learning with PyTorch ☆157 Updated 3 years ago
- ☆41 Updated last month