datvodinh / ppo-transformer
A Reinforcement Learning Project using PPO + Transformer
☆41 Updated last year
Alternatives and similar repositories for ppo-transformer:
Users who are interested in ppo-transformer are comparing it to the libraries listed below.
- Deep Transformer Q-Networks for Partially Observable Reinforcement Learning ☆152 Updated 6 months ago
- Clean baseline implementation of PPO using an episodic TransformerXL memory ☆164 Updated 7 months ago
- The official code release for publications in the MARL field from the TJU RL lab. ☆68 Updated 2 years ago
- PyTorch implementation of the discrete Soft-Actor-Critic algorithm. ☆49 Updated 3 years ago
- ☆26 Updated 4 years ago
- Deep Reinforcement Learning using Proximal Policy Optimization and Random Network Distillation in TensorFlow 2 and PyTorch with some e… ☆50 Updated 4 years ago
- Multi-Agent Constrained Policy Optimisation (MACPO; MAPPO-L). ☆157 Updated 9 months ago
- Code accompanying the paper "Learning Nearly Decomposable Value Functions with Communication Minimization" (ICLR 2020) ☆81 Updated 2 years ago
- PyTorch implementation of FQF, IQN and QR-DQN. ☆167 Updated 6 months ago
- Multi-Agent Deep Reinforcement Learning using Asynchronous & Impala Proximal Policy Optimization in PyTorch with some explanation ☆33 Updated 4 years ago
- Implementation of mSAC methods in PyTorch ☆40 Updated 3 years ago
- Implementations of various DQN reinforcement learning algorithms in PyTorch ☆92 Updated 3 years ago
- PyTorch implementation of the Offline Reinforcement Learning algorithm CQL. Includes the versions DQN-CQL and SAC-CQL for discrete and co… ☆130 Updated 8 months ago
- Hierarchical Cooperative Multi-Agent Reinforcement Learning with Skill Discovery ☆99 Updated 2 years ago
- Datasets with baselines for offline multi-agent reinforcement learning. ☆160 Updated last week
- PyTorch implementation of GAIL and AIRL based on PPO. ☆204 Updated 4 years ago
- Revisiting Discrete Soft Actor-Critic, accepted by Transactions on Machine Learning Research (TMLR) ☆19 Updated 2 months ago
- Random network distillation on Montezuma's Revenge and Super Mario Bros. ☆44 Updated 2 years ago
- Distributed RL implementation using PyTorch and Ray (ApeX (Ape-X), A3C, Distributed-PPO (DPPO), Impala) ☆26 Updated 2 years ago
- Multi-agent PPO with noise (97% win rates on Hard scenarios of SMAC) ☆57 Updated last year
- PyTorch implementation of Foerster, Jakob N., et al. "Counterfactual multi-agent policy gradients." ☆57 Updated 4 years ago
- Deep recurrent Q-learning on the CartPole-v1 environment ☆83 Updated last year
- 🤖 Elegant implementations of offline safe RL algorithms in PyTorch ☆186 Updated 4 months ago
- PyTorch Implementation of Implicit Quantile Networks (IQN) for Distributional Reinforcement Learning with additional extensions like PER,… ☆82 Updated last year
- Code for "Randomized Entity-wise Factorization for Multi-Agent Reinforcement Learning" ICML 2021 ☆64 Updated 3 years ago
- Official code of Nash-DQN for the paper: Nash-DQN algorithm for two-player zero-sum Markov games; for details see our paper: A Deep Reinforcement… ☆19 Updated 2 years ago
- A PyTorch replication of the model-based reinforcement learning algorithm MBPO ☆160 Updated 2 years ago
- Author's PyTorch implementation of the ICLR 2023 paper Behavior Proximal Policy Optimization (BPPO). ☆81 Updated last year
- A Reinforcement Learning Project using PPO + LSTM ☆55 Updated last year
- MARLToolkit: The Multi-Agent Reinforcement Learning Toolkit. Includes implementations of MAPPO, MADDPG, QMIX, VDN, COMA, IPPO, QTRAN, MAT..… ☆114 Updated 9 months ago