Stepan-Makarenko / ICM-PPO-implementationLinks
Proximal Policy Optimization(PPO) with Intrinsic Curiosity Module(ICM) on Pyramid env, Unity ML
☆19Updated last year
Alternatives and similar repositories for ICM-PPO-implementation
Users that are interested in ICM-PPO-implementation are comparing it to the libraries listed below
Sorting:
- Multi-Agent Constrained Policy Optimisation (MACPO; MAPPO-L).☆193Updated last year
- Implementation of HIRO (Data-Efficient Hierarchical Reinforcement Learning)☆112Updated 4 years ago
- DSAC; Distributional Soft Actor-Critic☆132Updated 8 months ago
- PyTorch implementation of GAIL and AIRL based on PPO.☆227Updated 4 years ago
- Deep Transformer Q-Networks for Partially Observable Reinforcement Learning☆169Updated last year
- Implementation of Deep Deterministic Policy Gradient (DDPG) with Prioritized Experience Replay (PER)☆53Updated 7 months ago
- PyTorch implementation of Soft-Actor-Critic and Prioritized Experience Replay (PER) + Emphasizing Recent Experience (ERE) + Munchausen RL…☆292Updated 4 years ago
- A collection of offline reinforcement learning algorithms.☆200Updated 10 months ago
- Benchmark for Continuous Multi-Agent Robotic Control, based on OpenAI's Mujoco Gym environments.☆359Updated 2 years ago
- ☆40Updated 3 years ago
- Multi-Robot Warehouse (RWARE): A multi-agent reinforcement learning environment☆72Updated last year
- The official code releasement of publications in MARL field of TJU RL lab.☆80Updated 3 years ago
- Multi-agent PPO with noise (97% win rates on Hard scenarios of SMAC)☆68Updated 2 years ago
- Deep recurrent Q learning on CartPole-v1 environment☆93Updated last year
- 🚀 A fast safe reinforcement learning library in PyTorch☆216Updated last year
- PyTorch implementation of Hierarchical Actor Critic (HAC) for OpenAI gym environments☆325Updated 3 years ago
- Author's PyTorch implementation of TD3+BC, a simple variant of TD3 for offline RL☆375Updated 3 years ago
- This is the official implementation of Multi-Agent PPO.☆118Updated 2 years ago
- 🤖 Elegant implementations of offline safe RL algorithms in PyTorch☆214Updated last year
- MARLToolkit: The Multi-Agent Rainforcement Learning Toolkit. Include implementation of MAPPO, MADDPG, QMIX, VDN, COMA, IPPO, QTRAN, MAT..…☆142Updated last year
- Author's PyTorch implementation of Randomized Ensembled Double Q-Learning (REDQ) algorithm.☆172Updated 11 months ago
- ☆102Updated 3 years ago
- There will be updates later☆85Updated 6 years ago
- PyTorch implementation of the Offline Reinforcement Learning algorithm CQL. Includes the versions DQN-CQL and SAC-CQL for discrete and co…☆142Updated last year
- Single-file pytorch implementation of hybrid-SAC☆59Updated 4 years ago
- Proximal Policy Optimization(PPO) with Intrinsic Curiosity Module(ICM)☆147Updated 6 years ago
- Level-Based Foraging (LBF): A multi-agent reinforcement learning environment☆51Updated last year
- PyTorch implementation of Soft Actor-Critic(SAC).☆105Updated 5 years ago
- PyTorch implementation of the discrete Soft-Actor-Critic algorithm.☆53Updated 4 years ago
- Basic constrained RL agents used in experiments for the "Benchmarking Safe Exploration in Deep Reinforcement Learning" paper.☆441Updated 2 years ago