HanggeAi / Life
Life is a library for reinforce learning implemented by PyTorch.
☆13Updated last year
Alternatives and similar repositories for Life:
Users that are interested in Life are comparing it to the libraries listed below
- TD3 in Pytorch☆29Updated 3 years ago
- A collection of multi agent environments based on OpenAI gym.☆21Updated last year
- Deep recurrent Q learning on CartPole-v1 environment☆83Updated last year
- My own implementation of Reinforcement Learning algorithms using Tensorflow 2.0☆29Updated 2 years ago
- Code accompanying paper "Coordinated Proximal Policy Optimization"☆11Updated 2 years ago
- ☆100Updated last month
- This is a reinforcement learning algorithm library. The code takes into account both performance and simplicity, with little dependence.☆94Updated 2 years ago
- demo of multi-agent reinforcement learning algorithms, such as ATT-MADDPG (Modelling the Dynamic Joint Policy of Teammates with Attention…☆52Updated 3 years ago
- Multi-agent project (commnet, bicnet, maddpg) in pytorch for Multi-Agent Particle Environment☆112Updated 2 years ago
- Multi-Agent Constrained Policy Optimisation (MACPO; MAPPO-L).☆156Updated 9 months ago
- ☆192Updated last year
- implementation of MADDPG using PettingZoo and PyTorch☆122Updated last year
- 动手学强化学习代码☆49Updated last year
- Simple verification experiments codes for multi-agent RL using OpenAI MPE environment☆31Updated 2 years ago
- rl-papers☆47Updated last year
- MARLToolkit: The Multi-Agent Rainforcement Learning Toolkit. Include implementation of MAPPO, MADDPG, QMIX, VDN, COMA, IPPO, QTRAN, MAT..…☆114Updated 8 months ago
- D3QN Pytorch☆57Updated 3 years ago
- a clean and robust Pytorch implementation of SAC on continuous action space☆66Updated 7 months ago
- implementation of MADDPG using PyTorch and multiagent-particle-envs☆32Updated 2 years ago
- Implementations of MAPPO and IPPO on SMAC, the multi-agent StarCraft environment.☆60Updated 2 years ago
- ☆34Updated 3 weeks ago
- Solve BipedalWalkerHardcore-v2 with TD3☆83Updated last year
- PyTorch implementation of DQN, AC, ACER, A2C, A3C, PG, DDPG, TRPO, PPO, SAC, TD3 and ....☆52Updated 4 years ago
- Implementation of Deep Deterministic Policy Gradient (DDPG) with Prioritized Experience Replay (PER)☆47Updated 4 years ago
- Reinforcement Learning Algorithms Based on PyTorch☆17Updated 2 years ago
- Applying Constrained Policy Networks on Highway Environment☆37Updated 3 years ago
- Algorithm that combines QMIX with SAC for Multi-Agent Reinforcement Learning.☆42Updated 2 years ago
- notes☆26Updated 2 years ago
- ☆93Updated 3 years ago
- ☆41Updated 3 years ago