borninfreedom / rlai-exercises
Exercise Solutions for Reinforcement Learning: An Introduction [2nd Edition]
☆16Updated 4 years ago
Alternatives and similar repositories for rlai-exercises:
Users that are interested in rlai-exercises are comparing it to the libraries listed below
- ☆102Updated last month
- A plotter for reinforcement learning (RL)☆222Updated 3 years ago
- [NeurIPS 2021] CDS achieves remarkable success in challenging benchmarks SMAC and GRF by balancing sharing and diversity.☆84Updated last year
- This is the official implementation of Multi-Agent PPO.☆104Updated 2 years ago
- ☆96Updated 3 years ago
- ☆36Updated 9 months ago
- The implementation of LSTM-TD3.☆77Updated 2 years ago
- PyTorch implementation of DQN, AC, ACER, A2C, A3C, PG, DDPG, TRPO, PPO, SAC, TD3 and ....☆52Updated 5 years ago
- The Code for Paper “Relay Hindsight Experience Replay: Self-Guided Continual Reinforcement Learning for Sequential Object Manipulation Ta…☆151Updated 8 months ago
- ☆199Updated last year
- pytorch实现的一些MARL算法☆66Updated 3 years ago
- MATE: the Multi-Agent Tracking Environment.☆44Updated 2 years ago
- ☆123Updated 3 years ago
- Deep recurrent Q learning on CartPole-v1 environment☆87Updated last year
- Multi-Agent Constrained Policy Optimisation (MACPO; MAPPO-L).☆165Updated 11 months ago
- Safe Multi-Agent MuJoCo benchmark for safe multi-agent reinforcement learning research.☆60Updated 9 months ago
- Implement many Sparse Reward algorithms in Gym Fetch environment☆85Updated 4 years ago
- Implementation of HIRO (Data-Efficient Hierarchical Reinforcement Learning)☆106Updated 3 years ago
- ☆16Updated 2 years ago
- A collection of recent MARL papers☆87Updated 4 months ago
- Implement reinforcement learning algorithms in Pytorch☆33Updated 3 years ago
- Codes accompanying the paper "RODE: Learning Roles to Decompose Multi-Agent Tasks (ICLR 2021, https://arxiv.org/abs/2010.01523). RODE is …☆73Updated 3 months ago
- ☆93Updated 4 years ago
- Code for Weighted QMIX☆134Updated 4 years ago
- ☆38Updated this week
- ☆10Updated 4 months ago
- a clean and robust Pytorch implementation of SAC on continuous action space☆70Updated 9 months ago
- ☆42Updated 3 years ago
- MARLToolkit: The Multi-Agent Rainforcement Learning Toolkit. Include implementation of MAPPO, MADDPG, QMIX, VDN, COMA, IPPO, QTRAN, MAT..…☆123Updated 11 months ago
- Multi-agent project (commnet, bicnet, maddpg) in pytorch for Multi-Agent Particle Environment☆117Updated 2 years ago