borninfreedom / rlai-exercisesLinks
Exercise Solutions for Reinforcement Learning: An Introduction [2nd Edition]
☆16Updated 4 years ago
Alternatives and similar repositories for rlai-exercises
Users that are interested in rlai-exercises are comparing it to the libraries listed below
Sorting:
- ☆103Updated 3 months ago
- ☆41Updated 3 years ago
- ILSwiss is an Easy-to-run Imitation Learning (IL, or Learning from Demonstration, LfD) and also Reinforcement Learning (RL) framework (te…☆171Updated last year
- ☆49Updated 2 months ago
- ☆23Updated 2 years ago
- Implementation of Deep Deterministic Policy Gradient (DDPG) with Prioritized Experience Replay (PER)☆51Updated 3 months ago
- ☆98Updated 3 years ago
- Multi-Agent Constrained Policy Optimisation (MACPO; MAPPO-L).☆174Updated last year
- ☆204Updated 2 years ago
- ☆16Updated 2 years ago
- [NeurIPS 2021] CDS achieves remarkable success in challenging benchmarks SMAC and GRF by balancing sharing and diversity.☆86Updated 2 years ago
- This repo relates to the survey paper <Goal-Conditioned Reinforcement Learning: Problems and Solutions>. We collects widely used benchmar…☆131Updated 2 years ago
- Implement many Sparse Reward algorithms in Gym Fetch environment☆88Updated 4 years ago
- Implementation of HIRO (Data-Efficient Hierarchical Reinforcement Learning)☆110Updated 4 years ago
- The Code for Paper “Relay Hindsight Experience Replay: Self-Guided Continual Reinforcement Learning for Sequential Object Manipulation Ta…☆153Updated 10 months ago
- A plotter for reinforcement learning (RL)☆224Updated 3 years ago
- a clean and robust Pytorch implementation of SAC on continuous action space☆77Updated last month
- Implement reinforcement learning algorithms in Pytorch☆33Updated 4 years ago
- Implementation of Soft Actor-Critic with Hindsight Experience Replay☆17Updated 4 years ago
- ☆36Updated 11 months ago
- pytorch实现的一些MARL算法☆66Updated 4 years ago
- Code accompanying paper "Coordinated Proximal Policy Optimization"☆11Updated 3 years ago
- Solve BipedalWalkerHardcore-v2 with TD3☆89Updated 2 years ago
- Deep recurrent Q learning on CartPole-v1 environment☆91Updated last year
- Safe Multi-Agent MuJoCo benchmark for safe multi-agent reinforcement learning research.☆62Updated 11 months ago
- DSAC; Distributional Soft Actor-Critic☆127Updated 3 months ago
- Code for Weighted QMIX☆136Updated 4 years ago
- Transplant a implementation of MADDPG to the environment provided by openAI (multiagent-particle-envs).☆21Updated 4 years ago
- The implementation of LSTM-TD3.☆81Updated 2 years ago
- Codes accompanying the paper "RODE: Learning Roles to Decompose Multi-Agent Tasks (ICLR 2021, https://arxiv.org/abs/2010.01523). RODE is …☆74Updated 5 months ago