borninfreedom / rlai-exercises
Exercise Solutions for Reinforcement Learning: An Introduction [2nd Edition]
☆16Updated 4 years ago
Alternatives and similar repositories for rlai-exercises:
Users that are interested in rlai-exercises are comparing it to the libraries listed below
- ☆102Updated 2 months ago
- ☆96Updated 3 years ago
- Implementation of HIRO (Data-Efficient Hierarchical Reinforcement Learning)☆107Updated 3 years ago
- a clean and robust Pytorch implementation of SAC on continuous action space☆73Updated last week
- Implement many Sparse Reward algorithms in Gym Fetch environment☆86Updated 4 years ago
- ILSwiss is an Easy-to-run Imitation Learning (IL, or Learning from Demonstration, LfD) and also Reinforcement Learning (RL) framework (te…☆166Updated last year
- Multi-agent project (commnet, bicnet, maddpg) in pytorch for Multi-Agent Particle Environment☆117Updated 2 years ago
- Multi-Agent Constrained Policy Optimisation (MACPO; MAPPO-L).☆166Updated last year
- [NeurIPS 2021] CDS achieves remarkable success in challenging benchmarks SMAC and GRF by balancing sharing and diversity.☆85Updated 2 years ago
- pytorch实现的一些MARL算法☆66Updated 3 years ago
- This is the official implementation of Multi-Agent PPO.☆104Updated 2 years ago
- Deep recurrent Q learning on CartPole-v1 environment☆89Updated last year
- A plotter for reinforcement learning (RL)☆223Updated 3 years ago
- The Code for Paper “Relay Hindsight Experience Replay: Self-Guided Continual Reinforcement Learning for Sequential Object Manipulation Ta…☆150Updated 9 months ago
- ☆36Updated 10 months ago
- There will be updates later☆84Updated 5 years ago
- Implementation of Deep Deterministic Policy Gradient (DDPG) with Prioritized Experience Replay (PER)☆50Updated 2 months ago
- ☆16Updated 2 years ago
- Codes accompanying the paper "RODE: Learning Roles to Decompose Multi-Agent Tasks (ICLR 2021, https://arxiv.org/abs/2010.01523). RODE is …☆74Updated 4 months ago
- MATE: the Multi-Agent Tracking Environment.☆44Updated 2 years ago
- This repo relates to the survey paper <Goal-Conditioned Reinforcement Learning: Problems and Solutions>. We collects widely used benchmar…☆124Updated last year
- The implementation of LSTM-TD3.☆79Updated 2 years ago
- Intelligent control algorithm and simulation environment.☆17Updated 5 years ago
- Training code PRIMAL2 - Public Repo☆174Updated 11 months ago
- The implementation of AAAI'22 paper "Multi-Agent Incentive Communication via Decentralized Teammate Modeling".☆53Updated last year
- Solve BipedalWalkerHardcore-v2 with TD3☆88Updated last year
- ☆42Updated 3 years ago
- We extend pymarl2 to pymarl3, equipping the MARL algorithms with permutation invariance and permutation equivariance properties. The enh…☆154Updated last year
- ☆200Updated last year
- ☆123Updated 3 years ago