borninfreedom / rlai-exercisesLinks

Exercise Solutions for Reinforcement Learning: An Introduction [2nd Edition]

☆16

Alternatives and similar repositories for rlai-exercises

Users that are interested in rlai-exercises are comparing it to the libraries listed below

Sorting:

BIT-aerial-robotics / AquaML
☆103Updated 4 months ago
YangRui2015 / Sparse-Reward-Algorithms
Implement many Sparse Reward algorithms in Gym Fetch environment
☆88Updated 4 years ago
Jonathan-Pearce / DDPG_PER
Implementation of Deep Deterministic Policy Gradient (DDPG) with Prioritized Experience Replay (PER)
☆51Updated 4 months ago
UnrealTracking / mate
MATE: the Multi-Agent Tracking Environment.
☆45Updated 2 years ago
TimeBreaker / Adversarial-Reinforcement-Learning-Papers
Adversarial Reinforcement Learning papers (single-agent setting and multi-agent setting)
☆71Updated 2 years ago
ubiquition / drl
☆23Updated 2 years ago
watakandai / hiro_pytorch
Implementation of HIRO (Data-Efficient Hierarchical Reinforcement Learning)
☆109Updated 4 years ago
chauncygu / Safe-Multi-Agent-Mujoco
Safe Multi-Agent MuJoCo benchmark for safe multi-agent reinforcement learning research.
☆64Updated last year
Ericonaldo / ILSwiss
ILSwiss is an Easy-to-run Imitation Learning (IL, or Learning from Demonstration, LfD) and also Reinforcement Learning (RL) framework (te…
☆172Updated last year
oxwhirl / facmac
☆99Updated 3 years ago
morning9393 / HAPPO-HATRPO
☆41Updated 3 years ago
apexrl / GCRL-Collection
This repo relates to the survey paper <Goal-Conditioned Reinforcement Learning: Problems and Solutions>. We collects widely used benchmar…
☆137Updated 2 years ago
kaixindelele / RHER
The Code for Paper “Relay Hindsight Experience Replay: Self-Guided Continual Reinforcement Learning for Sequential Object Manipulation Ta…
☆155Updated 11 months ago
LSTM-Kirigaya / MaplessNavigation
reinforcement learning algorithm for mapless navigation
☆68Updated 4 years ago
chauncygu / Multi-Agent-Constrained-Policy-Optimisation
Multi-Agent Constrained Policy Optimisation (MACPO; MAPPO-L).
☆179Updated last year
XinJingHao / SAC-Continuous-Pytorch
a clean and robust Pytorch implementation of SAC on continuous action space
☆81Updated 2 months ago
yangtao121 / AquaRL
☆16Updated 2 years ago
lich14 / CDS
[NeurIPS 2021] CDS achieves remarkable success in challenging benchmarks SMAC and GRF by balancing sharing and diversity.
☆86Updated 2 years ago
jingranburangyongzhongwen / torchMARL
pytorch实现的一些MARL算法
☆67Updated 4 years ago
YangShengqi / paper
☆42Updated 2 years ago
onewarmheart / IntelligentControl
Intelligent control algorithm and simulation environment.
☆17Updated 5 years ago
Sonkyunghwan / QTRAN
There will be updates later
☆85Updated 6 years ago
XuehaiPan / mate
MATE: the Multi-Agent Tracking Environment.
☆38Updated 2 years ago
ycz0512 / SAC-HER
Implementation of Soft Actor-Critic with Hindsight Experience Replay
☆17Updated 4 years ago
Git-123-Hub / maddpg-pettingzoo-pytorch
implementation of MADDPG using PettingZoo and PyTorch
☆148Updated last year
XinJingHao / TD3-BipedalWalkerHardcore-v2
Solve BipedalWalkerHardcore-v2 with TD3
☆90Updated 2 years ago
TobiasLv / RAD
☆51Updated 3 weeks ago
rlchina / RLCN
☆124Updated 3 years ago
Jingliang-Duan / DSAC-v1
DSAC; Distributional Soft Actor-Critic
☆129Updated 4 months ago
wjh720 / QPLEX
☆95Updated 4 years ago