RanZhu1989 / RL_PlayGround
This repository is a playground for beginners to learn reinforcement learning. It is a collection of simple environments and agents to get you started with reinforcement learning.
☆26Updated 9 months ago
Alternatives and similar repositories for RL_PlayGround
Users who are interested in RL_PlayGround are comparing it to the libraries listed below.
- This is a reinforcement learning algorithm library. The code takes into account both performance and simplicity, with little dependence.☆100Updated 2 years ago
- a clean and robust Pytorch implementation of SAC on continuous action space☆76Updated last month
- TD3 in Pytorch☆33Updated 3 years ago
- PyTorch implementations of multiple deep reinforcement learning algorithms (DQN, DDPG, TD3, PPO, A3C...) with OpenAI Gym☆57Updated 3 years ago
- RL algorithms☆141Updated 4 years ago
- A clean and robust implementation of Prioritized DQN and Prioritized Double DQN☆19Updated 11 months ago
- Official implementation for the NeurIPS 2023 paper: "Reduced Policy Optimization for Continuous Control with Hard Constraints"☆34Updated last year
- DQN by Matlab and Python☆30Updated 5 years ago
- Simple implementation for Constrained Policy Optimization in Pytorch☆16Updated 2 years ago
- PyTorch implementation of Constrained Reinforcement Learning for Soft Actor Critic Algorithm☆47Updated 2 years ago
- Implement some algorithms of RL☆47Updated 2 years ago
- Reinforcement learning☆30Updated 2 weeks ago
- RL Dresden Algorithm Suite☆30Updated 9 months ago
- Solve BipedalWalkerHardcore-v2 with TD3☆88Updated last year
- Implement reinforcement learning algorithms in Pytorch☆33Updated 3 years ago
- A pytorch implementation of Constrained Reinforcement Learning Algorithm, including Constrained Soft Actor Critic (Soft Actor Critic Lagr…☆37Updated last year
- ☆32Updated 6 years ago
- Hybrid action space reinforcement learning algorithms.☆12Updated 4 years ago
- Uses the Multi-Agent Deep Deterministic Policy Gradient (DDPG) algorithm to find reasonable paths for ships☆36Updated 2 years ago
- Adaptive dynamic programming (ADP)☆44Updated 4 years ago
- Simple and efficient implementation of DQN DDPG TD3 SAC PPO MADDPG MATD3 MASAC MAAC IPPO MAPPO HAPPO MAT MORL☆64Updated last month
- Source Code☆183Updated last year
- Code for "Hands-on Reinforcement Learning" (动手学强化学习)☆56Updated last year
- ☆170Updated 3 months ago
- Multi-agent learning library☆18Updated 3 years ago
- Jax and Torch Multi-Agent SAC on PettingZoo API☆81Updated 5 months ago
- ☆62Updated 2 years ago
- D3QN Pytorch☆63Updated 3 years ago
- Uses the Multi-Agent Twin Delayed Deep Deterministic Policy Gradient (TD3) algorithm to find reasonable paths for ships☆62Updated 2 years ago
- Transformer-based Multi-Agent Actor-Critic Framework☆45Updated 2 years ago