firechecking / CleanRL
Reinforcement Learning algorithms and use-cases, including DQN, PG, A3C, PPO etc. and RLHF, AlphaZero implementations. Designed for clarity, ease of use, and educational purposes.
☆29Updated 10 months ago
Alternatives and similar repositories for CleanRL:
Users that are interested in CleanRL are comparing it to the libraries listed below
- [动手学强化学习]系列,基于pytorch。☆54Updated 3 years ago
- rl-papers☆47Updated 2 years ago
- Solve BipedalWalkerHardcore-v2 with TD3☆88Updated last year
- Pytorch realization of multiple Deep Reinforcement Learning alogrithms(DQN,DDPG,TD3,PPO,A3C...) with openai gym☆57Updated 3 years ago
- Master Reinforcement and Deep Reinforcement Learning using OpenAI Gym and TensorFlow☆27Updated 5 years ago
- RL algorithms☆141Updated 4 years ago
- PyTorch implementation of DQN, AC, ACER, A2C, A3C, PG, DDPG, TRPO, PPO, SAC, TD3 and ....☆52Updated 5 years ago
- TD3 in Pytorch☆31Updated 3 years ago
- DSAC-v2; DSAC-T; DASC; Distributional Soft Actor-Critic☆318Updated last month
- An easier PyTorch deep reinforcement learning library.☆201Updated 4 months ago
- A clean and robust Pytorch implementation of PPO on continuous action space.☆143Updated 10 months ago
- Implement PPO-clip and PPO-penalty on Atari, which is the only open source of PPO-penalty☆56Updated 6 years ago
- DSAC; Distributional Soft Actor-Critic☆125Updated 2 months ago
- GitHub's code repository is all you need☆349Updated 2 years ago
- Code for running RL experiments on continuing (non-episodic) problems.☆17Updated last week
- 《强化学习-原理与Python实现》的Pytorch实现。☆59Updated 4 years ago
- Source Code☆178Updated last year
- ☆164Updated last year
- 深度强化学习各算法介绍与Pytorch实现☆51Updated 9 months ago
- Implement reinforcement learning algorithms in Pytorch☆33Updated 3 years ago
- ☆59Updated 2 months ago
- A clean and robust Pytorch implementation of PPO on Discrete action space☆67Updated 10 months ago
- basic algorithms of reinforcement learning☆210Updated last year
- MARLToolkit: The Multi-Agent Rainforcement Learning Toolkit. Include implementation of MAPPO, MADDPG, QMIX, VDN, COMA, IPPO, QTRAN, MAT..…☆126Updated last year
- reinforcement learning algorithm for mapless navigation☆68Updated 3 years ago
- ☆39Updated 3 weeks ago
- DQN examples codes in chapter 4☆43Updated 2 years ago
- 强化学习-中文笔记&资源-以python实例为主-由浅入深☆99Updated 4 years ago
- 动手学强化学习代码☆53Updated last year
- ILSwiss is an Easy-to-run Imitation Learning (IL, or Learning from Demonstration, LfD) and also Reinforcement Learning (RL) framework (te…☆166Updated last year