firechecking / CleanRL
Reinforcement Learning algorithms and use-cases, including DQN, PG, A3C, PPO etc. and RLHF, AlphaZero implementations. Designed for clarity, ease of use, and educational purposes.
☆28Updated 9 months ago
Alternatives and similar repositories for CleanRL:
Users that are interested in CleanRL are comparing it to the libraries listed below
- Solve BipedalWalkerHardcore-v2 with TD3☆85Updated last year
- rl-papers☆48Updated 2 years ago
- Pytorch realization of multiple Deep Reinforcement Learning alogrithms(DQN,DDPG,TD3,PPO,A3C...) with openai gym☆57Updated 3 years ago
- basic algorithms of reinforcement learning☆209Updated last year
- RL algorithms☆142Updated 4 years ago
- Implement reinforcement learning algorithms in Pytorch☆33Updated 3 years ago
- 真-极简强化学习(基于torch的强化学习框架pfrl)☆77Updated 3 years ago
- TD3 in Pytorch☆31Updated 3 years ago
- ☆62Updated last year
- ☆162Updated last year
- The official implementation of "Transformer in Transformer as Backbone for Deep Reinforcement Learning"☆55Updated last year
- An easier PyTorch deep reinforcement learning library.☆190Updated 3 months ago
- 强化学习-中文笔记&资源-以python实例为主-由浅入深☆97Updated 4 years ago
- Deep recurrent Q learning on CartPole-v1 environment☆87Updated last year
- DSAC; Distributional Soft Actor-Critic☆125Updated last month
- Source Code☆169Updated last year
- [动手学强化学习]系列,基于pytorch。☆54Updated 3 years ago
- MARLToolkit: The Multi-Agent Rainforcement Learning Toolkit. Include implementation of MAPPO, MADDPG, QMIX, VDN, COMA, IPPO, QTRAN, MAT..…☆120Updated 11 months ago
- PyTorch implementation of DQN, AC, ACER, A2C, A3C, PG, DDPG, TRPO, PPO, SAC, TD3 and ....☆52Updated 5 years ago
- 《强化学习-原理与Python实现》的Pytorch实现。☆57Updated 4 years ago
- ☆58Updated last month
- GitHub's code repository is all you need☆347Updated 2 years ago
- Simple Reinforcement learning tutorials☆14Updated 5 years ago
- Basic reinforcement learning algorithms. Including:DQN,Double DQN, Dueling DQN, SARSA, REINFORCE, baseline-REINFORCE, Actor-Critic,DDPG,D…☆94Updated 4 years ago
- A clean and robust Pytorch implementation of PPO on continuous action space.☆140Updated 9 months ago
- DSAC-v2; DSAC-T; DASC; Distributional Soft Actor-Critic☆304Updated this week
- 动手学强化学习代码☆51Updated last year
- This is a reinforcement learning algorithm library. The code takes into account both performance and simplicity, with little dependence.☆96Updated 2 years ago
- The implement of all kinds of dqn reinforcement learning with Pytorch☆93Updated 4 years ago
- OpenAI团队的深度强化学习教程中文版☆76Updated last year