firechecking / CleanRL
Reinforcement Learning algorithms and use-cases, including DQN, PG, A3C, PPO etc. and RLHF, AlphaZero implementations. Designed for clarity, ease of use, and educational purposes.
☆34 Updated last year
Alternatives and similar repositories for CleanRL
Users that are interested in CleanRL are comparing it to the libraries listed below
- basic algorithms of reinforcement learning ☆213 Updated 2 years ago
- rl-papers ☆48 Updated 2 years ago
- Solve BipedalWalkerHardcore-v2 with TD3 ☆91 Updated 2 years ago
- PyTorch implementation of multiple Deep Reinforcement Learning algorithms (DQN, DDPG, TD3, PPO, A3C...) with OpenAI Gym ☆56 Updated 4 years ago
- ☆169 Updated last year
- Source Code ☆207 Updated last year
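- Code for Hands-on Reinforcement Learning (动手学强化学习) ☆61 Updated last year
- Reinforcement learning: Chinese-language notes and resources, mainly Python examples, from basic to advanced ☆105 Updated 4 years ago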
- GitHub's code repository is all you need ☆355 Updated 2 years ago
- The mirror of RL_Coding_Exercise. ☆109 Updated last year
- An easier PyTorch deep reinforcement learning library. ☆239 Updated 9 months ago
- The [Hands-on Reinforcement Learning] (动手学强化学习) series, based on PyTorch. ☆57 Updated 4 years ago
- DSAC-v2; DSAC-T; DASC; Distributional Soft Actor-Critic ☆401 Updated 2 months ago
- RL algorithms ☆142 Updated 4 years ago
- ☆54 Updated 4 months ago
- An explainable and modified version of the Udacity DRL homework ☆26 Updated 5 years ago
- ☆66 Updated last year
- A clean and robust Pytorch implementation of PPO on continuous action space. ☆163 Updated last year
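- Tutorial4RL: Tutorial for Reinforcement Learning. An introductory reinforcement learning tutorial. ☆172 Updated last year
- A reinforcement learning algorithm library containing code for the current mainstream reinforcement learning algorithms (value-based and policy-based); all code has been debugged and runs ☆96 Updated last year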
- Transformer in RL for decision-making ☆100 Updated 2 years ago
- ☆87 Updated 2 months ago
- Deep recurrent Q learning on CartPole-v1 environment ☆92 Updated last year
- ☆783 Updated 2 years ago
- This is a reinforcement learning algorithm library. The code takes into account both performance and simplicity, with little dependence. ☆104 Updated 3 years ago
- Implement reinforcement learning algorithms in Pytorch ☆33 Updated 4 years ago
- Learning Resources And Links Of Reinforcement Learning (updating) ☆282 Updated 4 years ago
- TD3 in Pytorch ☆35 Updated 3 years ago
- DSAC; Distributional Soft Actor-Critic ☆132 Updated 7 months ago
- a clean and robust Pytorch implementation of SAC on continuous action space ☆86 Updated 5 months ago