firechecking / CleanRLLinks
Reinforcement Learning algorithms and use-cases, including DQN, PG, A3C, PPO etc. and RLHF, AlphaZero implementations. Designed for clarity, ease of use, and educational purposes.
☆42Updated last year
Alternatives and similar repositories for CleanRL
Users that are interested in CleanRL are comparing it to the libraries listed below
Sorting:
- rl-papers☆50Updated 2 years ago
- basic algorithms of reinforcement learning☆216Updated 2 years ago
- GitHub's code repository is all you need☆373Updated 2 years ago
- An easier PyTorch deep reinforcement learning library.☆252Updated last year
- Pytorch realization of multiple Deep Reinforcement Learning alogrithms(DQN,DDPG,TD3,PPO,A3C...) with openai gym☆58Updated 4 years ago
- 强化学习-中文笔记&资源-以python实例为主-由浅入深☆109Updated 5 years ago
- [动手学强化学习]系列,基于pytorch。☆59Updated 4 years ago
- Source Code☆223Updated last year
- Tutorial4RL: Tutorial for Reinforcement Learning. 强化学习入门教程.☆190Updated 2 months ago
- ☆173Updated 2 years ago
- Learning Resources And Links Of Reinforcement Learning (updating)☆289Updated 4 years ago
- OpenAI团队的深度强化学习教程中文版☆91Updated 2 years ago
- Solve BipedalWalkerHardcore-v2 with TD3☆96Updated 2 years ago
- ☆55Updated 7 months ago
- 动手学强化学习代码☆66Updated 2 years ago
- RL algorithms☆141Updated 4 years ago
- Implement reinforcement learning algorithms in Pytorch☆34Updated 4 years ago
- OpenAI团队的深度强化学习教程中文版☆33Updated 5 years ago
- DSAC-v2; DSAC-T; DASC; Distributional Soft Actor-Critic☆433Updated 2 months ago
- RLlib超参数详解(中文)☆18Updated 4 years ago
- DQN examples codes in chapter 4☆44Updated 2 years ago
- ☆68Updated 2 years ago
- TD3 in Pytorch☆35Updated 4 years ago
- 真-极简强化学习(基于torch的强化学习框架pfrl)☆100Updated 3 years ago
- 强化学习算法库,包含了目前主流的强化学习算法(Value based and Policy based)的代码,代码都经过调试并可以运行☆108Updated 2 years ago
- The official implementation of "Transformer in Transformer as Backbone for Deep Reinforcement Learning"☆56Updated 2 years ago
- Python Implementation of Reinforcement Learning: An Introduction☆30Updated 6 years ago
- basic theory and code of RL.☆48Updated 2 years ago
- Deep recurrent Q learning on CartPole-v1 environment☆94Updated 2 years ago
- Transformer in RL for decision-making☆104Updated 2 years ago