firechecking / CleanRLLinks
Reinforcement Learning algorithms and use-cases, including DQN, PG, A3C, PPO etc. and RLHF, AlphaZero implementations. Designed for clarity, ease of use, and educational purposes.
☆41Updated last year
Alternatives and similar repositories for CleanRL
Users that are interested in CleanRL are comparing it to the libraries listed below
Sorting:
- rl-papers☆50Updated 2 years ago
- Solve BipedalWalkerHardcore-v2 with TD3☆96Updated 2 years ago
- basic algorithms of reinforcement learning☆215Updated 2 years ago
- GitHub's code repository is all you need☆359Updated 2 years ago
- [动手学强化学习]系列,基于pytorch。☆59Updated 4 years ago
- 强化学习-中文笔记&资源-以python实例为主-由浅入深☆109Updated 5 years ago
- Learning Resources And Links Of Reinforcement Learning (updating)☆288Updated 4 years ago
- ☆55Updated 7 months ago
- Source Code☆220Updated last year
- Pytorch realization of multiple Deep Reinforcement Learning alogrithms(DQN,DDPG,TD3,PPO,A3C...) with openai gym☆58Updated 4 years ago
- An easier PyTorch deep reinforcement learning library.☆249Updated last year
- 动手学强化学习代码☆65Updated last year
- ☆174Updated 2 years ago
- basic theory and code of RL.☆48Updated 2 years ago
- RL-code for beginners. Enjoying!☆117Updated 5 years ago
- ☆106Updated 5 months ago
- DQN examples codes in chapter 4☆44Updated 2 years ago
- 强化学习算法库,包含了目前主流的强化学习算法(Value based and Policy based)的代码,代码都经过调试并可以运行☆106Updated 2 years ago
- DSAC-v2; DSAC-T; DASC; Distributional Soft Actor-Critic☆430Updated last month
- PPO, DDPG, SAC implementation on mujoco environment☆124Updated 3 years ago
- 深度强化学习各算法介绍与Pytorch实现☆74Updated last year
- The mirror of RL_Coding_Exercise.☆114Updated last year
- Code for running RL experiments on continuing (non-episodic) problems.☆21Updated 4 months ago
- Transformer in RL for decision-making☆103Updated 2 years ago
- 这是一个关于基于模型的强化学习的资料,包括一些代码地址、paper、slide等。☆46Updated 5 years ago
- ☆99Updated last month
- Tutorial4RL: Tutorial for Reinforcement Learning. 强化学习入门教程.☆186Updated 2 months ago
- OpenAI团队的深度强化学习教程中文版☆32Updated 5 years ago
- DSAC; Distributional Soft Actor-Critic☆136Updated 10 months ago
- A clean and robust Pytorch implementation of PPO on continuous action space.☆169Updated last year