firechecking / CleanRLLinks
Reinforcement Learning algorithms and use-cases, including DQN, PG, A3C, PPO etc. and RLHF, AlphaZero implementations. Designed for clarity, ease of use, and educational purposes.
☆36Updated last year
Alternatives and similar repositories for CleanRL
Users that are interested in CleanRL are comparing it to the libraries listed below
Sorting:
- rl-papers☆48Updated 2 years ago
- Source Code☆210Updated last year
- GitHub's code repository is all you need☆355Updated 2 years ago
- Solve BipedalWalkerHardcore-v2 with TD3☆94Updated 2 years ago
- basic algorithms of reinforcement learning☆214Updated 2 years ago
- [动手学强化学习]系列,基于pytorch。☆58Updated 4 years ago
- 动手学强化学习代码☆62Updated last year
- ☆54Updated 5 months ago
- Pytorch realization of multiple Deep Reinforcement Learning alogrithms(DQN,DDPG,TD3,PPO,A3C...) with openai gym☆56Updated 4 years ago
- An easier PyTorch deep reinforcement learning library.☆241Updated 10 months ago
- 强化学习-中文笔记&资源-以python实例为主-由浅入深☆107Updated 4 years ago
- Tutorial4RL: Tutorial for Reinforcement Learning. 强化学习入门教程.☆171Updated last year
- 强化学习算法库,包含了目前主流的强化学习算法(Value based and Policy based)的代码,代码都经过调试并可以运行☆97Updated 2 years ago
- Learning Resources And Links Of Reinforcement Learning (updating)☆283Updated 4 years ago
- PyTorch implementation of DQN, AC, ACER, A2C, A3C, PG, DDPG, TRPO, PPO, SAC, TD3 and ....☆52Updated 5 years ago
- ☆171Updated 2 years ago
- Master Reinforcement and Deep Reinforcement Learning using OpenAI Gym and TensorFlow☆28Updated 6 years ago
- DSAC-v2; DSAC-T; DASC; Distributional Soft Actor-Critic☆409Updated 3 months ago
- OpenAI团队的深度强化学习教程中文版☆88Updated 2 years ago
- ☆88Updated 3 months ago
- General Optimal control Problem Solver (GOPS), an easy-to-use PyTorch reinforcement learning solver package for industrial control.☆275Updated 2 weeks ago
- RL algorithms☆141Updated 4 years ago
- basic theory and code of RL.☆48Updated 2 years ago
- 基于gym的pytorch深度强化学习(DRL)(PPO,PPG,DQN,SAC,DDPG,TD3等算法)☆130Updated 3 months ago
- The mirror of RL_Coding_Exercise.☆111Updated last year
- A clean and robust Pytorch implementation of PPO on continuous action space.☆165Updated last year
- Code for running RL experiments on continuing (non-episodic) problems.☆20Updated 2 months ago
- ☆90Updated 3 years ago
- RL-code for beginners. Enjoying!☆115Updated 5 years ago
- DSAC; Distributional Soft Actor-Critic☆133Updated 8 months ago