firechecking / CleanRL

Reinforcement Learning algorithms and use-cases, including DQN, PG, A3C, PPO etc. and RLHF, AlphaZero implementations. Designed for clarity, ease of use, and educational purposes.
26Updated 7 months ago

Alternatives and similar repositories for CleanRL:

Users that are interested in CleanRL are comparing it to the libraries listed below