firechecking / CleanRLLinks

Reinforcement Learning algorithms and use-cases, including DQN, PG, A3C, PPO etc. and RLHF, AlphaZero implementations. Designed for clarity, ease of use, and educational purposes.
30Updated last year

Alternatives and similar repositories for CleanRL

Users that are interested in CleanRL are comparing it to the libraries listed below

Sorting: