firechecking / CleanRL

Reinforcement Learning algorithms and use-cases, including DQN, PG, A3C, PPO etc. and RLHF, AlphaZero implementations. Designed for clarity, ease of use, and educational purposes.
31Updated 11 months ago

Alternatives and similar repositories for CleanRL

Users that are interested in CleanRL are comparing it to the libraries listed below

Sorting: