sanowl / Self-Correcting-LLM--Reinforcement-Learning-
View external linksLinks

This my attempt to create Self-Correcting-LLM based on the paper Training Language Models to Self-Correct via Reinforcement Learning by google
38Jul 9, 2025Updated 7 months ago

Alternatives and similar repositories for Self-Correcting-LLM--Reinforcement-Learning-

Users that are interested in Self-Correcting-LLM--Reinforcement-Learning- are comparing it to the libraries listed below

Sorting:

Are these results useful?