sanowl / Self-Correcting-LLM--Reinforcement-Learning-
This is my attempt to create a self-correcting LLM based on the paper "Training Language Models to Self-Correct via Reinforcement Learning" by Google.
38 · Jul 9, 2025 · Updated 8 months ago

Alternatives and similar repositories for Self-Correcting-LLM--Reinforcement-Learning-

Users that are interested in Self-Correcting-LLM--Reinforcement-Learning- are comparing it to the libraries listed below.
