sanowl / Self-Correcting-LLM--Reinforcement-Learning-

This my attempt to create Self-Correcting-LLM based on the paper Training Language Models to Self-Correct via Reinforcement Learning by google
29Updated 2 weeks ago

Alternatives and similar repositories for Self-Correcting-LLM--Reinforcement-Learning-:

Users that are interested in Self-Correcting-LLM--Reinforcement-Learning- are comparing it to the libraries listed below