NineAbyss / S2R

This is the official implementation of the paper "S²R: Teaching LLMs to Self-verify and Self-correct via Reinforcement Learning".
50 · Updated 2 weeks ago

Alternatives and similar repositories for S2R:

Users who are interested in S2R are comparing it to the libraries listed below.