NineAbyss / S2R

This is the official implementation of the paper "S²R: Teaching LLMs to Self-verify and Self-correct via Reinforcement Learning"
58Updated last month

Alternatives and similar repositories for S2R:

Users that are interested in S2R are comparing it to the libraries listed below