lzhxmu / CPPO

CPPO: Accelerating the Training of Group Relative Policy Optimization-Based Reasoning Models
125Updated 2 weeks ago

Alternatives and similar repositories for CPPO

Users that are interested in CPPO are comparing it to the libraries listed below

Sorting: