lzhxmu / CPPO

CPPO: Accelerating the Training of Group Relative Policy Optimization-Based Reasoning Models
117Updated last week

Alternatives and similar repositories for CPPO:

Users that are interested in CPPO are comparing it to the libraries listed below