lzhxmu / CPPO

CPPO: Accelerating the Training of Group Relative Policy Optimization-Based Reasoning Models
66Updated this week

Alternatives and similar repositories for CPPO:

Users that are interested in CPPO are comparing it to the libraries listed below