AIFrameResearch / SPO

Segment Policy Optimization: Effective Segment-Level Credit Assignment in RL for Large Language Models
37 · Updated 3 weeks ago

Alternatives and similar repositories for SPO

Users who are interested in SPO are comparing it to the libraries listed below.
