AIFrameResearch / SPO

Segment Policy Optimization: Effective Segment-Level Credit Assignment in RL for Large Language Models
41 · Updated last month

Alternatives and similar repositories for SPO

Users who are interested in SPO are comparing it to the libraries listed below.
