AIFrameResearch / SPOLinks

Segment Policy Optimization: Effective Segment-Level Credit Assignment in RL for Large Language Models
43Updated 3 months ago

Alternatives and similar repositories for SPO

Users that are interested in SPO are comparing it to the libraries listed below

Sorting: