AIFrameResearch / SPO

Segment Policy Optimization: Improved Credit Assignment in Reinforcement Learning for LLMs
16 · Updated 3 weeks ago

Alternatives and similar repositories for SPO

Users who are interested in SPO are comparing it to the libraries listed below.
