AIFrameResearch / SPO

Segment Policy Optimization: Improved Credit Assignment in Reinforcement Learning for LLMs
34 · Updated last week

Alternatives and similar repositories for SPO

Users who are interested in SPO are comparing it to the libraries listed below.
