InternLM / POLARLinks

Pre-trained, Scalable, High-performance Reward Models via Policy Discriminative Learning.
163Updated 3 months ago

Alternatives and similar repositories for POLAR

Users that are interested in POLAR are comparing it to the libraries listed below

Sorting: