InternLM / POLARLinks

Pre-trained, Scalable, High-performance Reward Models via Policy Discriminative Learning.
56Updated this week

Alternatives and similar repositories for POLAR

Users that are interested in POLAR are comparing it to the libraries listed below

Sorting: