InternLM / POLARLinks

Pre-trained, Scalable, High-performance Reward Models via Policy Discriminative Learning.
157Updated last week

Alternatives and similar repositories for POLAR

Users that are interested in POLAR are comparing it to the libraries listed below

Sorting: