resistzzz / Co-RewardLinks

Co-Reward: Self-supervised RL for LLM Reasoning via Contrastive Agreement
26Updated this week

Alternatives and similar repositories for Co-Reward

Users that are interested in Co-Reward are comparing it to the libraries listed below

Sorting: