bobxwu / learning-from-rewards-llm-papersView on GitHub
A comrephensive collection of learning from rewards in the post-training and test-time scaling of LLMs, with a focus on both reward models and learning strategies across training, inference, and post-inference stages.
66Jun 13, 2025Updated 9 months ago

Alternatives and similar repositories for learning-from-rewards-llm-papers

Users that are interested in learning-from-rewards-llm-papers are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

Are these results useful?