tengwang0318 / hierarchial_reward_model
Official code for "Towards Hierarchical Multi-Step Reward Models for Enhanced Reasoning in Large Language Models"
☆12 Updated 4 months ago
Alternatives and similar repositories for hierarchial_reward_model
Users who are interested in hierarchial_reward_model are comparing it to the repositories listed below
- [NeurIPS 2024] A Novel Rank-Based Metric for Evaluating Large Language Models ☆50 Updated last month
- Official codebase for "GenPRM: Scaling Test-Time Compute of Process Reward Models via Generative Reasoning". ☆79 Updated last month
- [ICML'25] Our study systematically investigates massive values in LLMs' attention mechanisms. First, we observe massive values are concen… ☆74 Updated last month
- TreeRL: LLM Reinforcement Learning with On-Policy Tree Search in ACL'25 ☆42 Updated last month
- ☆52 Updated 5 months ago