GAIR-NLP / cs2916
★27 · Updated 7 months ago
Alternatives and similar repositories for cs2916
Users who are interested in cs2916 are comparing it to the repositories listed below:
- [AAAI 2025 oral] Evaluating Mathematical Reasoning Beyond Accuracy ★73 · Updated 3 weeks ago
- [NeurIPS'24] Official code for *🎯DART-Math: Difficulty-Aware Rejection Tuning for Mathematical Problem-Solving* ★117 · Updated 10 months ago
- ★13 · Updated last year
- Implementation for the research paper "Enhancing LLM Reasoning via Critique Models with Test-Time and Training-Time Supervision". ★56 · Updated 11 months ago
- Resources for the Enigmata Project. ★72 · Updated 2 months ago
- ★58 · Updated last year
- GSM-Plus: Data, Code, and Evaluation for Enhancing Robust Mathematical Reasoning in Math Word Problems. ★63 · Updated last year
- ★69 · Updated last year
- The official repository of the Omni-MATH benchmark. ★88 · Updated 10 months ago
- Official repository for ACL 2025 paper "ProcessBench: Identifying Process Errors in Mathematical Reasoning" ★174 · Updated 5 months ago
- Trending projects & awesome papers about data-centric LLM studies. ★38 · Updated 5 months ago
- [NeurIPS 2025] Implementation for the paper "The Surprising Effectiveness of Negative Reinforcement in LLM Reasoning" ★115 · Updated 2 months ago
- ★50 · Updated last year
- [EMNLP 2024] Source code for the paper "Learning Planning-based Reasoning with Trajectory Collection and Process Rewards Synthesizing". ★82 · Updated 9 months ago
- A curated list of awesome resources dedicated to Scaling Laws for LLMs ★79 · Updated 2 years ago
- [NeurIPS 2024] The official implementation of the paper "Chain of Preference Optimization: Improving Chain-of-Thought Reasoning in LLMs". ★131 · Updated 7 months ago
- Official code for paper "SPA-RL: Reinforcing LLM Agent via Stepwise Progress Attribution" ★47 · Updated last month
- ★75 · Updated 11 months ago
- Curation of resources for LLM mathematical reasoning, most of which are screened by @tongyx361 to ensure high quality and accompanied wit… ★144 · Updated last year
- ★51 · Updated 5 months ago
- The repository of the project "Fine-tuning Large Language Models with Sequential Instructions", code base comes from open-instruct and LA… ★29 · Updated 11 months ago
- The rule-based evaluation subset and code implementation of Omni-MATH ★24 · Updated 10 months ago
- The official implementation of "ICDPO: Effectively Borrowing Alignment Capability of Others via In-context Direct Preference Optimization… ★16 · Updated last year
- Evaluate the Quality of Critique ★36 · Updated last year
- [ACL'25] The official code repository for PRMBench: A Fine-grained and Challenging Benchmark for Process-Level Reward Models. ★81 · Updated 8 months ago
- [ACL 2024 Findings] CriticBench: Benchmarking LLMs for Critique-Correct Reasoning ★27 · Updated last year
- ★75 · Updated last year
- A repo for open research on building large reasoning models ★108 · Updated last week
- Code accompanying the paper "Noise Contrastive Alignment of Language Models with Explicit Rewards" (NeurIPS 2024) ★57 · Updated 11 months ago
- Easy-to-Hard Generalization: Scalable Alignment Beyond Human Supervision ★124 · Updated last year