xyliu-cs / RISE
Official Implementation of RISE (Reinforcing Reasoning with Self-Verification)
☆28 · Updated last week
Alternatives and similar repositories for RISE
Users interested in RISE are comparing it to the libraries listed below.
- ☆27 · Updated 3 weeks ago
- StepCoder: Improve Code Generation with Reinforcement Learning from Compiler Feedback ☆67 · Updated 10 months ago
- A novel approach to improving the safety of large language models, enabling them to transition effectively from an unsafe to a safe state. ☆61 · Updated last month
- Training and Benchmarking LLMs for Code Preference. ☆33 · Updated 8 months ago
- CRUXEval: Code Reasoning, Understanding, and Execution Evaluation ☆148 · Updated 9 months ago
- InstructCoder: Instruction Tuning Large Language Models for Code Editing | Oral at ACL 2024 SRW ☆61 · Updated 9 months ago
- A distributed, extensible, secure solution for evaluating machine-generated code with unit tests in multiple programming languages. ☆55 · Updated 8 months ago
- ☆35 · Updated 2 years ago
- Code for the TMLR 2023 paper "PPOCoder: Execution-based Code Generation using Deep Reinforcement Learning" ☆113 · Updated last year
- xCodeEval: A Large Scale Multilingual Multitask Benchmark for Code Understanding, Generation, Translation and Retrieval ☆84 · Updated 9 months ago
- [ICML 2025] Teaching Language Models to Critique via Reinforcement Learning ☆102 · Updated 2 months ago
- ☆31 · Updated 3 weeks ago
- ☆28 · Updated 9 months ago
- [ICLR 2024] Evaluating Large Language Models at Evaluating Instruction Following ☆127 · Updated last year
- XFT: Unlocking the Power of Code Instruction Tuning by Simply Merging Upcycled Mixture-of-Experts ☆33 · Updated last year
- Interpretable Contrastive Monte Carlo Tree Search Reasoning ☆49 · Updated 8 months ago
- [NeurIPS 2024] OlympicArena: Benchmarking Multi-discipline Cognitive Reasoning for Superintelligent AI ☆102 · Updated 4 months ago
- CodeUltraFeedback: aligning large language models to coding preferences ☆71 · Updated last year
- ☆48 · Updated last year
- RepoQA: Evaluating Long-Context Code Understanding ☆109 · Updated 8 months ago
- The repository for the paper "DebugBench: Evaluating Debugging Capability of Large Language Models". ☆78 · Updated last year
- [ICLR'24 Spotlight] A language model (LM)-based emulation framework for identifying the risks of LM agents with tool use ☆150 · Updated last year
- Benchmarking LLMs' Emotional Alignment with Humans ☆104 · Updated 5 months ago
- Astraios: Parameter-Efficient Instruction Tuning Code Language Models ☆58 · Updated last year
- RL Scaling and Test-Time Scaling (ICML'25) ☆108 · Updated 5 months ago
- R-Judge: Benchmarking Safety Risk Awareness for LLM Agents (EMNLP Findings 2024) ☆80 · Updated 2 months ago
- Official repository for R2E-Gym: Procedural Environment Generation and Hybrid Verifiers for Scaling Open-Weights SWE Agents ☆112 · Updated last week
- ☆38 · Updated 2 months ago
- [NeurIPS 2024] Evaluation harness for SWT-Bench, a benchmark for evaluating LLM repository-level test generation ☆51 · Updated last month
- Evaluate the Quality of Critique ☆36 · Updated last year