alphadl / R1
Enhanced GRPO with more verifiable rewards and real-time evaluators
⭐ 37 · Updated 6 months ago
Alternatives and similar repositories for R1
Users interested in R1 are comparing it to the repositories listed below.
- [ICML 2024] Selecting High-Quality Data for Training Language Models · ⭐ 196 · Updated last week
- [NeurIPS 2025] Implementation for the paper "The Surprising Effectiveness of Negative Reinforcement in LLM Reasoning" · ⭐ 138 · Updated last month
- Model merging is a highly efficient approach for long-to-short reasoning. · ⭐ 92 · Updated 2 months ago
- Repository for "Label Words are Anchors: An Information Flow Perspective for Understanding In-Context Learning" · ⭐ 168 · Updated last year
- [2025-TMLR] A Survey on the Honesty of Large Language Models · ⭐ 63 · Updated last year
- My commonly used tools · ⭐ 63 · Updated 11 months ago
- [ACL 2024 (Oral)] A Prospector of Long-Dependency Data for Large Language Models · ⭐ 58 · Updated last year
- The repository of DEER, a Dynamic Early Exit in Reasoning method for Large Reasoning Language Models · ⭐ 177 · Updated 5 months ago
- [ACL 2024] The official codebase for the paper "Self-Distillation Bridges Distribution Gap in Language Model Fine-tuning" · ⭐ 138 · Updated last year
- [ACL'24] Superfiltering: Weak-to-Strong Data Filtering for Fast Instruction-Tuning · ⭐ 184 · Updated 5 months ago
- The repo for the paper "Mr-Ben: A Comprehensive Meta-Reasoning Benchmark for Large Language Models" · ⭐ 52 · Updated last year
- Extrapolating RLVR to General Domains without Verifiers · ⭐ 184 · Updated 4 months ago
- Laser: Learn to Reason Efficiently with Adaptive Length-based Reward Shaping · ⭐ 61 · Updated 6 months ago
- [ACL 2025] A Neural-Symbolic Self-Training Framework · ⭐ 117 · Updated 6 months ago
- ⭐ 18 · Updated last year
- Code for SFT, RLHF, and DPO, designed for vision-based LLMs, including the LLaVA models and the LLaMA-3.2-vi… · ⭐ 117 · Updated 6 months ago
- Paper list and datasets for the paper "A Survey on Data Selection for LLM Instruction Tuning" · ⭐ 47 · Updated last year
- ⭐ 292 · Updated 5 months ago
- [ICLR 2025] 🧬 RegMix: Data Mixture as Regression for Language Model Pre-training (Spotlight) · ⭐ 181 · Updated 10 months ago
- A method of ensemble learning for heterogeneous large language models. · ⭐ 64 · Updated last year
- [ICML'2024] Can AI Assistants Know What They Don't Know? · ⭐ 85 · Updated last year
- [NeurIPS 2024] The official implementation of the paper "Chain of Preference Optimization: Improving Chain-of-Thought Reasoning in LLMs" · ⭐ 134 · Updated 9 months ago
- Paper collections of multimodal LLMs for Math/STEM/Code · ⭐ 131 · Updated last month
- [EMNLP 2024] Source code for the paper "Learning Planning-based Reasoning with Trajectory Collection and Process Rewards Synthesizing" · ⭐ 83 · Updated 11 months ago
- [ACL '25] The official code repository for "PRMBench: A Fine-grained and Challenging Benchmark for Process-Level Reward Models" · ⭐ 85 · Updated 10 months ago
- A comprehensive collection of work on learning from rewards in the post-training and test-time scaling of LLMs, with a focus on both reward model… · ⭐ 60 · Updated 6 months ago
- mPLUG-HalOwl: Multimodal Hallucination Evaluation and Mitigating · ⭐ 98 · Updated last year
- LLaMA-MoE v2: Exploring Sparsity of LLaMA from the Perspective of Mixture-of-Experts with Post-Training · ⭐ 89 · Updated last year
- [ICML 2025] M-STAR (Multimodal Self-Evolving TrAining for Reasoning): Diving into Self-Evolving Training for Multimodal Reasoning · ⭐ 69 · Updated 5 months ago
- [NeurIPS 2024] CharXiv: Charting Gaps in Realistic Chart Understanding in Multimodal LLMs · ⭐ 135 · Updated 7 months ago