Kwai-Klear / KlearReasonerLinks

Klear-Reasoner: Advancing Reasoning Capability via Gradient-Preserving Clipping Policy Optimization

☆81

Alternatives and similar repositories for KlearReasoner

Users that are interested in KlearReasoner are comparing it to the libraries listed below

Sorting:

GAIR-NLP / OctoThinker
Revisiting Mid-training in the Era of Reinforcement Learning Scaling
☆182Updated 6 months ago
HKUNLP / critic-rl
[ICML 2025] Teaching Language Models to Critique via Reinforcement Learning
☆120Updated 8 months ago
yaof20 / DenseMixer
Official implementation for DenseMixer: Improving MoE Post-Training with Precise Router Gradient
☆64Updated 5 months ago
TIGER-AI-Lab / General-Reasoner
General Reasoner: Advancing LLM Reasoning Across All Domains [NeurIPS25]
☆213Updated 2 months ago
test-time-interaction / TTI
☆72Updated 7 months ago
TIGER-AI-Lab / AceCoder
The official repo for "AceCoder: Acing Coder RL via Automated Test-Case Synthesis" [ACL25]
☆95Updated 9 months ago
DAMO-NLP-SG / LongPO
[ICLR 2025] LongPO: Long Context Self-Evolution of Large Language Models through Short-to-Long Preference Optimization
☆43Updated 11 months ago
open-compass / CompassVerifier
[EMNLP 2025] CompassVerifier: A Unified and Robust Verifier for LLMs Evaluation and Outcome Reward
☆62Updated 5 months ago
MiniMax-AI / SynLogic
[NeurIPS 2025] The official repo of SynLogic: Synthesizing Verifiable Reasoning Data at Scale for Learning Logical Reasoning and Beyond
☆190Updated 6 months ago
RUCAIBox / Passk_Training
The official repository of paper "Pass@k Training for Adaptively Balancing Exploration and Exploitation of Large Reasoning Models''
☆110Updated 5 months ago
sail-sg / scaling-with-vocab
[NeurIPS-2024] 📈 Scaling Laws with Vocabulary: Larger Models Deserve Larger Vocabularies https://arxiv.org/abs/2407.13623
☆89Updated last year
mathllm / MathCoder2
☆71Updated last year
SkyworkAI / MindLink
☆100Updated 5 months ago
TingchenFu / MathIF
instruction-following benchmark for large reasoning models
☆44Updated 5 months ago
yyDing1 / ScaleQuest
[ACL 2025] We introduce ScaleQuest, a scalable, novel and cost-effective data synthesis method to unleash the reasoning capability of LLM…
☆68Updated last year
Infini-AI-Lab / Multiverse
☆110Updated 4 months ago
GAIR-NLP / AIME-Preview
☆80Updated 10 months ago
THUDM / T1
RL Scaling and Test-Time Scaling (ICML'25)
☆112Updated last year
inclusionAI / PromptCoT
A unified suite for generating elite reasoning problems and training high-performance LLMs, including pioneering attention-free architect…
☆132Updated 3 months ago
Interplay-LM-Reasoning / Interplay-LM-Reasoning
☆130Updated this week
open-compass / GPassK
[ACL 2025] Are Your LLMs Capable of Stable Reasoning?
☆32Updated 5 months ago
verl-project / verl-recipe
A set of examples based on verl for end-to-end RL training recipes.
☆139Updated last week
hkust-nlp / WebExplorer
The official repo of "WebExplorer: Explore and Evolve for Training Long-Horizon Web Agents"
☆98Updated 4 months ago
hkust-nlp / mstar
[ICML 2025] M-STAR (Multimodal Self-Evolving TrAining for Reasoning) Project. Diving into Self-Evolving Training for Multimodal Reasoning
☆71Updated 6 months ago
ReasoningTransfer / Transferability-of-LLM-Reasoning
☆108Updated last month
YangLing0818 / SuperCorrect-llm
[ICLR 2025] SuperCorrect: Advancing Small LLM Reasoning with Thought Template Distillation and Self-Correction
☆86Updated 10 months ago
abdelfattah-lab / SplitReason
☆21Updated last month
zhenyuhe00 / SWE-Swiss
SWE-Swiss: A Multi-Task Fine-Tuning and RL Recipe for High-Performance Issue Resolution
☆102Updated 4 months ago
GAIR-NLP / OlympicArena
[NeurIPS 2024] OlympicArena: Benchmarking Multi-discipline Cognitive Reasoning for Superintelligent AI
☆107Updated 10 months ago
sail-sg / SkyLadder
The official repository for SkyLadder: Better and Faster Pretraining via Context Window Scheduling
☆42Updated last month