efficientscaling / Z1Links

Repo for "Z1: Efficient Test-time Scaling with Code"

☆63

Alternatives and similar repositories for Z1

Users that are interested in Z1 are comparing it to the libraries listed below

Sorting:

TIGER-AI-Lab / CritiqueFineTuning
Code for "Critique Fine-Tuning: Learning to Critique is More Effective than Learning to Imitate" [COLM 2025]
☆169Updated 3 weeks ago
TIGER-AI-Lab / General-Reasoner
General Reasoner: Advancing LLM Reasoning Across All Domains
☆156Updated last month
GAIR-NLP / OctoThinker
Revisiting Mid-training in the Era of Reinforcement Learning Scaling
☆159Updated last week
THUDM / T1
RL Scaling and Test-Time Scaling (ICML'25)
☆109Updated 6 months ago
SalesforceAIResearch / GemFilter
☆82Updated 6 months ago
TIGER-AI-Lab / AceCoder
The official repo for "AceCoder: Acing Coder RL via Automated Test-Case Synthesis" [ACL25]
☆87Updated 3 months ago
MiniMax-AI / SynLogic
The official repo of SynLogic: Synthesizing Verifiable Reasoning Data at Scale for Learning Logical Reasoning and Beyond
☆160Updated 3 weeks ago
YangLing0818 / SuperCorrect-llm
[ICLR 2025] SuperCorrect: Advancing Small LLM Reasoning with Thought Template Distillation and Self-Correction
☆76Updated 4 months ago
HKUNLP / critic-rl
[ICML 2025] Teaching Language Models to Critique via Reinforcement Learning
☆105Updated 2 months ago
Infini-AI-Lab / Multiverse
☆79Updated last week
nick7nlp / FastCuRL
FastCuRL: Curriculum Reinforcement Learning with Stage-wise Context Scaling for Efficient LLM Reasoning
☆52Updated 2 months ago
hkust-nlp / PreSelect
[ICML 2025] Predictive Data Selection: The Data That Predicts Is the Data That Teaches
☆53Updated 5 months ago
GuanghaoYe / Emergence-of-Thinking
☆53Updated 5 months ago
ReasoningTransfer / Transferability-of-LLM-Reasoning
☆80Updated 2 weeks ago
QwenLM / Self-Lengthen
☆87Updated 8 months ago
zhijie-group / SIFT
SIFT: Grounding LLM Reasoning in Contexts via Stickers
☆57Updated 4 months ago
imagination-research / lbt
[NeurIPS 2024] Can LLMs Learn by Teaching for Better Reasoning? A Preliminary Study
☆52Updated 8 months ago
RM-R1-UIUC / RM-R1
RM-R1: Unleashing the Reasoning Potential of Reward Models
☆118Updated last month
NuoJohnChen / JudgeLRM
☆32Updated 3 months ago
MiroMindAsia / MiroMind-M1
☆84Updated last week
sail-sg / scaling-with-vocab
[NeurIPS-2024] 📈 Scaling Laws with Vocabulary: Larger Models Deserve Larger Vocabularies https://arxiv.org/abs/2407.13623
☆86Updated 10 months ago
GAIR-NLP / AIME-Preview
☆71Updated 4 months ago
xufangzhi / phi-Decoding
[ACL 2025] An inference-time decoding strategy with adaptive foresight sampling
☆101Updated 2 months ago
NVlabs / Tool-N1
☆186Updated 2 months ago
Gen-Verse / CURE
Open-Source LLM Coders with Co-Evolving Reinforcement Learning
☆103Updated last week
GeniusHTX / TALE
☆126Updated 2 months ago
multimodal-art-projection / LatentCoT-Horizon
📖 This is a repository for organizing papers, codes, and other resources related to Latent Reasoning.
☆171Updated this week
sunblaze-ucb / Intuitor
Code for the paper: "Learning to Reason without External Rewards"
☆337Updated 3 weeks ago
DAMO-NLP-SG / LongPO
[ICLR 2025] LongPO: Long Context Self-Evolution of Large Language Models through Short-to-Long Preference Optimization
☆38Updated 5 months ago
menhguin / minp_paper
Code Implementation, Evaluations, Documentation, Links and Resources for Min P paper
☆37Updated 4 months ago