LeapLabTHU/Absolute-Zero-Reasoner

Official Repository of Absolute Zero Reasoner

☆1,878

Alternatives and similar repositories for Absolute-Zero-Reasoner

Users who are interested in Absolute-Zero-Reasoner are comparing it to the repositories listed below.

LeapLabTHU / diver-ct
☆14 · Dec 19, 2024 · Updated last year
LeapLabTHU / UniTTA
☆21 · Mar 5, 2025 · Updated last year
LeapLabTHU / limit-of-RLVR
repo for paper https://arxiv.org/abs/2504.13837
☆346 · Dec 17, 2025 · Updated 7 months ago
SHI-Labs / IMG-Multimodal-Diffusion-Alignment
IMG: Calibrating Diffusion Models via Implicit Multimodal Guidance, ICCV 2025
☆30 · Oct 1, 2025 · Updated 9 months ago
PRIME-RL / TTRL
[NeurIPS 2025] TTRL: Test-Time Reinforcement Learning
☆1,103 · Apr 15, 2026 · Updated 3 months ago
Open-Reasoner-Zero / Open-Reasoner-Zero
Official Repo for Open-Reasoner-Zero
☆2,096 · Jun 2, 2025 · Updated last year
LeapLabTHU / GridMix
Repository of GridMix (ICLR 2025)
☆36 · Mar 18, 2025 · Updated last year
Chengsong-Huang / R-Zero
[ICLR2026] codes for R-Zero: Self-Evolving Reasoning LLM from Zero Data (https://www.arxiv.org/pdf/2508.05004)
☆823 · Feb 4, 2026 · Updated 5 months ago
Andrewzh112 / AI-Research-Interview-Lab
☆31 · Nov 14, 2025 · Updated 8 months ago
LeapLabTHU / RvR
🔥 Regeneration over editing: unlocking more effective image refinement!
☆51 · May 26, 2026 · Updated 2 months ago
rllm-org / rllm
Democratizing Reinforcement Learning for LLMs
☆5,731 · Updated this week
sunblaze-ucb / Intuitor
[ICLR 2026] Learning to Reason without External Rewards
☆418 · Jan 26, 2026 · Updated 6 months ago
LeapLabTHU / DAT-Jittor
Jittor implementation of Vision Transformer with Deformable Attention
☆32 · Mar 1, 2022 · Updated 4 years ago
Simplified-Reasoning / LUFFY
Official Repository of "Learning to Reason under Off-Policy Guidance"
☆460 · Mar 20, 2026 · Updated 4 months ago
verl-project / verl
verl/HybridFlow: A Flexible and Efficient RL Post-Training Framework
☆22,654 · Updated this week
jennyzzt / dgm
Darwin Gödel Machine: Open-Ended Evolution of Self-Improving Agents
☆2,194 · Aug 13, 2025 · Updated 11 months ago
LeapLabTHU / FamO2O
Repository of "Train Once, Get a Family: State-Adaptive Balances for Offline-to-Online Reinforcement Learning" (NeurIPS 2023 Spotlight)
☆41 · Oct 30, 2023 · Updated 2 years ago
LeapLabTHU / LASNet
[NeurIPS 2022] Latency-aware Spatial-wise Dynamic Networks
☆25 · Aug 21, 2023 · Updated 2 years ago
LeapLabTHU / OVM3D-Det
☆55 · Jan 2, 2025 · Updated last year
mll-lab-nu / RAGEN
RAGEN leverages reinforcement learning to train LLM reasoning agents in interactive, stochastic environments.
☆2,756 · Updated this week
sail-sg / understand-r1-zero
Understanding R1-Zero-Like Training: A Critical Perspective
☆1,268 · Aug 27, 2025 · Updated 10 months ago
LeapLabTHU / AdaAFforPINNs
☆19 · Aug 9, 2023 · Updated 2 years ago
ypwang61 / One-Shot-RLVR
[NeurIPS 2025] Reinforcement Learning for Reasoning in Large Language Models with One Training Example
☆444 · Mar 11, 2026 · Updated 4 months ago
algorithmicsuperintelligence / openevolve
Open-source implementation of AlphaEvolve
☆6,794 · Jul 18, 2026 · Updated last week
sail-sg / VeriFree
Reinforcing General Reasoning without Verifiers
☆102 · Jun 24, 2025 · Updated last year
TIGER-AI-Lab / General-Reasoner
General Reasoner: Advancing LLM Reasoning Across All Domains [NeurIPS25]
☆229 · Nov 27, 2025 · Updated 7 months ago
RUC-NLPIR / WebThinker
[NeurIPS 2025] 🌐 WebThinker: Empowering Large Reasoning Models with Deep Research Capability
☆1,462 · Dec 8, 2025 · Updated 7 months ago
LeapLabTHU / CODA
CODA: Repurposing Continuous VAEs for Discrete Tokenization
☆37 · Jul 4, 2025 · Updated last year
hkust-nlp / simpleRL-reason
Simple RL training for reasoning
☆3,870 · Dec 23, 2025 · Updated 7 months ago
star9988rr / VIPScene
☆37 · Dec 2, 2025 · Updated 7 months ago
Jiayi-Pan / TinyZero
Minimal reproduction of DeepSeek R1-Zero
☆13,203 · Feb 27, 2026 · Updated 4 months ago
LeapLabTHU / AdaptiveNN-Jittor
☆33 · May 27, 2026 · Updated last month
LeapLabTHU / LAUDNet
[IEEE TPAMI] Latency-aware Unified Dynamic Networks for Efficient Image Recognition
☆53 · Mar 20, 2025 · Updated last year
Gen-Verse / ReasonFlux
[NeurIPS 2025 Spotlight] LLM post-training suite — featuring ReasonFlux, ReasonFlux-PRM, and ReasonFlux-Coder.
☆540 · Sep 27, 2025 · Updated 9 months ago
LeapLabTHU / AdaNAT
[ECCV 2024] AdaNAT: Exploring Adaptive Policy for Token-Based Image Generation
☆37 · Sep 12, 2024 · Updated last year
QwenLM / ParScale
Parallel Scaling Law for Language Model — Beyond Parameter and Inference Time Scaling
☆480 · May 17, 2025 · Updated last year
LeapLabTHU / AdaFocusV2
[CVPR 2022] Official repository of AdaFocusV2.
☆91 · Dec 15, 2024 · Updated last year
LeapLabTHU / MOSS
Official implementation of A Mixture of Surprises for Unsupervised Reinforcement Learning
☆23 · Nov 16, 2022 · Updated 3 years ago
open-thought / reasoning-gym
[NeurIPS 2025 Spotlight] Reasoning Environments for Reinforcement Learning with Verifiable Rewards
☆1,468 · Apr 17, 2026 · Updated 3 months ago