LeapLabTHU / Absolute-Zero-ReasonerLinks
Official Repository of Absolute Zero Reasoner
β1,542Updated 3 weeks ago
Alternatives and similar repositories for Absolute-Zero-Reasoner
Users that are interested in Absolute-Zero-Reasoner are comparing it to the libraries listed below
Sorting:
- Open-source implementation of AlphaEvolveβ2,676Updated this week
- π WebThinker: Empowering Large Reasoning Models with Deep Research Capabilityβ1,042Updated this week
- Darwin GΓΆdel Machine: Open-Ended Evolution of Self-Improving Agentsβ1,346Updated 2 weeks ago
- A Self-adaptation Frameworkπ that adapts LLMs for unseen tasks in real-time!β1,106Updated 4 months ago
- An Open-source RL System from ByteDance Seed and Tsinghua AIRβ1,364Updated last month
- Democratizing Reinforcement Learning for LLMsβ3,396Updated last month
- β1,148Updated last month
- RAGEN leverages reinforcement learning to train LLM reasoning agents in interactive, stochastic environments.β2,016Updated 3 weeks ago
- MiniMax-M1, the world's first open-weight, large-scale hybrid-attention reasoning model.β2,118Updated last week
- Continuous Thought Machines, because thought takes time and reasoning is a process.β1,026Updated 3 weeks ago
- Training Large Language Model to Reason in a Continuous Latent Spaceβ1,162Updated 5 months ago
- LIMO: Less is More for Reasoningβ963Updated 2 months ago
- Search-R1: An Efficient, Scalable RL Training Framework for Reasoning & Search Engine Calling interleaved LLM based on veRLβ2,656Updated this week
- Synthetic data curation for post-training and structured data extractionβ1,414Updated last week
- Official codebase for "SWE-RL: Advancing LLM Reasoning via Reinforcement Learning on Open Software Evolution"β551Updated 3 months ago
- Sky-T1: Train your own O1 preview model within $450β3,272Updated last month
- Pretraining code for a large-scale depth-recurrent language modelβ783Updated 2 weeks ago
- Verifiers for LLM Reinforcement Learningβ1,328Updated this week
- OpenAlpha_Evolve is an open-source Python framework inspired by the groundbreaking research on autonomous coding agents like DeepMind's Aβ¦β796Updated 3 weeks ago
- OctoTools: An agentic framework with extensible tools for complex reasoningβ1,197Updated this week
- Dream 7B, a large diffusion language modelβ774Updated last week
- Releases from OpenAI Preparednessβ783Updated 3 weeks ago
- Self-Adapting Language Modelsβ430Updated last week
- Official PyTorch implementation for "Large Language Diffusion Models"β2,378Updated last week
- The AI Scientist-v2: Workshop-Level Automated Scientific Discovery via Agentic Tree Searchβ1,354Updated last month
- β570Updated 2 months ago
- Atom of Thoughts for Markov LLM Test-Time Scalingβ574Updated last week
- Hypernetworks that adapt LLMs for specific benchmark tasks using only textual task description as the inputβ737Updated 2 weeks ago
- TTRL: Test-Time Reinforcement Learningβ650Updated 2 weeks ago
- Fully open data curation for reasoning modelsβ1,935Updated 3 weeks ago