[NeurIPS 2025] Atom of Thoughts for Markov LLM Test-Time Scaling
☆658Feb 7, 2026Updated last month
Alternatives and similar repositories for atom
Users that are interested in atom are comparing it to the libraries listed below
Sorting:
- The code and weight for LoVA. LoVA is a novel model for Long-form Video-to-Audio generation. Based on the Diffusion Transformer (DiT) arc…☆15Feb 27, 2025Updated last year
- Agentless🐱: an agentless approach to automatically solve software development problems☆2,019Dec 22, 2024Updated last year
- Optimizing inference proxy for LLMs☆3,381Jan 28, 2026Updated last month
- Pretraining and inference code for a large-scale depth-recurrent language model☆865Dec 29, 2025Updated 2 months ago
- Code for Research Project TLDR☆25Jul 28, 2025Updated 7 months ago
- AFlow & MathAI☆19Feb 24, 2025Updated last year
- Official homepage for "Self-Harmonized Chain of Thought" (NAACL 2025)☆92Jan 23, 2025Updated last year
- [ICML 2025] Satori: Reinforcement Learning with Chain-of-Action-Thought Enhances LLM Reasoning via Autoregressive Search☆109Jun 3, 2025Updated 9 months ago
- This is a unified platform for implementing and evaluating test-time reasoning mechanisms in Large Language Models (LLMs).☆19Jan 16, 2025Updated last year
- [NeurIPS 2025 Spotlight] LLM post-training suite — featuring ReasonFlux, ReasonFlux-PRM, and ReasonFlux-Coder.☆524Sep 27, 2025Updated 5 months ago
- ☆434Oct 4, 2024Updated last year
- Automated Design of Agentic Systems☆10Sep 7, 2024Updated last year
- [COLM 2025] LIMO: Less is More for Reasoning☆1,065Jul 30, 2025Updated 7 months ago
- Democratizing Reinforcement Learning for LLMs☆5,259Updated this week
- Official repo of paper LM2☆47Feb 13, 2025Updated last year
- Search-R1: An Efficient, Scalable RL Training Framework for Reasoning & Search Engine Calling interleaved LLM based on veRL☆4,261Nov 13, 2025Updated 4 months ago
- Understanding R1-Zero-Like Training: A Critical Perspective☆1,232Aug 27, 2025Updated 6 months ago
- System 2 Reasoning Link Collection☆868Mar 16, 2025Updated last year
- O1 Replication Journey☆1,999Jan 14, 2025Updated last year
- ☆52Feb 12, 2025Updated last year
- Autonomously train research-agent LLMs on custom data using reinforcement learning and self-verification.☆686Mar 22, 2025Updated 11 months ago
- Modified Beam Search with periodical restart☆12Sep 12, 2024Updated last year
- SWE-Debate: Competitive Multi-Agent Debate for Software Issue Resolution☆25Nov 11, 2025Updated 4 months ago
- 🦉 OWL: Optimized Workforce Learning for General Multi-Agent Assistance in Real-World Task Automation☆19,217Updated this week
- Minimal reproduction of DeepSeek R1-Zero☆12,963Feb 27, 2026Updated 3 weeks ago
- ReSearch: Learning to Reason with Search for LLMs via Reinforcement Learning & ReCall: Learning to Reason with Tool Call for LLMs via Rei…☆1,349May 16, 2025Updated 10 months ago
- PyTorch implementation of Titans.☆35Jan 20, 2025Updated last year
- A series of technical report on Slow Thinking with LLM☆761Aug 13, 2025Updated 7 months ago
- ☆218Feb 20, 2025Updated last year
- ☆22May 3, 2025Updated 10 months ago
- An Open Large Reasoning Model for Real-World Solutions☆1,539Feb 13, 2026Updated last month
- Optimizing Causal LMs through GRPO with weighted reward functions and automated hyperparameter tuning using Optuna☆59Oct 18, 2025Updated 5 months ago
- A library for advanced large language model reasoning☆2,338Jun 10, 2025Updated 9 months ago
- ☆1,033Dec 17, 2024Updated last year
- Multi-Agent Verification: Scaling Test-Time Compute with Multiple Verifiers☆28Mar 1, 2025Updated last year
- Recipes to scale inference-time compute of open models☆1,130May 22, 2025Updated 9 months ago
- Keep searching, reading webpages, reasoning until it finds the answer (or exceeding the token budget)☆5,123Dec 13, 2025Updated 3 months ago
- ☆1,347Nov 21, 2024Updated last year
- Official Repo for Open-Reasoner-Zero☆2,086Jun 2, 2025Updated 9 months ago