[NeurIPS 2025] Atom of Thoughts for Markov LLM Test-Time Scaling
☆643 · Feb 7, 2026 · Updated 3 weeks ago
Alternatives and similar repositories for atom
Users who are interested in atom are comparing it to the libraries listed below.
- Agentless🐱: an agentless approach to automatically solve software development problems ☆2,010 · Dec 22, 2024 · Updated last year
- Optimizing inference proxy for LLMs ☆3,342 · Jan 28, 2026 · Updated last month
- Pretraining and inference code for a large-scale depth-recurrent language model ☆864 · Dec 29, 2025 · Updated 2 months ago
- Official homepage for "Self-Harmonized Chain of Thought" (NAACL 2025) ☆92 · Jan 23, 2025 · Updated last year
- [ICML 2025] Satori: Reinforcement Learning with Chain-of-Action-Thought Enhances LLM Reasoning via Autoregressive Search ☆108 · Jun 3, 2025 · Updated 8 months ago
- ☆52 · Feb 12, 2025 · Updated last year
- [NeurIPS 2025 Spotlight] LLM post-training suite for long-CoT reasoning, PRM, and code generation, featuring ReasonFlux, ReasonFlux-PRM,… ☆521 · Sep 27, 2025 · Updated 5 months ago
- [COLM 2025] LIMO: Less is More for Reasoning ☆1,064 · Jul 30, 2025 · Updated 7 months ago
- ☆434 · Oct 4, 2024 · Updated last year
- Modified Beam Search with periodical restart ☆12 · Sep 12, 2024 · Updated last year
- Democratizing Reinforcement Learning for LLMs ☆5,167 · Updated this week
- Search-R1: An Efficient, Scalable RL Training Framework for Reasoning & Search Engine Calling interleaved LLM based on veRL ☆4,085 · Nov 13, 2025 · Updated 3 months ago
- Understanding R1-Zero-Like Training: A Critical Perspective ☆1,219 · Aug 27, 2025 · Updated 6 months ago
- ☆1,033 · Dec 17, 2024 · Updated last year
- A library for advanced large language model reasoning ☆2,333 · Jun 10, 2025 · Updated 8 months ago
- O1 Replication Journey ☆1,999 · Jan 14, 2025 · Updated last year
- Code for Research Project TLDR ☆25 · Jul 28, 2025 · Updated 7 months ago
- ☆215 · Feb 20, 2025 · Updated last year
- PyTorch implementation of Titans. ☆34 · Jan 20, 2025 · Updated last year
- System 2 Reasoning Link Collection ☆868 · Mar 16, 2025 · Updated 11 months ago
- Autonomously train research-agent LLMs on custom data using reinforcement learning and self-verification. ☆683 · Mar 22, 2025 · Updated 11 months ago
- Official Repo for Open-Reasoner-Zero ☆2,087 · Jun 2, 2025 · Updated 8 months ago
- Optimizing Causal LMs through GRPO with weighted reward functions and automated hyperparameter tuning using Optuna ☆59 · Oct 18, 2025 · Updated 4 months ago
- ☆1,344 · Nov 21, 2024 · Updated last year
- An Open Large Reasoning Model for Real-World Solutions ☆1,532 · Feb 13, 2026 · Updated 2 weeks ago
- 🦉 OWL: Optimized Workforce Learning for General Multi-Agent Assistance in Real-World Task Automation ☆19,129 · Updated this week
- ReSearch: Learning to Reason with Search for LLMs via Reinforcement Learning & ReCall: Learning to Reason with Tool Call for LLMs via Rei… ☆1,328 · May 16, 2025 · Updated 9 months ago
- Minimal reproduction of DeepSeek R1-Zero ☆12,853 · Updated this week
- Search-o1: Agentic Search-Enhanced Large Reasoning Models [EMNLP 2025] ☆1,172 · Nov 17, 2025 · Updated 3 months ago
- 🌾 OAT: A research-friendly framework for LLM online alignment, including reinforcement learning, preference learning, etc. ☆633 · Jan 29, 2026 · Updated last month
- AFlow & MathAI ☆19 · Feb 24, 2025 · Updated last year
- A series of technical reports on Slow Thinking with LLM ☆760 · Aug 13, 2025 · Updated 6 months ago
- RAGEN leverages reinforcement learning to train LLM reasoning agents in interactive, stochastic environments. ☆2,522 · Updated this week
- A recursive coding agent inspired by RLMs ☆142 · Updated this week
- OpenCoconut implements a latent reasoning paradigm where we generate thoughts before decoding. ☆176 · Jan 16, 2025 · Updated last year
- Official Repo for ICML 2024 paper "Executable Code Actions Elicit Better LLM Agents" by Xingyao Wang, Yangyi Chen, Lifan Yuan, Yizhe Zhan… ☆1,598 · May 23, 2024 · Updated last year
- [NeurIPS 2025] Simple extension on vLLM to help you speed up reasoning models without training. ☆222 · May 31, 2025 · Updated 9 months ago
- Entropy Based Sampling and Parallel CoT Decoding ☆3,434 · Nov 13, 2024 · Updated last year
- The code and weights for LoVA. LoVA is a novel model for Long-form Video-to-Audio generation. Based on the Diffusion Transformer (DiT) arc… ☆15 · Feb 27, 2025 · Updated last year