GAIR-NLP / LIMILinks
LIMI: Less is More for Agency
☆69Updated this week
Alternatives and similar repositories for LIMI
Users that are interested in LIMI are comparing it to the libraries listed below
Sorting:
- The official repo for “Unleashing the Reasoning Potential of Pre-trained LLMs by Critique Fine-Tuning on One Problem” [EMNLP25]☆32Updated 3 weeks ago
- ☆70Updated 3 months ago
- ☆97Updated last month
- Efficient Agent Training for Computer Use☆132Updated 3 weeks ago
- DeepDive: Advancing Deep Search Agents with Knowledge Graphs and Multi-Turn RL☆137Updated last week
- ☆95Updated 2 weeks ago
- [EMNLP 2025] The official implementation for paper "Agentic-R1: Distilled Dual-Strategy Reasoning"☆99Updated 3 weeks ago
- The official repository for SkyLadder: Better and Faster Pretraining via Context Window Scheduling☆34Updated last month
- SSRL: Self-Search Reinforcement Learning☆144Updated last month
- ☆88Updated 4 months ago
- Data Synthesis for Deep Research Based on Semi-Structured Data☆158Updated this week
- Official Implementation of APB (ACL 2025 main Oral)☆31Updated 7 months ago
- [NeurIPS 2025 Spotlight] ReasonFlux-Coder: Open-Source LLM Coders with Co-Evolving Reinforcement Learning☆117Updated last week
- ☆51Updated 2 months ago
- The official repo of SynLogic: Synthesizing Verifiable Reasoning Data at Scale for Learning Logical Reasoning and Beyond☆165Updated 2 months ago
- Official Repo for RuleReasoner.☆26Updated 3 months ago
- ☆91Updated 4 months ago
- ☆82Updated 5 months ago
- A Recipe for Building LLM Reasoners to Solve Complex Instructions☆22Updated last month
- Ring is a reasoning MoE LLM provided and open-sourced by InclusionAI, derived from Ling.☆102Updated last month
- Esoteric Language Models☆99Updated 2 months ago
- Klear-Reasoner: Advancing Reasoning Capability via Gradient-Preserving Clipping Policy Optimization☆71Updated this week
- ☆50Updated 3 months ago
- MegaScience: Pushing the Frontiers of Post-Training Datasets for Science Reasoning☆100Updated 2 months ago
- [ACL 2025] Agentic Knowledgeable Self-awareness☆83Updated 3 months ago
- Repo for "Z1: Efficient Test-time Scaling with Code"☆64Updated 5 months ago
- The code for paper: Decoupled Planning and Execution: A Hierarchical Reasoning Framework for Deep Search☆58Updated 2 months ago
- AgentRewardBench: Evaluating Automatic Evaluations of Web Agent Trajectories☆37Updated last month
- Resa: Transparent Reasoning Models via SAEs☆41Updated this week
- Revisiting Mid-training in the Era of Reinforcement Learning Scaling☆176Updated 2 months ago