zxiangx / LC-R1Links
Code for paper: Optimizing Length Compression in Large Reasoning Models
☆27Updated 3 months ago
Alternatives and similar repositories for LC-R1
Users that are interested in LC-R1 are comparing it to the libraries listed below
Sorting:
- A Recipe for Building LLM Reasoners to Solve Complex Instructions☆29Updated 3 months ago
- Source code for our paper: "ARIA: Training Language Agents with Intention-Driven Reward Aggregation".☆25Updated 5 months ago
- Official Implementation of "ToolSafe: Enhancing Tool Invocation Safety of LLM-based Agents via Proactive Step-level Guardrail and Feedbac…☆26Updated last week
- Official repository for ToolScope: An Agentic Framework for Vision-Guided and Long-Horizon Tool Use☆27Updated 2 months ago
- ☆16Updated 7 months ago
- ☆33Updated 6 months ago
- Feedback-Driven Tool-Use Improvements in Large Language Models via Automated Build Environments☆48Updated 3 weeks ago
- Scaling Long-Horizon LLM Agent via Context-Folding☆101Updated this week
- ☆59Updated 2 weeks ago
- AgenTracer: A Lightweight Failure Attributor for Agentic Systems☆72Updated 2 months ago
- [ICLR 2026] RuleReasoner: Reinforced Rule-based Reasoning via Domain-aware Dynamic Sampling☆30Updated 7 months ago
- R1-Searcher++: Incentivizing the Dynamic Knowledge Acquisition of LLMs via Reinforcement Learning☆71Updated 8 months ago
- ☆23Updated last year
- From Word to World: Can Large Language Models be Implicit Text-based World Models?☆36Updated last month
- [ACL 2025 Findings] Official implementation of the paper "Unveiling the Key Factors for Distilling Chain-of-Thought Reasoning".☆21Updated 11 months ago
- Aligning Agentic World Models via Knowledgeable Experience Learning☆23Updated last week
- JudgeLRM: Large Reasoning Models as a Judge☆40Updated last month
- ☆36Updated 3 months ago
- ☆14Updated last year
- "FusionFactory: Fusing LLM Capabilities with Routing Data", Tao Feng, Haozhen Zhang, Zijie Lei, Pengrui Han, Mostofa Patwary, Mohammad Sh…☆19Updated last month
- [ICLR 2026] Adaptive Thinking via Mode Policy Optimization for Social Language Agents☆47Updated 7 months ago
- [ACL 2025] A Generalizable and Purely Unsupervised Self-Training Framework☆71Updated 8 months ago
- ☆33Updated 2 months ago
- [NeurIPS'25 Spotlight] ARM: Adaptive Reasoning Model☆64Updated 3 months ago
- ☆46Updated 3 months ago
- [EMNLP 2025] LightThinker: Thinking Step-by-Step Compression☆131Updated 9 months ago
- ☆50Updated 11 months ago
- The demo, code and data of FollowRAG☆75Updated 7 months ago
- DCPO: Dynamic Adaptive Clipping for RL☆45Updated last month
- ☆43Updated 5 months ago