zxiangx / LC-R1Links
Code for paper: Optimizing Length Compression in Large Reasoning Models
☆27Updated 3 months ago
Alternatives and similar repositories for LC-R1
Users that are interested in LC-R1 are comparing it to the libraries listed below
Sorting:
- A Recipe for Building LLM Reasoners to Solve Complex Instructions☆29Updated 3 months ago
- Official repository for ToolScope: An Agentic Framework for Vision-Guided and Long-Horizon Tool Use☆29Updated 3 months ago
- ☆33Updated 6 months ago
- Official Implementation of "ToolSafe: Enhancing Tool Invocation Safety of LLM-based Agents via Proactive Step-level Guardrail and Feedbac…☆32Updated 2 weeks ago
- Feedback-Driven Tool-Use Improvements in Large Language Models via Automated Build Environments☆48Updated 3 weeks ago
- ☆16Updated last year
- Source code for our paper: "ARIA: Training Language Agents with Intention-Driven Reward Aggregation".☆25Updated 5 months ago
- ☆16Updated 7 months ago
- This repo contains code for the paper "Both Text and Images Leaked! A Systematic Analysis of Data Contamination in Multimodal LLM"☆17Updated 3 months ago
- Aligning Agentic World Models via Knowledgeable Experience Learning☆28Updated 2 weeks ago
- [NeurIPS'25 Spotlight] ARM: Adaptive Reasoning Model☆64Updated 3 months ago
- Scaling Long-Horizon LLM Agent via Context-Folding☆106Updated last week
- [ICLR 2026] Adaptive Social Learning via Mode Policy Optimization for Language Agents☆47Updated this week
- Reagent: Exploring Reasoning Reward Model for Agents☆31Updated this week
- ☆14Updated last year
- "FusionFactory: Fusing LLM Capabilities with Routing Data", Tao Feng, Haozhen Zhang, Zijie Lei, Pengrui Han, Mostofa Patwary, Mohammad Sh…☆19Updated last month
- ☆59Updated 3 weeks ago
- [NeurIPS 2025] Official implementation of "Reasoning Path Compression: Compressing Generation Trajectories for Efficient LLM Reasoning"☆30Updated 3 months ago
- RAG-RewardBench: Benchmarking Reward Models in Retrieval Augmented Generation for Preference Alignment☆16Updated last year
- R1-Searcher++: Incentivizing the Dynamic Knowledge Acquisition of LLMs via Reinforcement Learning☆71Updated 8 months ago
- SELF-GUIDE: Better Task-Specific Instruction Following via Self-Synthetic Finetuning. COLM 2024 Accepted Paper☆32Updated last year
- A comprehensive and efficient long-context model evaluation framework☆30Updated this week
- ☆31Updated 4 months ago
- JudgeLRM: Large Reasoning Models as a Judge☆40Updated last week
- TARS: MinMax Token-Adaptive Preference Strategy for Hallucination Reduction in MLLMs☆23Updated 4 months ago
- [NeurIPS 2025@FoRLM] R1-Compress: Long Chain-of-Thought Compression via Chunk Compression and Search☆17Updated 2 weeks ago
- Klear-Reasoner: Advancing Reasoning Capability via Gradient-Preserving Clipping Policy Optimization☆81Updated last month
- The official Github repository for paper "R^2AG: Incorporating Retrieval Information into Retrieval Augmented Generation" (EMNLP 2024 Fin…☆38Updated last year
- Official implementation of paper "Think-at-Hard: Selective Latent Iterations to Improve Reasoning Language Models"☆65Updated 3 weeks ago
- The demo, code and data of FollowRAG☆75Updated 7 months ago