zxiangx / LC-R1
Code for paper: Optimizing Length Compression in Large Reasoning Models
☆26 Updated 2 months ago
Alternatives and similar repositories for LC-R1
Users who are interested in LC-R1 are comparing it to the libraries listed below.
- A Recipe for Building LLM Reasoners to Solve Complex Instructions ☆29 Updated 2 months ago
- Feedback-Driven Tool-Use Improvements in Large Language Models via Automated Build Environments ☆45 Updated 3 months ago
- ☆32 Updated 5 months ago
- Source code for our paper: "ARIA: Training Language Agents with Intention-Driven Reward Aggregation". ☆25 Updated 4 months ago
- Scaling Agentic Environments Automatically. ☆38 Updated 2 weeks ago
- ☆30 Updated last month
- This repo contains code for the paper "Both Text and Images Leaked! A Systematic Analysis of Data Contamination in Multimodal LLM" ☆16 Updated 2 months ago
- ☆14 Updated last year
- AgenTracer: A Lightweight Failure Attributor for Agentic Systems ☆61 Updated last month
- Official repository for ToolScope: An Agentic Framework for Vision-Guided and Long-Horizon Tool Use ☆25 Updated last month
- ☆38 Updated 4 months ago
- ☆14 Updated 10 months ago
- Official Repository of RuleReasoner. ☆28 Updated 6 months ago
- ☆36 Updated 2 months ago
- Official Implementation of our paper "THOR: Tool-Integrated Hierarchical Optimization via RL for Mathematical Reasoning". ☆27 Updated 3 months ago
- Entropy-Driven GRPO with Guided Error Correction for Advantage Diversity ☆21 Updated 3 months ago
- [EMNLP 2025] LightThinker: Thinking Step-by-Step Compression ☆124 Updated 8 months ago
- ☆23 Updated last year
- ☆14 Updated 6 months ago
- Prompt-R1: Collaborative Automatic Prompting Framework via End-to-end Reinforcement Learning ☆36 Updated 2 weeks ago
- DCPO: Dynamic Adaptive Clipping for RL ☆45 Updated 2 months ago
- ☆24 Updated 4 months ago
- ☆31 Updated 4 months ago
- ☆53 Updated 2 months ago
- [ACL 2025] A Generalizable and Purely Unsupervised Self-Training Framework ☆70 Updated 6 months ago
- [NeurIPS'25 Spotlight] ARM: Adaptive Reasoning Model ☆62 Updated last month
- R1-Searcher++: Incentivizing the Dynamic Knowledge Acquisition of LLMs via Reinforcement Learning ☆66 Updated 6 months ago
- ☆16 Updated last year
- RAG-RewardBench: Benchmarking Reward Models in Retrieval Augmented Generation for Preference Alignment ☆16 Updated last year
- HierSearch: A Hierarchical Enterprise Deep Search Framework Integrating Local and Web Searches ☆35 Updated 2 months ago