zxiangx / LC-R1Links
Code for paper: Optimizing Length Compression in Large Reasoning Models
☆23Updated 2 months ago
Alternatives and similar repositories for LC-R1
Users that are interested in LC-R1 are comparing it to the libraries listed below
Sorting:
- A Recipe for Building LLM Reasoners to Solve Complex Instructions☆21Updated last month
- ☆30Updated last month
- Source code for our paper: "ARIA: Training Language Agents with Intention-Driven Reward Aggregation".☆20Updated 3 weeks ago
- Feedback-Driven Tool-Use Improvements in Large Language Models via Automated Build Environments☆29Updated 2 weeks ago
- [ACL 2025] A Generalizable and Purely Unsupervised Self-Training Framework☆69Updated 3 months ago
- Code for paper "Table-R1: Inference-Time Scaling for Table Reasoning"☆24Updated 2 months ago
- ☆35Updated last month
- ARM: Adaptive Reasoning Model☆47Updated last month
- ☆54Updated 2 weeks ago
- ☆24Updated 3 months ago
- ☆40Updated 6 months ago
- Code for paper: Long cOntext aliGnment via efficient preference Optimization☆13Updated 6 months ago
- ☆21Updated 2 months ago
- ☆22Updated last week
- Code for Heima☆52Updated 4 months ago
- Official Repo for RuleReasoner.☆26Updated 2 months ago
- ☆14Updated 8 months ago
- ☆34Updated 2 weeks ago
- [arxiv: 2505.02156] Adaptive Thinking via Mode Policy Optimization for Social Language Agents☆42Updated 2 months ago
- ☆47Updated 6 months ago
- Codes for our paper "AgentMonitor: A Plug-and-Play Framework for Predictive and Secure Multi-Agent Systems"☆10Updated 8 months ago
- [ACL 2025 Findings] Official implementation of the paper "Unveiling the Key Factors for Distilling Chain-of-Thought Reasoning".☆19Updated 6 months ago
- [EMNLP 2025] LightThinker: Thinking Step-by-Step Compression☆83Updated 4 months ago
- Think or Not? Selective Reasoning via Reinforcement Learning for Vision-Language Models☆40Updated 2 weeks ago
- ☆15Updated 2 months ago
- This repo contains code for the paper "Both Text and Images Leaked! A Systematic Analysis of Data Contamination in Multimodal LLM"☆16Updated 3 weeks ago
- ☆23Updated 8 months ago
- JudgeLRM: Large Reasoning Models as a Judge☆35Updated 4 months ago
- AgentRewardBench: Evaluating Automatic Evaluations of Web Agent Trajectories☆35Updated 3 weeks ago
- [ICLR 2025] Weighted-Reward Preference Optimization for Implicit Model Fusion☆13Updated 5 months ago