zxiangx / LC-R1Links
Code for paper: Optimizing Length Compression in Large Reasoning Models
☆25Updated 2 months ago
Alternatives and similar repositories for LC-R1
Users that are interested in LC-R1 are comparing it to the libraries listed below
Sorting:
- A Recipe for Building LLM Reasoners to Solve Complex Instructions☆22Updated last month
- Source code for our paper: "ARIA: Training Language Agents with Intention-Driven Reward Aggregation".☆22Updated last month
- Feedback-Driven Tool-Use Improvements in Large Language Models via Automated Build Environments☆36Updated this week
- Code for paper "Table-R1: Inference-Time Scaling for Table Reasoning"☆25Updated 3 months ago
- ☆24Updated 4 months ago
- DCPO: Dynamic Adaptive Clipping for RL☆27Updated last week
- ☆15Updated 3 months ago
- ☆30Updated 2 months ago
- ARM: Adaptive Reasoning Model☆47Updated last month
- ☆14Updated 9 months ago
- Official Repo for SvS: A Self-play with Variational Problem Synthesis strategy for RLVR training☆36Updated 3 weeks ago
- ☆25Updated last week
- ☆21Updated 3 months ago
- [ACL 2025 Findings] Official implementation of the paper "Unveiling the Key Factors for Distilling Chain-of-Thought Reasoning".☆19Updated 6 months ago
- R1-Searcher++: Incentivizing the Dynamic Knowledge Acquisition of LLMs via Reinforcement Learning☆61Updated 3 months ago
- Entropy-Driven GRPO with Guided Error Correction for Advantage Diversity☆18Updated 3 weeks ago
- ☆34Updated 3 weeks ago
- ☆36Updated last month
- [ICML'25] Official code of paper "Fast Large Language Model Collaborative Decoding via Speculation"☆25Updated 2 months ago
- [arxiv: 2505.02156] Adaptive Thinking via Mode Policy Optimization for Social Language Agents☆42Updated 2 months ago
- JudgeLRM: Large Reasoning Models as a Judge☆38Updated this week
- ☆54Updated last month
- [ACL 2025] A Generalizable and Purely Unsupervised Self-Training Framework☆71Updated 3 months ago
- ☆46Updated 2 months ago
- ☆29Updated last month
- This repo contains code for the paper "Both Text and Images Leaked! A Systematic Analysis of Data Contamination in Multimodal LLM"☆16Updated last month
- Official Repo for RuleReasoner.☆26Updated 3 months ago
- The official Github repository for paper "R^2AG: Incorporating Retrieval Information into Retrieval Augmented Generation" (EMNLP 2024 Fin…☆36Updated 9 months ago
- Official code of *Virgo: A Preliminary Exploration on Reproducing o1-like MLLM*☆107Updated 3 months ago
- ☆47Updated 7 months ago