shuhao02 / RouterDCLinks
The code of RouterDC
☆61Updated last month
Alternatives and similar repositories for RouterDC
Users that are interested in RouterDC are comparing it to the libraries listed below
Sorting:
- What Happened in LLMs Layers when Trained for Fast vs. Slow Thinking: A Gradient Perspective☆64Updated 3 months ago
- ☆105Updated 2 months ago
- Official repository for paper: O1-Pruner: Length-Harmonizing Fine-Tuning for O1-Like Reasoning Pruning☆79Updated 3 months ago
- [ICLR 2025] Code and Data Repo for Paper "Latent Space Chain-of-Embedding Enables Output-free LLM Self-Evaluation"☆56Updated 5 months ago
- ☆107Updated 2 weeks ago
- ☆62Updated 2 months ago
- A Sober Look at Language Model Reasoning☆52Updated last week
- 🔥🔥🔥 ICLR 2025 Oral. Automating Agentic Workflow Generation.☆99Updated last week
- [COLING'25] Exploring Concept Depth: How Large Language Models Acquire Knowledge at Different Layers?☆78Updated 4 months ago
- ☆116Updated last month
- L1: Controlling How Long A Reasoning Model Thinks With Reinforcement Learning☆213Updated 3 weeks ago
- CoT-Valve: Length-Compressible Chain-of-Thought Tuning☆69Updated 3 months ago
- [ICML 2024 Oral] Official code repository for MLLM-as-a-Judge.☆68Updated 3 months ago
- [NeurIPS 2024] GITA: Graph to Image-Text Integration for Vision-Language Graph Reasoning☆49Updated 6 months ago
- [NeurIPS 2024] The official implementation of paper: Chain of Preference Optimization: Improving Chain-of-Thought Reasoning in LLMs.☆123Updated 2 months ago
- [ICML 2025] "SepLLM: Accelerate Large Language Models by Compressing One Segment into One Separator"☆73Updated 5 months ago
- SIFT: Grounding LLM Reasoning in Contexts via Stickers☆56Updated 2 months ago
- ☆89Updated last week
- State-of-the-art Parameter-Efficient MoE Fine-tuning Method☆162Updated 9 months ago
- ☆60Updated 2 weeks ago
- ☆131Updated 2 weeks ago
- [ICLR 2025] SuperCorrect: Advancing Small LLM Reasoning with Thought Template Distillation and Self-Correction☆70Updated 2 months ago
- TokenSkip: Controllable Chain-of-Thought Compression in LLMs☆145Updated 2 months ago
- ☆57Updated this week
- ☆27Updated last month
- An Efficient LLM Fine-Tuning Factory Optimized for MoE PEFT☆99Updated 2 months ago
- [ICML'25] Multi-agent Architecture Search via Agentic Supernet☆58Updated last month
- ☆52Updated last week
- [NeurIPS 2024] A Novel Rank-Based Metric for Evaluating Large Language Models☆46Updated last week
- ☆210Updated last week