shuhao02 / RouterDC
The code of RouterDC
☆64Updated 3 months ago
Alternatives and similar repositories for RouterDC
Users that are interested in RouterDC are comparing it to the libraries listed below
- ☆136Updated last month
- Official repository for paper: O1-Pruner: Length-Harmonizing Fine-Tuning for O1-Like Reasoning Pruning☆85Updated 4 months ago
- [ACL'25 Oral] What Happened in LLMs Layers when Trained for Fast vs. Slow Thinking: A Gradient Perspective☆70Updated 3 weeks ago
- [COLING'25] Exploring Concept Depth: How Large Language Models Acquire Knowledge at Different Layers?☆79Updated 5 months ago
- This repository collects awesome surveys, resources, and papers for lifelong learning LLM agents☆205Updated last month
- CoT-Valve: Length-Compressible Chain-of-Thought Tuning☆77Updated 5 months ago
- ☆122Updated last month
- L1: Controlling How Long A Reasoning Model Thinks With Reinforcement Learning☆228Updated 2 months ago
- ☆126Updated 2 months ago
- State-of-the-art Parameter-Efficient MoE Fine-tuning Method☆169Updated 10 months ago
- ☆113Updated 4 months ago
- 😎 A Survey of Efficient Reasoning for Large Reasoning Models: Language, Multimodality, and Beyond☆268Updated last week
- 📖 This is a repository for organizing papers, codes, and other resources related to Latent Reasoning.☆130Updated this week
- [ICLR 2025] Code and Data Repo for Paper "Latent Space Chain-of-Embedding Enables Output-free LLM Self-Evaluation"☆68Updated 6 months ago
- An Efficient LLM Fine-Tuning Factory Optimized for MoE PEFT☆106Updated 4 months ago
- ☆318Updated last month
- Test-time preference optimization (ICML 2025).☆147Updated 2 months ago
- ☆70Updated 3 months ago
- [ICLR 2025] SuperCorrect: Advancing Small LLM Reasoning with Thought Template Distillation and Self-Correction☆74Updated 3 months ago
- [NeurIPS 2024] A Novel Rank-Based Metric for Evaluating Large Language Models☆49Updated last month
- [ICML 2024 Oral] A framework for society simulation that supports complex simulation, for example: multi-scene.☆76Updated 11 months ago
- Tool-Star: Empowering LLM-brained Multi-Tool Reasoner via Reinforcement Learning☆201Updated last week
- This repository contains a regularly updated paper list for LLMs-reasoning-in-latent-space.☆134Updated this week
- MPO: Boosting LLM Agents with Meta Plan Optimization☆63Updated 4 months ago
- ☆145Updated 11 months ago
- Harnessing the Reasoning Economy: A Survey of Efficient Reasoning for Large Language Models☆110Updated last month
- The Entropy Mechanism of Reinforcement Learning for Large Language Model Reasoning.☆251Updated last week
- Benchmark and research code for the paper SWEET-RL: Training Multi-Turn LLM Agents on Collaborative Reasoning Tasks☆226Updated 2 months ago
- [arXiv 2025] Efficient Reasoning Models: A Survey☆227Updated this week
- TokenSkip: Controllable Chain-of-Thought Compression in LLMs☆166Updated 2 weeks ago