CASE-Lab-UMD / Router-Tuning-Mixture-of-DepthsLinks
The open-source Mixture of Depths code and the official implementation of the paper "Router-Tuning: A Simple and Effective Approach for Enabling Dynamic Depth in Transformers. (EMNLP 2025)"
☆26Updated 2 months ago
Alternatives and similar repositories for Router-Tuning-Mixture-of-Depths
Users that are interested in Router-Tuning-Mixture-of-Depths are comparing it to the libraries listed below
Sorting:
- [ICML'25] Our study systematically investigates massive values in LLMs' attention mechanisms. First, we observe massive values are concen…☆86Updated 6 months ago
- [ICLR 2025] SWIFT: On-the-Fly Self-Speculative Decoding for LLM Inference Acceleration☆61Updated 10 months ago
- [ICLR‘24 Spotlight] Code for the paper "Merge, Then Compress: Demystify Efficient SMoE with Hints from Its Routing Policy"☆102Updated 6 months ago
- CoT-Valve: Length-Compressible Chain-of-Thought Tuning