LINs-lab / DynMoELinks

[ICLR 2025] Dynamic Mixture of Experts: An Auto-Tuning Approach for Efficient Transformer Models

☆121

Alternatives and similar repositories for DynMoE

Users that are interested in DynMoE are comparing it to the libraries listed below

Sorting:

fscdc / Awesome-Efficient-Reasoning-Models
[arXiv 2025] Efficient Reasoning Models: A Survey
☆247Updated 2 weeks ago
horseee / CoT-Valve
CoT-Valve: Length-Compressible Chain-of-Thought Tuning
☆81Updated 5 months ago
OpenSparseLLMs / LLaMA-MoE-v2
🚀 LLaMA-MoE v2: Exploring Sparsity of LLaMA from Perspective of Mixture-of-Experts with Post-Training
☆86Updated 8 months ago
zitian-gao / one-shot-em
One-shot Entropy Minimization
☆172Updated last month
Chongjie-Si / Subspace-Tuning
A generalized framework for subspace tuning methods in parameter efficient fine-tuning.
☆153Updated last month
wutaiqiang / MoSLoRA
☆111Updated last year
SUSTechBruce / LOOK-M
[EMNLP 2024 Findings🔥] Official implementation of ": LOOK-M: Look-Once Optimization in KV Cache for Efficient Multimodal Long-Context In…
☆98Updated 8 months ago
MingyuJ666 / Rope_with_LLM
[ICML'25] Our study systematically investigates massive values in LLMs' attention mechanisms. First, we observe massive values are concen…
☆75Updated last month
Clin0212 / HydraLoRA
[NeurIPS'24 Oral] HydraLoRA: An Asymmetric LoRA Architecture for Efficient Fine-Tuning
☆220Updated 8 months ago
hemingkx / TokenSkip
TokenSkip: Controllable Chain-of-Thought Compression in LLMs
☆171Updated last month
Joshua-Ren / Learning_dynamics_LLM
☆155Updated 2 months ago
Dereck0602 / Awesome_Test_Time_LLMs
☆117Updated 4 months ago
TUDB-Labs / MixLoRA
State-of-the-art Parameter-Efficient MoE Fine-tuning Method
☆175Updated 11 months ago
lzhxmu / VTW
Code release for VTW (AAAI 2025 Oral)
☆47Updated 2 weeks ago
OpenSparseLLMs / Linear-MoE
☆113Updated 2 months ago
GATECH-EIC / ACT
[ICML 2024] Unveiling and Harnessing Hidden Attention Sinks: Enhancing Large Language Models without Training through Attention Calibrati…
☆40Updated last year
DavidFanzz / SCMoE
☆26Updated last year
ThreeSR / Awesome-Inference-Time-Scaling
Paper List of Inference/Test Time Scaling/Computing
☆286Updated last month
TUDB-Labs / MoE-PEFT
An Efficient LLM Fine-Tuning Factory Optimized for MoE PEFT
☆108Updated 4 months ago
JieShibo / MoLE
[ICML 2025 Oral] Mixture of Lookup Experts
☆45Updated 2 months ago
OpenSparseLLMs / MoM
☆95Updated 3 months ago
GCYZSL / MoLA
☆148Updated last year
lzhxmu / CPPO
CPPO: Accelerating the Training of Group Relative Policy Optimization-Based Reasoning Models
☆145Updated 2 months ago
ruixin31 / Spurious_Rewards
☆322Updated last week
pprp / Awesome-Efficient-MoE
Efficient Mixture of Experts for LLM Paper List
☆87Updated 7 months ago
waltonfuture / Diff-eRank
[NeurIPS 2024] A Novel Rank-Based Metric for Evaluating Large Language Models
☆51Updated 2 months ago
testtimescaling / testtimescaling.github.io
"what, how, where, and how well? a survey on test-time scaling in large language models" repository
☆56Updated this week
maomaocun / dLLM-cache
Official PyTorch implementation of the paper "dLLM-Cache: Accelerating Diffusion Large Language Models with Adaptive Caching" (dLLM-Cache…
☆132Updated this week
xuyang-liu16 / Awesome-Token-level-Model-Compression
📚 Collection of token-level model compression resources.
☆144Updated last month
yczhou001 / Awesome-Diffusion-LLM
paper list, tutorial, and nano code snippet for Diffusion Large Language Models.
☆96Updated last month