withinmiaov / A-Survey-on-Mixture-of-Experts-in-LLMsLinks

[TKDE'25] The official GitHub page for the survey paper "A Survey on Mixture of Experts in Large Language Models".

☆440

Alternatives and similar repositories for A-Survey-on-Mixture-of-Experts-in-LLMs

Users that are interested in A-Survey-on-Mixture-of-Experts-in-LLMs are comparing it to the libraries listed below

Sorting:

UbiquitousLearning / Efficient_Foundation_Model_Survey
Survey Paper List - Efficient LLM and Foundation Models
☆258Updated last year
fscdc / Awesome-Efficient-Reasoning-Models
[TMLR 2025] Efficient Reasoning Models: A Survey
☆272Updated last week
hemingkx / Awesome-Efficient-Reasoning
Paper list for Efficient Reasoning.
☆709Updated this week
Eclipsess / Awesome-Efficient-Reasoning-LLMs
[TMLR 2025] Stop Overthinking: A Survey on Efficient Reasoning for Large Language Models
☆666Updated last week
LINs-lab / DynMoE
[ICLR 2025] Dynamic Mixture of Experts: An Auto-Tuning Approach for Efficient Transformer Models
☆137Updated 3 months ago
liyunqianggyn / Awesome-LLMs-Pruning
Awesome LLM pruning papers all-in-one repository with integrating all useful resources and insights.
☆129Updated 2 months ago
Clin0212 / HydraLoRA
[NeurIPS'24 Oral] HydraLoRA: An Asymmetric LoRA Architecture for Efficient Fine-Tuning
☆227Updated 10 months ago
pprp / Awesome-LLM-Prune
Awesome list for LLM pruning.
☆267Updated 2 weeks ago
codecaution / Awesome-Mixture-of-Experts-Papers
A curated reading list of research in Mixture-of-Experts(MoE).
☆648Updated 11 months ago
MuLabPKU / TransMLA
TransMLA: Multi-Head Latent Attention Is All You Need (NeurIPS 2025 Spotlight)
☆393Updated last month
XiaoYee / Awesome_Efficient_LRM_Reasoning
😎 A Survey of Efficient Reasoning for Large Reasoning Models: Language, Multimodality, Agent, and Beyond
☆309Updated last week
NVlabs / Fast-dLLM
Official implementation of "Fast-dLLM: Training-free Acceleration of Diffusion LLM by Enabling KV Cache and Parallel Decoding"
☆604Updated this week
swordlidev / Efficient-Multimodal-LLMs-Survey
Efficient Multimodal Large Language Models: A Survey
☆373Updated 5 months ago
Ablustrund / LoRAMoE
LoRAMoE: Revolutionizing Mixture of Experts for Maintaining World Knowledge in Language Model Alignment
☆380Updated last year
TUDB-Labs / MixLoRA
State-of-the-art Parameter-Efficient MoE Fine-tuning Method
☆193Updated last year
pprp / Awesome-Efficient-MoE
Efficient Mixture of Experts for LLM Paper List
☆140Updated last month
hemingkx / TokenSkip
[EMNLP 2025] TokenSkip: Controllable Chain-of-Thought Compression in LLMs
☆186Updated 4 months ago
Outsider565 / LoRA-GA
☆213Updated last year
XueFuzhao / awesome-mixture-of-experts
A collection of AWESOME things about mixture-of-experts
☆1,217Updated 10 months ago
GCYZSL / MoLA
☆161Updated last year
ThreeSR / Awesome-Inference-Time-Scaling
Paper List of Inference/Test Time Scaling/Computing
☆319Updated 2 months ago
EnnengYang / Awesome-Model-Merging-Methods-Theories-Applications
Model Merging in LLMs, MLLMs, and Beyond: Methods, Theories, Applications and Opportunities. arXiv:2408.07666.
☆572Updated last week
October2001 / Awesome-KV-Cache-Compression
📰 Must-read papers on KV Cache Compression (constantly updating 🤗).
☆566Updated 3 weeks ago
yongliang-wu / DFT
[Preprint] On the Generalization of SFT: A Reinforcement Learning Perspective with Reward Rectification.
☆481Updated last week
ZJU-LLMs / Awesome-LoRAs
☆223Updated last year
tiingweii-shii / Awesome-Resource-Efficient-LLM-Papers
a curated list of high-quality papers on resource-efficient LLMs 🌱
☆141Updated 7 months ago
Chongjie-Si / Subspace-Tuning
A generalized framework for subspace tuning methods in parameter efficient fine-tuning.
☆157Updated 4 months ago
zwxandy / Awesome-Efficient-CoT-Reasoning-Summary
🔥 How to efficiently and effectively compress the CoTs or directly generate concise CoTs during inference while maintaining the reasonin…
☆63Updated 5 months ago
ZO-Bench / ZO-LLM
[ICML‘24] Official code for the paper "Revisiting Zeroth-Order Optimization for Memory-Efficient LLM Fine-Tuning: A Benchmark ".
☆111Updated 3 months ago
zitian-gao / one-shot-em
One-shot Entropy Minimization
☆186Updated 4 months ago