TUDB-Labs / MoE-PEFTLinks

An Efficient LLM Fine-Tuning Factory Optimized for MoE PEFT

☆108

Alternatives and similar repositories for MoE-PEFT

Users that are interested in MoE-PEFT are comparing it to the libraries listed below

Sorting:

TUDB-Labs / MixLoRA
State-of-the-art Parameter-Efficient MoE Fine-tuning Method
☆175Updated 11 months ago
GCYZSL / MoLA
☆148Updated last year
Dereck0602 / Awesome_Test_Time_LLMs
☆117Updated 4 months ago
hemingkx / TokenSkip
TokenSkip: Controllable Chain-of-Thought Compression in LLMs
☆171Updated last month
horseee / CoT-Valve
CoT-Valve: Length-Compressible Chain-of-Thought Tuning
☆81Updated 5 months ago
hahahawu / Long-to-Short-via-Model-Merging
Model merging is a highly efficient approach for long-to-short reasoning.
☆77Updated 2 months ago
Joshua-Ren / Learning_dynamics_LLM
☆155Updated 2 months ago
EIT-NLP / Awesome-Latent-CoT
This repository contains a regularly updated paper list for LLMs-reasoning-in-latent-space.
☆142Updated 2 weeks ago
LINs-lab / DynMoE
[ICLR 2025] Dynamic Mixture of Experts: An Auto-Tuning Approach for Efficient Transformer Models
☆121Updated last month
GeniusHTX / TALE
☆126Updated 2 months ago
bethgelab / sober-reasoning
A Sober Look at Language Model Reasoning
☆81Updated last month
MingyuJ666 / Rope_with_LLM
[ICML'25] Our study systematically investigates massive values in LLMs' attention mechanisms. First, we observe massive values are concen…
☆77Updated last month
sail-sg / sdft
[ACL 2024] The official codebase for the paper "Self-Distillation Bridges Distribution Gap in Language Model Fine-tuning".
☆125Updated 9 months ago
jianghoucheng / AlphaEdit
AlphaEdit: Null-Space Constrained Knowledge Editing for Language Models, ICLR 2025 (Outstanding Paper)
☆293Updated last month
THU-KEG / AdaptThink
☆140Updated 2 months ago
osehmathias / lisa
LISA: Layerwise Importance Sampling for Memory-Efficient Large Language Model Fine-Tuning
☆33Updated last year
Alsace08 / Chain-of-Embedding
[ICLR 2025] Code and Data Repo for Paper "Latent Space Chain-of-Embedding Enables Output-free LLM Self-Evaluation"
☆71Updated 7 months ago
liuqidong07 / MOELoRA-peft
[SIGIR'24] The official implementation code of MOELoRA.
☆175Updated last year
OpenSparseLLMs / LLaMA-MoE-v2
🚀 LLaMA-MoE v2: Exploring Sparsity of LLaMA from Perspective of Mixture-of-Experts with Post-Training
☆86Updated 8 months ago
StarDewXXX / O1-Pruner
Official repository for paper: O1-Pruner: Length-Harmonizing Fine-Tuning for O1-Like Reasoning Pruning
☆86Updated 5 months ago
getao / icae
The repo for In-context Autoencoder
☆133Updated last year
yafuly / TPO
Test-time preferenece optimization (ICML 2025).
☆155Updated 3 months ago
OpenSparseLLMs / MoM
☆95Updated 3 months ago
sail-sg / Attention-Sink
[ICLR 2025] When Attention Sink Emerges in Language Models: An Empirical View (Spotlight)
☆107Updated last month
yule-BUAA / MergeLLM
Codes for Merging Large Language Models
☆33Updated last year
alisawuffles / proxy-tuning
Code associated with Tuning Language Models by Proxy (Liu et al., 2024)
☆114Updated last year
ZhenweiAn / Dynamic_MoE
Inference Code for Paper "Harder Tasks Need More Experts: Dynamic Routing in MoE Models"
☆60Updated last year
Chaos96 / fourierft
☆147Updated 11 months ago
MingLiiii / Layer_Gradient
[ACL'25 Oral] What Happened in LLMs Layers when Trained for Fast vs. Slow Thinking: A Gradient Perspective
☆71Updated last month
Clin0212 / HydraLoRA
[NeurIPS'24 Oral] HydraLoRA: An Asymmetric LoRA Architecture for Efficient Fine-Tuning
☆220Updated 8 months ago