maidacundo / MoE-LoRALinks

Adapt an LLM model to a Mixture-of-Experts model using Parameter Efficient finetuning (LoRA), injecting the LoRAs in the FFN.

☆67

Alternatives and similar repositories for MoE-LoRA

Users that are interested in MoE-LoRA are comparing it to the libraries listed below

Sorting:

liuqidong07 / MOELoRA-peft
[SIGIR'24] The official implementation code of MOELoRA.
☆184Updated last year
sail-sg / sdft
[ACL 2024] The official codebase for the paper "Self-Distillation Bridges Distribution Gap in Language Model Fine-tuning".
☆136Updated last year
GCYZSL / MoLA
☆168Updated last year
TemporaryLoRA / Temp-LoRA
☆119Updated last year
ZhenweiAn / Dynamic_MoE
Inference Code for Paper "Harder Tasks Need More Experts: Dynamic Routing in MoE Models"
☆66Updated last year
pldlgb / nuggets
☆86Updated last year
cmnfriend / O-LoRA
☆190Updated last year
RUCKBReasoning / CoT-based-Synthesizer
Official code implementation for the ACL 2025 paper: 'CoT-based Synthesizer: Enhancing LLM Performance through Answer Synthesis'
☆31Updated 6 months ago
weiyifan1023 / Neeko
Code and Data for EMNLP 2024 Paper "Neeko: Leveraging Dynamic LoRA for Efficient Multi-Character Role-Playing Agent"
☆136Updated 4 months ago
Ablustrund / LoRAMoE
LoRAMoE: Revolutionizing Mixture of Experts for Maintaining World Knowledge in Language Model Alignment
☆385Updated last year
wutaiqiang / MoSLoRA
☆123Updated last year
lzhxmu / CPPO
CPPO: Accelerating the Training of Group Relative Policy Optimization-Based Reasoning Models (NeurIPS 2025)
☆167Updated 2 weeks ago
THU-KEG / AdaptThink
☆165Updated last month
fzp0424 / MT-R1-Zero
[EMNLP'25] Code for paper "MT-R1-Zero: Advancing LLM-based Machine Translation via R1-Zero-like Reinforcement Learning"
☆61Updated 7 months ago
tianyi-lab / Superfiltering
[ACL'24] Superfiltering: Weak-to-Strong Data Filtering for Fast Instruction-Tuning
☆182Updated 4 months ago
SkyworkAI / Skywork-Reward-V2
Scaling Preference Data Curation via Human-AI Synergy
☆128Updated 4 months ago
yafuly / TPO
Test-time preferenece optimization (ICML 2025).
☆169Updated 6 months ago
nishiwen1214 / Benchmark-leakage-detection
Official completion of “Training on the Benchmark Is Not All You Need”.
☆37Updated 10 months ago
InternLM / Condor
[ACL 2025] An official pytorch implement of the paper: Condor: Enhance LLM Alignment with Knowledge-Driven Data Synthesis and Refinement
☆37Updated 5 months ago
HowieHwong / DataGen
[ICLR'25] DataGen: Unified Synthetic Dataset Generation via Large Language Models
☆64Updated 8 months ago
dongguanting / FollowRAG
The demo, code and data of FollowRAG
☆75Updated 4 months ago
yuleiqin / fantastic-data-engineering
Fantastic Data Engineering for Large Language Models
☆92Updated 10 months ago
ChasonShi / MELoRA
code for ACL24 "MELoRA: Mini-Ensemble Low-Rank Adapter for Parameter-Efficient Fine-Tuning"
☆33Updated 9 months ago
swt-user / DMPO
☆52Updated last year
ECNU-ICALK / EduChat-Math
[MM 2025] CMM-Math: A Chinese Multimodal Math Dataset To Evaluate and Enhance the Mathematics Reasoning of Large Multimodal Models
☆47Updated last year
OpenMOSS / Say-I-Dont-Know
[ICML'2024] Can AI Assistants Know What They Don't Know?
☆83Updated last year
dongguanting / DPA-RAG
The code and data of DPA-RAG, accepted by WWW 2025 main conference.
☆63Updated last month
WisdomShell / RewardAnything
RewardAnything: Generalizable Principle-Following Reward Models
☆44Updated 5 months ago
ZHZisZZ / weak-to-strong-search
[NeurIPS'24] Weak-to-Strong Search: Align Large Language Models via Searching over Small Language Models
☆63Updated 11 months ago
Outsider565 / LoRA-GA
☆213Updated last year