GCYZSL / MoLALinks

☆161

Alternatives and similar repositories for MoLA

Users that are interested in MoLA are comparing it to the libraries listed below

Sorting:

liuqidong07 / MOELoRA-peft
[SIGIR'24] The official implementation code of MOELoRA.
☆183Updated last year
cmnfriend / O-LoRA
☆186Updated last year
TUDB-Labs / MixLoRA
State-of-the-art Parameter-Efficient MoE Fine-tuning Method
☆192Updated last year
sail-sg / sdft
[ACL 2024] The official codebase for the paper "Self-Distillation Bridges Distribution Gap in Language Model Fine-tuning".
☆131Updated 11 months ago
Ablustrund / LoRAMoE
LoRAMoE: Revolutionizing Mixture of Experts for Maintaining World Knowledge in Language Model Alignment
☆380Updated last year
TUDB-Labs / MoE-PEFT
An Efficient LLM Fine-Tuning Factory Optimized for MoE PEFT
☆125Updated 7 months ago
hemingkx / TokenSkip
[EMNLP 2025] TokenSkip: Controllable Chain-of-Thought Compression in LLMs
☆186Updated 3 months ago
Outsider565 / LoRA-GA
☆212Updated last year
Clin0212 / HydraLoRA
[NeurIPS'24 Oral] HydraLoRA: An Asymmetric LoRA Architecture for Efficient Fine-Tuning
☆227Updated 10 months ago
hahahawu / Long-to-Short-via-Model-Merging
Model merging is a highly efficient approach for long-to-short reasoning.
☆89Updated last week
circle-hit / SAPT
Code for ACL 2024 accepted paper titled "SAPT: A Shared Attention Framework for Parameter-Efficient Continual Learning of Large Language …
☆36Updated 9 months ago
wutaiqiang / MoSLoRA
☆119Updated last year
Dereck0602 / Awesome_Test_Time_LLMs
☆129Updated 7 months ago
horseee / CoT-Valve
CoT-Valve: Length-Compressible Chain-of-Thought Tuning
☆86Updated 8 months ago
gyhdog99 / MoCLE
MoCLE (First MLLM with MoE for instruction customization and generalization!) (https://arxiv.org/abs/2312.12379)
☆44Updated 3 months ago
zzz47zzz / spurious-forgetting
[ICLR 2025] Released code for paper "Spurious Forgetting in Continual Learning of Language Models"
☆54Updated 5 months ago
yushuiwx / Mixture-of-LoRA-Experts
☆54Updated 10 months ago
songmzhang / DSKD
Repo for the EMNLP'24 Paper "Dual-Space Knowledge Distillation for Large Language Models". A general white-box KD framework for both same…
☆60Updated last month
TsinghuaC3I / SoRA
[EMNLP 2023, Main Conference] Sparse Low-rank Adaptation of Pre-trained Language Models
☆83Updated last year
Chongjie-Si / Subspace-Tuning
A generalized framework for subspace tuning methods in parameter efficient fine-tuning.
☆157Updated 3 months ago
getao / icae
The repo for In-context Autoencoder
☆145Updated last year
EIT-NLP / Awesome-Latent-CoT
This repository contains a regularly updated paper list for LLMs-reasoning-in-latent-space.
☆177Updated this week
THU-KEG / AdaptThink
☆157Updated 2 weeks ago
Joshua-Ren / Learning_dynamics_LLM
☆174Updated 5 months ago
ChasonShi / MELoRA
code for ACL24 "MELoRA: Mini-Ensemble Low-Rank Adapter for Parameter-Efficient Fine-Tuning"
☆32Updated 8 months ago
QingyangZhang / Label-Free-RLVR
☆275Updated 3 months ago
lancopku / label-words-are-anchors
Repository for Label Words are Anchors: An Information Flow Perspective for Understanding In-Context Learning
☆165Updated last year
OpenSparseLLMs / LLaMA-MoE-v2
🚀 LLaMA-MoE v2: Exploring Sparsity of LLaMA from Perspective of Mixture-of-Experts with Post-Training
☆88Updated 10 months ago
yubol-bobo / Awesome-Multi-Turn-LLMs
This is the official GitHub repository for our survey paper "Beyond Single-Turn: A Survey on Multi-Turn Interactions with Large Language …
☆125Updated 5 months ago
GeniusHTX / TALE
☆133Updated last month