liuqidong07 / MOELoRA-peftLinks

[SIGIR'24] The official implementation code of MOELoRA.

☆169

Alternatives and similar repositories for MOELoRA-peft

Users that are interested in MOELoRA-peft are comparing it to the libraries listed below

Sorting:

GCYZSL / MoLA
☆143Updated 11 months ago
sail-sg / sdft
[ACL 2024] The official codebase for the paper "Self-Distillation Bridges Distribution Gap in Language Model Fine-tuning".
☆124Updated 8 months ago
Ablustrund / LoRAMoE
LoRAMoE: Revolutionizing Mixture of Experts for Maintaining World Knowledge in Language Model Alignment
☆353Updated last year
cmnfriend / O-LoRA
☆179Updated last year
lancopku / label-words-are-anchors
Repository for Label Words are Anchors: An Information Flow Perspective for Understanding In-Context Learning
☆164Updated last year
maidacundo / MoE-LoRA
Adapt an LLM model to a Mixture-of-Experts model using Parameter Efficient finetuning (LoRA), injecting the LoRAs in the FFN.
☆46Updated 9 months ago
pldlgb / nuggets
☆84Updated last year
OpenMOSS / Say-I-Dont-Know
[ICML'2024] Can AI Assistants Know What They Don't Know?
☆81Updated last year
hahahawu / Long-to-Short-via-Model-Merging
Model merging is a highly efficient approach for long-to-short reasoning.
☆71Updated last month
hemingkx / TokenSkip
TokenSkip: Controllable Chain-of-Thought Compression in LLMs
☆164Updated 2 weeks ago
tianyi-lab / Superfiltering
[ACL'24] Superfiltering: Weak-to-Strong Data Filtering for Fast Instruction-Tuning
☆162Updated 3 weeks ago
TUDB-Labs / MixLoRA
State-of-the-art Parameter-Efficient MoE Fine-tuning Method
☆168Updated 10 months ago
ZhenweiAn / Dynamic_MoE
Inference Code for Paper "Harder Tasks Need More Experts: Dynamic Routing in MoE Models"
☆54Updated 11 months ago
circle-hit / SAPT
Code for ACL 2024 accepted paper titled "SAPT: A Shared Attention Framework for Parameter-Efficient Continual Learning of Large Language …
☆35Updated 6 months ago
TUDB-Labs / MoE-PEFT
An Efficient LLM Fine-Tuning Factory Optimized for MoE PEFT
☆106Updated 4 months ago
ChasonShi / MELoRA
code for ACL24 "MELoRA: Mini-Ensemble Low-Rank Adapter for Parameter-Efficient Fine-Tuning"
☆19Updated 4 months ago
QingruZhang / AdaLoRA
AdaLoRA: Adaptive Budget Allocation for Parameter-Efficient Fine-Tuning (ICLR 2023).
☆336Updated 2 years ago
getao / icae
The repo for In-context Autoencoder
☆129Updated last year
sanowl / OmegaPRM
this is an implementation for the paper Improve Mathematical Reasoning in Language Models by Automated Process Supervision from google de…
☆33Updated last week
songmzhang / DSKD
Repo for the EMNLP'24 Paper "Dual-Space Knowledge Distillation for Large Language Models". A general white-box KD framework for both same…
☆56Updated 8 months ago
THU-KEG / AdaptThink
☆132Updated last month
Yiwei98 / TDG
☆28Updated last year
pillowsofwind / Knowledge-Conflicts-Survey
[EMNLP 2024] The official GitHub repo for the survey paper "Knowledge Conflicts for LLMs: A Survey"
☆127Updated 9 months ago
OrangeInSouth / DeePEn
A method of ensemble learning for heterogeneous large language models.
☆58Updated 11 months ago
tianyi-lab / Cherry_LLM
[NAACL'24] Self-data filtering of LLM instruction-tuning data using a novel perplexity-based difficulty score, without using any other mo…
☆378Updated 3 weeks ago
HowieHwong / DataGen
[ICLR'25] DataGen: Unified Synthetic Dataset Generation via Large Language Models
☆56Updated 4 months ago
princeton-nlp / CEPE
[ACL 2024] Long-Context Language Modeling with Parallel Encodings
☆155Updated last year
princeton-nlp / QuRating
[ICML 2024] Selecting High-Quality Data for Training Language Models
☆178Updated last year
xiaoya-li / Instruction-Tuning-Survey
Project for the paper entitled `Instruction Tuning for Large Language Models: A Survey`
☆180Updated 7 months ago
Cohere-Labs-Community / parameter-efficient-moe
☆263Updated last year