EnnengYang / Efficient-WEMoE
Efficient and Effective Weight-Ensembling Mixture of Experts for Multi-Task Model Merging. arXiv, 2024.
Alternatives and similar repositories for Efficient-WEMoE
Users interested in Efficient-WEMoE are comparing it to the repositories listed below.
- Official repo for NeurIPS'24 paper "WAGLE: Strategic Weight Attribution for Effective and Modular Unlearning in Large Language Models"
- [NeurIPS 2024 D&B] Evaluating Copyright Takedown Methods for Language Models
- Official repo for EMNLP'24 paper "SOUL: Unlocking the Power of Second-Order Optimization for LLM Unlearning"
- "Simplicity Prevails: Rethinking Negative Preference Optimization for LLM Unlearning" by Chongyu Fan*, Jiancheng Liu*, Licong Lin*, Jingh…
- Official code for SEAL: Steerable Reasoning Calibration of Large Language Models for Free
- [ICLR 2025] A Closer Look at Machine Unlearning for Large Language Models
- Official code for the paper "Vaccine: Perturbation-aware Alignment for Large Language Models" (NeurIPS 2024)
- Mosaic IT: Enhancing Instruction Tuning with Data Mosaics
- Code for the paper "Parameter Efficient Multi-task Model Fusion with Partial Linearization"
- AdaMerging: Adaptive Model Merging for Multi-Task Learning. ICLR, 2024.
- LoFiT: Localized Fine-tuning on LLM Representations
- [NeurIPS'23] Aging with GRACE: Lifelong Model Editing with Discrete Key-Value Adaptors
- A block pruning framework for LLMs.
- Code associated with Tuning Language Models by Proxy (Liu et al., 2024)
- Official repository of "Localizing Task Information for Improved Model Merging and Compression" [ICML 2024]
- Official implementation of "Learning to Refuse: Towards Mitigating Privacy Risks in LLMs"
- In-Context Sharpness as Alerts: An Inner Representation Perspective for Hallucination Mitigation (ICML 2024)
- Official repository of the "Transformer Fusion with Optimal Transport" paper, published as a conference paper at ICLR 2024.
- [EMNLP 2023, Main Conference] Sparse Low-rank Adaptation of Pre-trained Language Models
- [NeurIPS 2024 Spotlight] EMR-Merging: Tuning-Free High-Performance Model Merging
- Test-time-training on nearest neighbors for large language models
- DELLA-Merging: Reducing Interference in Model Merging through Magnitude-Based Sampling
- [NAACL'25 Oral] Steering Knowledge Selection Behaviours in LLMs via SAE-Based Representation Engineering