JingXuTHU / Random-Masking-Finds-Winning-Tickets-for-Parameter-Efficient-Fine-tuning

☆14

Alternatives and similar repositories for Random-Masking-Finds-Winning-Tickets-for-Parameter-Efficient-Fine-tuning

Users who are interested in Random-Masking-Finds-Winning-Tickets-for-Parameter-Efficient-Fine-tuning are comparing it to the libraries listed below.

QingruZhang / PLATON
This PyTorch package implements PLATON: Pruning Large Transformer Models with Upper Confidence Bound of Weight Importance (ICML 2022).
☆46 Updated 3 years ago
abhishekpanigrahi1996 / Skill-Localization-by-grafting
☆51 Updated last year
ZO-Bench / ZO-LLM
[ICML'24] Official code for the paper "Revisiting Zeroth-Order Optimization for Memory-Efficient LLM Fine-Tuning: A Benchmark".
☆118 Updated 4 months ago
SempraETY / Pruning-via-Merging
☆23 Updated last year
VITA-Group / SEAL
[COLM 2025] SEAL: Steerable Reasoning Calibration of Large Language Models for Free
☆45 Updated 7 months ago
Trustworthy-ML-Lab / ThinkEdit
[EMNLP 25] An effective and interpretable weight-editing method for mitigating overly short reasoning in LLMs, and a mechanistic study un…
☆16 Updated 2 months ago
harveyhuang18 / EMR_Merging
[NeurIPS 2024 Spotlight] EMR-Merging: Tuning-Free High-Performance Model Merging
☆72 Updated 8 months ago
locuslab / massive-activations
Code accompanying the paper "Massive Activations in Large Language Models"
☆187 Updated last year
mmatena / model_merging
☆79 Updated 3 years ago
nik-dim / tall_masks
Official repository of "Localizing Task Information for Improved Model Merging and Compression" [ICML 2024]
☆51 Updated last year
alvin-zyl / CoLA
Implementation of CoLA: Compute-Efficient Pre-Training of LLMs via Low-Rank Activation
☆24 Updated 9 months ago
uiuctml / Localize-and-Stitch
Localize-and-Stitch: Efficient Model Merging via Sparse Task Arithmetic
☆30 Updated 2 months ago
osehmathias / lisa
LISA: Layerwise Importance Sampling for Memory-Efficient Large Language Model Fine-Tuning
☆36 Updated last year
yxli2123 / LoSparse
☆62 Updated 2 years ago
biomedical-cybernetics / Relative-importance-and-activation-pruning
☆52 Updated last year
thunlp / MoEfication
☆142 Updated last year
hkust-nlp / PEM_composition
[NeurIPS 2023] GitHub repository for "Composing Parameter-Efficient Modules with Arithmetic Operations"
☆61 Updated 2 years ago
EnnengYang / RepresentationSurgery
Representation Surgery for Multi-Task Model Merging. ICML, 2024.
☆46 Updated last year
EnnengYang / AdaMerging
AdaMerging: Adaptive Model Merging for Multi-Task Learning. ICLR, 2024.
☆96 Updated last year
aim-uofa / LoRAPrune
☆61 Updated 11 months ago
ycjing / Awesome-Model-Merging
A curated list of Model Merging methods.
☆92 Updated last year
BeyonderXX / TRACE
TRACE: A Comprehensive Benchmark for Continual Learning in Large Language Models
☆81 Updated last year
MrGGLS / BlockPruner
A block pruning framework for LLMs.
☆27 Updated 6 months ago
raymin0223 / fast_robust_early_exit
Fast and Robust Early-Exiting Framework for Autoregressive Language Models with Synchronized Parallel Decoding (EMNLP 2023 Long)
☆64 Updated last year
falcon-xu / early-exit-papers
A curated list of early-exiting papers (LLM, CV, NLP, etc.)
☆68 Updated last year
decoding-comp-trust / comp-trust
Codebase for decoding compressed trust.
☆25 Updated last year
p1nksnow / MoICE
Official implementation for "Mixture of In-Context Experts Enhance LLMs’ Awareness of Long Contexts" (accepted at NeurIPS 2024)
☆13 Updated 10 months ago
hdong920 / LESS
☆53 Updated last year
thunlp / Modularity-Analysis
[ACL 2023 Findings] Emergent Modularity in Pre-trained Transformers
☆25 Updated 2 years ago
cjyaras / deep-lora-transformers
Compressible Dynamics in Deep Overparameterized Low-Rank Learning & Adaptation (ICML'24 Oral)
☆13 Updated last year