szc12153 / sparse_meta_tuningLinks

Official implementation for Sparse MetA-Tuning (SMAT)

☆16

Alternatives and similar repositories for sparse_meta_tuning

Users that are interested in sparse_meta_tuning are comparing it to the libraries listed below

Sorting:

SamsungSAILMontreal / PAPA
Repository for the PopulAtion Parameter Averaging (PAPA) paper
☆26Updated last year
NVlabs / HCL
[CVPR'23 Highlight] Heterogeneous Continual Learning.
☆16Updated last year
facebookresearch / ModelRatatouille
Recycling diverse models
☆45Updated 2 years ago
SamsungSAILMontreal / nino
Code for "Accelerating Training with Neuron Interaction and Nowcasting Networks" [to appear at ICLR 2025]
☆19Updated last month
facebookresearch / DejaVu
Repository for the paper Do SSL Models Have Déjà Vu? A Case of Unintended Memorization in Self-supervised Learning
☆36Updated 2 years ago
gregorbachmann / scaling_mlps
☆51Updated last year
fredzzhang / atlas
[NeurIPS'24] Official PyTorch implementation for paper "Knowledge Composition using Task Vectors with Learned Anisotropic Scaling"
☆22Updated 4 months ago
Netflix / clove
☆13Updated 10 months ago
ExplainableML / fomo_in_flux
Code and benchmark for the paper: "A Practitioner's Guide to Continual Multimodal Pretraining" [NeurIPS'24]
☆57Updated 7 months ago
smonsays / contrastive-meta-learning
Code accompanying the paper "A contrastive rule for meta-learning"
☆12Updated 8 months ago
sjunhongshen / DASH
☆23Updated 2 years ago
yifanzhang-pro / M-MAE
Official implementation of Matrix Variational Masked Autoencoder (M-MAE) for ICML paper "Information Flow in Self-Supervised Learning" (h…
☆14Updated 9 months ago
nverma1 / merging-text-transformers
Code for "Merging Text Transformers from Different Initializations"
☆20Updated 5 months ago
AllanYangZhou / generative-invariance-transfer
☆26Updated 3 years ago
stanislavfort / adversaries_to_OOD_detection
☆12Updated 2 years ago
sjunhongshen / ORCA
Official implementation of ORCA proposed in the paper "Cross-Modal Fine-Tuning: Align then Refine"
☆71Updated last year
NVlabs / STL
Official Pytorch Implementation of Self-emerging Token Labeling
☆33Updated last year
facebookresearch / ViP-MAE
This is a PyTorch implementation of the paperViP A Differentially Private Foundation Model for Computer Vision
☆36Updated 2 years ago
shoaibahmed / metadata_archaeology
Official code for the paper: "Metadata Archaeology"
☆19Updated 2 years ago
MadryLab / data-transfer
☆36Updated 2 years ago
locuslab / T-MARS
Code for T-MARS data filtering
☆35Updated last year
alexrame / diwa
DiWA: Diverse Weight Averaging for Out-of-Distribution Generalization
☆31Updated 2 years ago
RandallBalestriero / SplineLLM
☆16Updated last year
SamsungSAILMontreal / ghn3
Code for "Can We Scale Transformers to Predict Parameters of Diverse ImageNet Models?" [ICML 2023]
☆36Updated 10 months ago
uclaml / PDE
Official repo of Progressive Data Expansion: data, code and evaluation
☆29Updated last year
aszala / EnvGen
Official Code Repository for EnvGen: Generating and Adapting Environments via LLMs for Training Embodied Agents (COLM 2024)
☆34Updated last year
s-sahoo / MuLAN
[NeurIPS 2024, spotlight] Multivariate Learned Adaptive Noise for Diffusion Models
☆23Updated 7 months ago
kyegomez / Hedgehog
Implementation of the model "Hedgehog" from the paper: "The Hedgehog & the Porcupine: Expressive Linear Attentions with Softmax Mimicry"
☆14Updated last year
tianyu139 / tangent-model-composition
Code for Tangent Model Composition for Ensembling and Continual Fine-tuning (ICCV 2023) and Tangent Transformers for Composition, Privacy…
☆13Updated last year
JonasGeiping / dataaugs
☆18Updated 2 years ago