szc12153 / sparse_meta_tuning
Official implementation for Sparse MetA-Tuning (SMAT)
☆16Updated 3 weeks ago
Alternatives and similar repositories for sparse_meta_tuning
Users who are interested in sparse_meta_tuning are comparing it to the repositories listed below.
- Repository for the PopulAtion Parameter Averaging (PAPA) paper☆26Updated last year
- [CVPR'23 Highlight] Heterogeneous Continual Learning.☆16Updated last year
- Recycling diverse models☆45Updated 2 years ago
- Code for "Accelerating Training with Neuron Interaction and Nowcasting Networks" [to appear at ICLR 2025]☆19Updated last month
- Repository for the paper Do SSL Models Have Déjà Vu? A Case of Unintended Memorization in Self-supervised Learning☆36Updated 2 years ago
- ☆51Updated last year
- [NeurIPS'24] Official PyTorch implementation for paper "Knowledge Composition using Task Vectors with Learned Anisotropic Scaling"☆22Updated 4 months ago
- ☆13Updated 10 months ago
- Code and benchmark for the paper: "A Practitioner's Guide to Continual Multimodal Pretraining" [NeurIPS'24]☆57Updated 7 months ago
- Code accompanying the paper "A contrastive rule for meta-learning"☆12Updated 8 months ago
- ☆23Updated 2 years ago
- Official implementation of Matrix Variational Masked Autoencoder (M-MAE) for ICML paper "Information Flow in Self-Supervised Learning" (h…☆14Updated 9 months ago
- Code for "Merging Text Transformers from Different Initializations"☆20Updated 5 months ago
- ☆26Updated 3 years ago
- ☆12Updated 2 years ago
- Official implementation of ORCA proposed in the paper "Cross-Modal Fine-Tuning: Align then Refine"☆71Updated last year
- Official Pytorch Implementation of Self-emerging Token Labeling☆33Updated last year
- This is a PyTorch implementation of the paper "ViP: A Differentially Private Foundation Model for Computer Vision"☆36Updated 2 years ago
- Official code for the paper: "Metadata Archaeology"☆19Updated 2 years ago
- ☆36Updated 2 years ago
- Code for T-MARS data filtering☆35Updated last year
- DiWA: Diverse Weight Averaging for Out-of-Distribution Generalization☆31Updated 2 years ago
- ☆16Updated last year
- Code for "Can We Scale Transformers to Predict Parameters of Diverse ImageNet Models?" [ICML 2023]☆36Updated 10 months ago
- Official repo of Progressive Data Expansion: data, code and evaluation☆29Updated last year
- Official Code Repository for EnvGen: Generating and Adapting Environments via LLMs for Training Embodied Agents (COLM 2024)☆34Updated last year
- [NeurIPS 2024, spotlight] Multivariate Learned Adaptive Noise for Diffusion Models☆23Updated 7 months ago
- Implementation of the model "Hedgehog" from the paper: "The Hedgehog & the Porcupine: Expressive Linear Attentions with Softmax Mimicry"☆14Updated last year
- Code for Tangent Model Composition for Ensembling and Continual Fine-tuning (ICCV 2023) and Tangent Transformers for Composition, Privacy…☆13Updated last year
- ☆18Updated 2 years ago