szc12153 / sparse_interpolated_expertsLinks
Official implementation for Sparse MetA-Tuning (SMAT)
☆18Updated 3 weeks ago
Alternatives and similar repositories for sparse_interpolated_experts
Users that are interested in sparse_interpolated_experts are comparing it to the libraries listed below
Sorting:
- Code and benchmark for the paper: "A Practitioner's Guide to Continual Multimodal Pretraining" [NeurIPS'24]☆57Updated 8 months ago
- Code for NOLA, an implementation of "nola: Compressing LoRA using Linear Combination of Random Basis"☆56Updated last year
- ☆39Updated last year
- Official implementation of ORCA proposed in the paper "Cross-Modal Fine-Tuning: Align then Refine"☆72Updated last year
- [NeurIPS 2024] RaVL: Discovering and Mitigating Spurious Correlations in Fine-Tuned Vision-Language Models☆25Updated 9 months ago
- Official code for `Visual Attention Emerges from Recurrent Sparse Reconstruction' (ICML 2022)☆36Updated 3 years ago
- This is a PyTorch implementation of the paperViP A Differentially Private Foundation Model for Computer Vision☆36Updated 2 years ago
- ☆38Updated last year
- (NeurIPS 2024) What Makes CLIP More Robust to Long-Tailed Pre-Training Data? A Controlled Study for Transferable Insights☆27Updated 10 months ago
- [NeurIPS'24] Official PyTorch implementation for paper "Knowledge Composition using Task Vectors with Learned Anisotropic Scaling"☆24Updated 6 months ago
- Official code for the ICML 2024 paper "The Entropy Enigma: Success and Failure of Entropy Minimization"☆53Updated last year
- Code for the paper Self-Supervised Learning of Split Invariant Equivariant Representations☆29Updated last year
- Official Code for NeurIPS 2022 Paper: How Mask Matters: Towards Theoretical Understandings of Masked Autoencoders☆68Updated last year
- [NeurIPS'24] Multilinear Mixture of Experts: Scalable Expert Specialization through Factorization☆33Updated 10 months ago
- Patching open-vocabulary models by interpolating weights☆91Updated last year
- ☆25Updated 3 years ago
- Code for ICLR'24 workshop ME-FoMo-How Well Does GPT-4V(ision) Adapt to Distribution Shifts? A Preliminary Investigation☆37Updated 10 months ago
- [CVPR 2022] "The Principle of Diversity: Training Stronger Vision Transformers Calls for Reducing All Levels of Redundancy" by Tianlong C…☆25Updated 3 years ago
- Code for T-MARS data filtering☆35Updated 2 years ago
- If CLIP Could Talk: Understanding Vision-Language Model Representations Through Their Preferred Concept Descriptions☆17Updated last year
- Official implementation of AAAI 2023 paper "Parameter-efficient Model Adaptation for Vision Transformers"☆105Updated 2 years ago
- Test-Time Distribution Normalization For Contrastively Learned Vision-language Models☆27Updated last year
- DiWA: Diverse Weight Averaging for Out-of-Distribution Generalization☆31Updated 2 years ago
- [WACV 2025] Official implementation of "Online-LoRA: Task-free Online Continual Learning via Low Rank Adaptation" by Xiwen Wei, Guihong L…☆47Updated 9 months ago
- Recycling diverse models☆45Updated 2 years ago
- CatMAE☆14Updated last year
- ImageNetV2 Pytorch Dataset☆41Updated 2 years ago
- Original code base for On Pretraining Data Diversity for Self-Supervised Learning☆13Updated 7 months ago
- PyTorch implementation of "From Sparse to Soft Mixtures of Experts"☆61Updated 2 years ago
- This repository is the implementation of the paper Training Free Pretrained Model Merging (CVPR2024).☆31Updated last year