mlfoundations / model-soups
Model soups: averaging weights of multiple fine-tuned models improves accuracy without increasing inference time
☆462Updated 9 months ago
Alternatives and similar repositories for model-soups:
Users that are interested in model-soups are comparing it to the libraries listed below
- A framework for merging models solving different tasks with different initializations into one multi-task model without any additional tr…☆298Updated last year
- Code release for "Dropout Reduces Underfitting"☆313Updated 2 years ago
- Robust fine-tuning of zero-shot models☆697Updated 3 years ago
- ☆629Updated last week
- PyTorch code for hierarchical k-means -- a data curation method for self-supervised learning☆151Updated 10 months ago
- DataComp: In search of the next generation of multimodal datasets☆703Updated last week
- ☆201Updated last year
- Official code for "TOAST: Transfer Learning via Attention Steering"☆189Updated last year
- This repository contains the implementation for the paper "EMP-SSL: Towards Self-Supervised Learning in One Training Epoch."☆227Updated last year
- Implementation of Soft MoE, proposed by Brain's Vision team, in Pytorch☆286Updated last month
- Learning from synthetic data - code and models☆315Updated last year
- Masked Siamese Networks for Label-Efficient Learning (https://arxiv.org/abs/2204.07141)☆456Updated 2 years ago
- Official code for our CVPR'22 paper “Vision Transformer Slimming: Multi-Dimension Searching in Continuous Optimization Space”☆248Updated last year
- Editing Models with Task Arithmetic☆468Updated last year
- Implementation of ST-Moe, the latest incarnation of MoE after years of research at Brain, in Pytorch☆328Updated 10 months ago
- ☆179Updated 7 months ago
- Low rank adaptation for Vision Transformer☆402Updated last year
- [NeurIPS 2023] This repository includes the official implementation of our paper "An Inverse Scaling Law for CLIP Training"☆314Updated 11 months ago
- Collection of Tools and Papers related to Adapters / Parameter-Efficient Transfer Learning/ Fine-Tuning☆191Updated last year
- FFCV-SSL Fast Forward Computer Vision for Self-Supervised Learning.☆207Updated last year
- Fine-tuning Vision Transformers on various classification datasets☆107Updated 8 months ago
- A collection of parameter-efficient transfer learning papers focusing on computer vision and multimodal domains.☆400Updated 7 months ago
- ☆176Updated last year
- Hiera: A fast, powerful, and simple hierarchical vision transformer.☆977Updated last year
- Official Open Source code for "Scaling Language-Image Pre-training via Masking"☆420Updated 2 years ago
- A method to increase the speed and lower the memory footprint of existing vision transformers.☆1,046Updated 10 months ago
- Reproducible scaling laws for contrastive language-image learning (https://arxiv.org/abs/2212.07143)☆165Updated last year
- [ICCV 2021- Oral] Official PyTorch implementation for Generic Attention-model Explainability for Interpreting Bi-Modal and Encoder-Decode…☆849Updated last year
- ☆515Updated 5 months ago
- CLIP-like model evaluation☆703Updated last month