mlfoundations / model-soupsLinks

Model soups: averaging weights of multiple fine-tuned models improves accuracy without increasing inference time

☆487

Alternatives and similar repositories for model-soups

Users that are interested in model-soups are comparing it to the libraries listed below

Sorting:

mlfoundations / wise-ft
Robust fine-tuning of zero-shot models
☆742Updated 3 years ago
gstoica27 / ZipIt
A framework for merging models solving different tasks with different initializations into one multi-task model without any additional tr…
☆307Updated last year
facebookresearch / dropout
Code release for "Dropout Reduces Underfitting"
☆315Updated 2 years ago
google-research / vmoe
☆680Updated 2 months ago
facebookresearch / ssl-data-curation
PyTorch code for hierarchical k-means -- a data curation method for self-supervised learning
☆197Updated last year
lucidrains / soft-moe-pytorch
Implementation of Soft MoE, proposed by Brain's Vision team, in Pytorch
☆327Updated 6 months ago
tsb0601 / EMP-SSL
This repository contains the implementation for the paper "EMP-SSL: Towards Self-Supervised Learning in One Training Epoch."
☆229Updated 2 years ago
AntixK / PyTorch-Model-Compare
Compare neural networks by their feature similarity
☆374Updated 2 years ago
Arnav0400 / ViT-Slim
Official code for our CVPR'22 paper “Vision Transformer Slimming: Multi-Dimension Searching in Continuous Optimization Space”
☆250Updated last month
hsouri / Battle-of-the-Backbones
☆209Updated last year
calpt / awesome-adapter-resources
Collection of Tools and Papers related to Adapters / Parameter-Efficient Transfer Learning/ Fine-Tuning
☆198Updated last year
mlfoundations / task_vectors
Editing Models with Task Arithmetic
☆504Updated last year
facebookresearch / msn
Masked Siamese Networks for Label-Efficient Learning (https://arxiv.org/abs/2204.07141)
☆458Updated 3 years ago
lucidrains / st-moe-pytorch
Implementation of ST-Moe, the latest incarnation of MoE after years of research at Brain, in Pytorch
☆362Updated last year
bfshi / TOAST
Official code for "TOAST: Transfer Learning via Attention Steering"
☆186Updated 2 years ago
bwconrad / vit-finetune
Fine-tuning Vision Transformers on various classification datasets
☆109Updated last year
LAION-AI / CLIP_benchmark
CLIP-like model evaluation
☆773Updated last month
facebookresearch / FFCV-SSL
FFCV-SSL Fast Forward Computer Vision for Self-Supervised Learning.
☆208Updated 2 years ago
LAION-AI / scaling-laws-openclip
Reproducible scaling laws for contrastive language-image learning (https://arxiv.org/abs/2212.07143)
☆177Updated 3 months ago
mlfoundations / datacomp
DataComp: In search of the next generation of multimodal datasets
☆742Updated 5 months ago
microsoft / esvit
EsViT: Efficient self-supervised Vision Transformers
☆412Updated 2 years ago
facebookresearch / ToMe
A method to increase the speed and lower the memory footprint of existing vision transformers.
☆1,105Updated last year
google-research / syn-rep-learn
Learning from synthetic data - code and models
☆322Updated last year
jianghaojun / Awesome-Parameter-Efficient-Transfer-Learning
A collection of parameter-efficient transfer learning papers focusing on computer vision and multimodal domains.
☆407Updated last year
samiraabnar / attention_flow
☆253Updated 4 years ago
xxxnell / how-do-vits-work
(ICLR 2022 Spotlight) Official PyTorch implementation of "How Do Vision Transformers Work?"
☆819Updated 3 years ago
Alibaba-MIIL / ImageNet21K
Official Pytorch Implementation of: "ImageNet-21K Pretraining for the Masses"(NeurIPS, 2021) paper
☆775Updated 2 years ago
Sense-GVT / DeCLIP
Supervision Exists Everywhere: A Data Efficient Contrastive Language-Image Pre-training Paradigm
☆666Updated 3 years ago
UCSC-VLAA / CLIPA
[NeurIPS 2023] This repository includes the official implementation of our paper "An Inverse Scaling Law for CLIP Training"
☆317Updated last year
hananshafi / vits-for-small-scale-datasets
[BMVC 2022] Official repository for "How to Train Vision Transformer on Small-scale Datasets?"
☆162Updated last year