mlfoundations / model-soups
Model soups: averaging weights of multiple fine-tuned models improves accuracy without increasing inference time
☆489 Updated last year
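The one-line description above (averaging the weights of several fine-tuned models) can be illustrated with a minimal sketch. This is not the repository's code or API: `uniform_soup`, the `Weights` alias, and the toy parameter names are all hypothetical, and plain Python lists stand in for real tensors. It assumes every checkpoint comes from the same architecture, so parameter names and shapes match.

```python
from typing import Dict, List, Sequence

# Hypothetical stand-in for a model state dict: parameter name -> flat list of values.
Weights = Dict[str, List[float]]


def uniform_soup(checkpoints: Sequence[Weights]) -> Weights:
    """Average parameters key-wise across checkpoints that share one architecture."""
    if not checkpoints:
        raise ValueError("need at least one checkpoint")
    keys = checkpoints[0].keys()
    for ckpt in checkpoints:
        if ckpt.keys() != keys:
            raise ValueError("checkpoints must have identical parameter names")
    n = len(checkpoints)
    soup: Weights = {}
    for name in keys:
        # Group the i-th value of this parameter from every checkpoint, then average.
        columns = zip(*(ckpt[name] for ckpt in checkpoints))
        soup[name] = [sum(col) / n for col in columns]
    return soup


# Toy example: two "fine-tuned models" with the same parameter names and shapes.
model_a = {"layer.weight": [1.0, 2.0], "layer.bias": [0.0]}
model_b = {"layer.weight": [3.0, 4.0], "layer.bias": [1.0]}
soup = uniform_soup([model_a, model_b])
```

The result is a single set of weights the same size as each input, which is why inference cost does not grow with the number of models averaged.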
Alternatives and similar repositories for model-soups
Users who are interested in model-soups are comparing it to the libraries listed below:
- A framework for merging models solving different tasks with different initializations into one multi-task model without any additional tr… ☆309 Updated last year
- Robust fine-tuning of zero-shot models ☆744 Updated 3 years ago
- Code release for "Dropout Reduces Underfitting" ☆315 Updated 2 years ago
- ☆683 Updated 2 months ago
- Compare neural networks by their feature similarity ☆373 Updated 2 years ago
- ☆209 Updated 2 years ago
- PyTorch code for hierarchical k-means -- a data curation method for self-supervised learning ☆215 Updated last year
- Editing Models with Task Arithmetic ☆508 Updated last year
- Official code for our CVPR'22 paper “Vision Transformer Slimming: Multi-Dimension Searching in Continuous Optimization Space” ☆250 Updated 2 months ago
- Implementation of Soft MoE, proposed by Brain's Vision team, in Pytorch ☆331 Updated 7 months ago
- Collection of Tools and Papers related to Adapters / Parameter-Efficient Transfer Learning/ Fine-Tuning ☆199 Updated last year
- Fine-tuning Vision Transformers on various classification datasets ☆110 Updated last year
- This repository contains the implementation for the paper "EMP-SSL: Towards Self-Supervised Learning in One Training Epoch." ☆230 Updated 2 years ago
- A collection of parameter-efficient transfer learning papers focusing on computer vision and multimodal domains. ☆410 Updated last year
- Reproducible scaling laws for contrastive language-image learning (https://arxiv.org/abs/2212.07143) ☆178 Updated 4 months ago
- Implementation of ST-Moe, the latest incarnation of MoE after years of research at Brain, in Pytorch ☆366 Updated last year
- Official code for "TOAST: Transfer Learning via Attention Steering" ☆186 Updated 2 years ago
- Learning from synthetic data - code and models ☆323 Updated last year
- DataComp: In search of the next generation of multimodal datasets ☆745 Updated 6 months ago
- CLIP-like model evaluation ☆780 Updated 2 months ago
- Masked Siamese Networks for Label-Efficient Learning (https://arxiv.org/abs/2204.07141) ☆461 Updated 3 years ago
- ☆186 Updated last year
- [NeurIPS 2023] This repository includes the official implementation of our paper "An Inverse Scaling Law for CLIP Training" ☆318 Updated last year
- ☆255 Updated 4 years ago
- [BMVC 2022] Official repository for "How to Train Vision Transformer on Small-scale Datasets?" ☆164 Updated last year
- When do we not need larger vision models? ☆410 Updated 8 months ago
- Official PyTorch implementation of "ML-Decoder: Scalable and Versatile Classification Head" (2021) ☆348 Updated 2 years ago
- Official Pytorch Implementation of: "ImageNet-21K Pretraining for the Masses"(NeurIPS, 2021) paper ☆775 Updated 2 years ago
- ☆210 Updated 3 years ago
- FFCV-SSL Fast Forward Computer Vision for Self-Supervised Learning. ☆210 Updated 2 years ago