luminolx / ScaleNet
ScaleNet: Searching for the Model to Scale (ECCV 2022)
☆12Updated 2 years ago
Alternatives and similar repositories for ScaleNet:
Users that are interested in ScaleNet are comparing it to the libraries listed below
- A pytorch implementation of the ICCV2021 workshop paper SimDis: Simple Distillation Baselines for Improving Small Self-supervised Models☆14Updated 3 years ago
- The Curious Layperson: Fine-Grained Image Recognition without Expert Labels (BMVC 2021 best student paper)☆23Updated 2 years ago
- Repository for the paper "Data Efficient Masked Language Modeling for Vision and Language".☆18Updated 3 years ago
- ☆16Updated last year
- Benchmarking Attention Mechanism in Vision Transformers.☆17Updated 2 years ago
- [ECCV2024][ICCV2023] Official PyTorch implementation of SeiT++ and SeiT☆55Updated 7 months ago
- SOIT: Segmenting Objects with Instance-Aware Transformers☆14Updated 2 years ago
- Code of CropMix: Sampling a Rich Input Distribution via Multi-Scale Cropping☆17Updated 2 years ago
- Normalization Matters in Weakly Supervised Object Localization (ICCV 2021)☆11Updated 3 years ago
- A Simple Framwork for CV Pre-training Model (SOCO, VirTex, BEiT)☆15Updated 3 years ago
- Official Pytorch implementation for Distilling Image Classifiers in Object detection (NeurIPS2021)☆30Updated 3 years ago
- LV-BERT: Exploiting Layer Variety for BERT (Findings of ACL 2021)☆18Updated last year
- Rethinking Nearest Neighbors for Visual Classification☆31Updated 3 years ago
- [Arxiv2022] Revitalize Region Feature for Democratizing Video-Language Pre-training☆21Updated 3 years ago
- [FGVC9-CVPR 2022] The second place solution for 2nd eBay eProduct Visual Search Challenge.☆26Updated 2 years ago
- ImageNet-12k subset of ImageNet-21k (fall11)☆21Updated last year
- i-mae Pytorch Repo☆20Updated 11 months ago
- ☆52Updated 2 years ago
- Learning to Mask and Permute Visual Tokens for Vision Transformer Pre-Training☆16Updated last month
- Localized Vision-Language Matching for Open-vocabulary Object Detection☆21Updated 2 years ago
- Accelerating Vision-Language Pretraining with Free Language Modeling (CVPR 2023)☆32Updated last year
- Code for paper "Point and Ask: Incorporating Pointing into Visual Question Answering"☆18Updated 2 years ago
- Contrastive Object-level Pre-training with Spatial Noise Curriculum Learning☆20Updated 3 years ago
- Lightweight Transformer for Multi-modal Tasks☆15Updated 2 years ago
- Bag of Instances Aggregation Boosts Self-supervised Distillation (ICLR 2022)☆33Updated 2 years ago
- (CVPR 2022) Automated Progressive Learning for Efficient Training of Vision Transformers☆25Updated 3 weeks ago
- [CVPR 2024] DiffAgent: Fast and Accurate Text-to-Image API Selection with Large Language Model☆17Updated 11 months ago
- Implementation for <Orthogonal Over-Parameterized Training> in CVPR'21.☆19Updated 3 years ago
- Official code for "Dynamic Token Normalization Improves Vision Transformer", ICLR 2022.☆28Updated 2 years ago
- support Large Vocabulary Instance Segmentation (LVIS) dataset for mmdetection☆16Updated 4 years ago