luminolx / ScaleNet
ScaleNet: Searching for the Model to Scale (ECCV 2022)
☆12Updated 2 years ago
Alternatives and similar repositories for ScaleNet:
Users that are interested in ScaleNet are comparing it to the libraries listed below
- The Curious Layperson: Fine-Grained Image Recognition without Expert Labels (BMVC 2021 best student paper)☆23Updated 2 years ago
- A pytorch implementation of the ICCV2021 workshop paper SimDis: Simple Distillation Baselines for Improving Small Self-supervised Models☆14Updated 3 years ago
- Benchmarking Attention Mechanism in Vision Transformers.☆17Updated 2 years ago
- Normalization Matters in Weakly Supervised Object Localization (ICCV 2021)☆11Updated 3 years ago
- [ECCV 2024] This is the official implementation of "Stitched ViTs are Flexible Vision Backbones".☆27Updated last year
- [ECCV2024][ICCV2023] Official PyTorch implementation of SeiT++ and SeiT☆53Updated 6 months ago
- Code of our Neurips2020 paper "Auto Learning Attention", coming soon☆21Updated 3 years ago
- Repository for the paper "Data Efficient Masked Language Modeling for Vision and Language".☆18Updated 3 years ago
- Code of CropMix: Sampling a Rich Input Distribution via Multi-Scale Cropping☆17Updated 2 years ago
- LV-BERT: Exploiting Layer Variety for BERT (Findings of ACL 2021)☆18Updated last year
- Code for paper "Point and Ask: Incorporating Pointing into Visual Question Answering"☆18Updated 2 years ago
- A huge dataset for Document Visual Question Answering☆15Updated 6 months ago
- Official code for "Dynamic Token Normalization Improves Vision Transformer", ICLR 2022.☆28Updated 2 years ago
- ☆52Updated last year
- ☆16Updated last year
- Localized Vision-Language Matching for Open-vocabulary Object Detection☆20Updated 2 years ago
- [NeurIPS 2022] code for "K-LITE: Learning Transferable Visual Models with External Knowledge" https://arxiv.org/abs/2204.09222☆51Updated last year
- SOIT: Segmenting Objects with Instance-Aware Transformers☆14Updated 2 years ago
- ICME2022 Special Session “Beyond Accuracy: Responsible, Responsive, and Robust Multimedia Retrieval ”☆11Updated 8 months ago
- Un-*** 50 billions multimodality dataset☆24Updated 2 years ago
- ImageNet-12k subset of ImageNet-21k (fall11)☆21Updated last year
- Code implementation for paper "On the Efficacy of Small Self-Supervised Contrastive Models without Distillation Signals".☆16Updated 3 years ago
- Accepted by AAAI2022☆21Updated 2 years ago
- Cross-modal Hierarchical Modelling for FGSBIR. Work accepted for Oral presentation in BMVC 2020☆17Updated last year
- Official Pytorch implementation for Distilling Image Classifiers in Object detection (NeurIPS2021)☆30Updated 3 years ago
- [FGVC9-CVPR 2022] The second place solution for 2nd eBay eProduct Visual Search Challenge.☆26Updated 2 years ago
- [ICCV 2021] Official PyTorch implementation for Deep Relational Metric Learning.☆41Updated 3 years ago
- ChineseCLIP using online learning☆12Updated 2 years ago
- An interactive demo based on Segment-Anything for stroke-based painting which enables human-like painting.☆34Updated last year
- Beyond Masking: Demystifying Token-Based Pre-Training for Vision Transformers☆26Updated 2 years ago