SwinTransformer / Feature-Distillation
☆243Updated last year
Related projects ⓘ
Alternatives and complementary repositories for Feature-Distillation
- Official Codes for "Uniform Masking: Enabling MAE Pre-training for Pyramid-based Vision Transformers with Locality"☆238Updated last year
- [CVPR 2022 Oral] Crafting Better Contrastive Views for Siamese Representation Learning☆284Updated 2 years ago
- [ICCV 2023] You Only Look at One Partial Sequence☆336Updated last year
- This is a PyTorch implementation of “Context AutoEncoder for Self-Supervised Representation Learning"☆194Updated last year
- [NeurIPS 2022] Implementation of "AdaptFormer: Adapting Vision Transformers for Scalable Visual Recognition"☆326Updated 2 years ago
- [Under preparation] Code repo for "Open-Vocabulary DETR with Conditional Matching" (ECCV 2022)☆208Updated 2 years ago
- [NeurIPS2022] Official implementation of the paper 'Green Hierarchical Vision Transformer for Masked Image Modeling'.☆170Updated last year
- [ECCV 2022]Code for paper "DaViT: Dual Attention Vision Transformer"☆329Updated 8 months ago
- ConvMAE: Masked Convolution Meets Masked Autoencoders☆483Updated last year
- CLIP Itself is a Strong Fine-tuner: Achieving 85.7% and 88.0% Top-1 Accuracy with ViT-B and ViT-L on ImageNet☆209Updated last year
- [NeurIPS 2021 Spotlight] Aligning Pretraining for Detection via Object-Level Contrastive Learning☆173Updated 2 years ago
- [CVPR 2023] This repository includes the official implementation our paper "Masked Autoencoders Enable Efficient Knowledge Distillers"☆99Updated last year
- ☆210Updated 2 years ago
- reproduction of semantic segmentation using masked autoencoder (mae)☆155Updated 2 years ago
- [ICCV2023] DETR Doesn’t Need Multi-Scale or Locality Design☆192Updated 11 months ago
- ☆81Updated last year
- Python code for ICLR 2022 spotlight paper EViT: Expediting Vision Transformers via Token Reorganizations☆170Updated last year
- (CVPR2023/TPAMI2024) Integrally Pre-Trained Transformer Pyramid Networks -- A Hierarchical Vision Transformer for Masked Image Modeling☆175Updated 3 months ago
- [CVPR 2022] DenseCLIP: Language-Guided Dense Prediction with Context-Aware Prompting☆518Updated last year
- FastMIM, official pytorch implementation of our paper "FastMIM: Expediting Masked Image Modeling Pre-training for Vision"(https://arxiv.o…☆39Updated last year
- Open-vocabulary Semantic Segmentation☆166Updated last year
- Official Code of Paper "Reversible Column Networks" "RevColv2"☆249Updated last year
- PromptDet: Towards Open-vocabulary Detection using Uncurated Images, ECCV2022☆160Updated 2 years ago
- TokenMix: Rethinking Image Mixing for Data Augmentation in Vision Transformers (ECCV 2022)☆92Updated 2 years ago
- Official PyTorch implementation of A-ViT: Adaptive Tokens for Efficient Vision Transformer (CVPR 2022)☆149Updated 2 years ago
- A DETR-style framework for open-vocabulary detection (OVD). CVPR 2023☆174Updated last year
- LoMaR (Efficient Self-supervised Vision Pretraining with Local Masked Reconstruction)☆61Updated 2 years ago
- (TPAMI2022) The ImageNet-S benchmark/method for large-scale unsupervised/semi-supervised semantic segmentation.☆168Updated last year
- Reading list for research topics in Masked Image Modeling☆331Updated 4 months ago
- MixMIM: Mixed and Masked Image Modeling for Efficient Visual Representation Learning☆128Updated last year