ruitian12 / resformer
Official PyTorch implementation of ResFormer: Scaling ViTs with Multi-Resolution Training, CVPR2023
☆22Updated last year
Related projects: ⓘ
- FastMIM, official pytorch implementation of our paper "FastMIM: Expediting Masked Image Modeling Pre-training for Vision"(https://arxiv.o…☆38Updated last year
- Adaptive Split-Fusion Transformer (ICME 2023 Oral)☆15Updated 7 months ago
- ☆60Updated 2 years ago
- This is code of paper "ScalableViT: Rethinking the Context-oriented Generalization of Vision Transformer"☆25Updated last year
- Adaptive Pyramid Context Network for Semantic Segmentation (APCNet CVPR'2019)☆23Updated 3 years ago
- vit for few-shot classification☆46Updated last year
- ☆35Updated last year
- [ECCV 2022] AMixer: Adaptive Weight Mixing for Self-attention Free Vision Transformers☆27Updated last year
- LoMaR (Efficient Self-supervised Vision Pretraining with Local Masked Reconstruction)☆61Updated 2 years ago
- ☆22Updated last year
- Official implementation of the paper ``W2N: Switching From Weak Supervision to Noisy Supervision for Object Detection"☆28Updated 2 years ago
- ☆69Updated last month
- ☆17Updated this week
- Official Codes and Pretrained Models for RecursiveMix☆22Updated last year
- Official implementation of the paper ``Weakly Supervised Object Localization as Domain Adaption"☆49Updated 2 years ago
- Learning to Mask and Permute Visual Tokens for Vision Transformer Pre-Training☆15Updated last year
- ☆32Updated 2 years ago
- [CVPR2022] PyTorch implementation of ''Background Activation Suppression for Weakly Supervised Object Localization''.☆43Updated 11 months ago
- [ICCV 2023] Shrinking Class Space for Enhanced Certainty in Semi-Supervised Learning☆48Updated last year
- Official code for Scale Decoupled Distillation☆29Updated 5 months ago
- Official codes for ConMIM (ICLR 2023)☆57Updated last year
- [CVPR 2023 Highlight] Masked Image Modeling with Local Multi-Scale Reconstruction☆41Updated last year
- Official implementation of the paper "Function-Consistent Feature Distillation" (ICLR 2023)☆25Updated last year
- Test different pooling method used in CNN for Computer Vision Task☆35Updated 3 years ago
- Official code for BA-SAM:Scalable Bias-Mode Attention Mask for Segment Anything Model☆10Updated 2 months ago
- ☆19Updated last year
- The official implementation of "Adapter is All You Need for Tuning Visual Tasks".☆67Updated 3 weeks ago
- CVPR2024: Dual Memory Networks: A Versatile Adaptation Approach for Vision-Language Models☆54Updated 2 months ago
- ☆32Updated 10 months ago
- ☆10Updated 2 months ago