rentainhe / pytorch-pooling
Test different pooling method used in CNN for Computer Vision Task
☆35Updated 4 years ago
Alternatives and similar repositories for pytorch-pooling:
Users that are interested in pytorch-pooling are comparing it to the libraries listed below
- FastMIM, official pytorch implementation of our paper "FastMIM: Expediting Masked Image Modeling Pre-training for Vision"(https://arxiv.o…☆39Updated 2 years ago
- code base for vision transformers☆36Updated 3 years ago
- Lightweight Transformer for Multi-modal Tasks☆15Updated 2 years ago
- Official code for "Dynamic Token Normalization Improves Vision Transformer", ICLR 2022.☆28Updated 2 years ago
- Adaptive Split-Fusion Transformer (ICME 2023 Oral)☆16Updated last year
- The official repo of the CVPR2021 oral paper: Representative Batch Normalization with Feature Calibration☆86Updated 2 years ago
- Official codes for ConMIM (ICLR 2023)☆58Updated 2 years ago
- Official PyTorch implementation of ResFormer: Scaling ViTs with Multi-Resolution Training, CVPR2023☆27Updated last year
- Code and models for the paper Glance-and-Gaze Vision Transformer☆28Updated 3 years ago
- ☆71Updated 3 weeks ago
- [ICME 2022] code for the paper, SimVit: Exploring a simple vision transformer with sliding windows.☆67Updated 2 years ago
- ☆57Updated 2 years ago
- This is novel noisy-robust Attentive Feature MixUp method.☆21Updated 4 years ago
- Official implementation of the paper "Function-Consistent Feature Distillation" (ICLR 2023)☆28Updated last year
- This is an unofficial implementation of BOAT: Bilateral Local Attention Vision Transformer☆54Updated 3 years ago
- Official implementation of the paper ``Unifying Nonlocal Blocks for Neural Networks'' (ICCV'21)☆98Updated 3 years ago
- TokenMix: Rethinking Image Mixing for Data Augmentation in Vision Transformers (ECCV 2022)☆93Updated 2 years ago
- ☆57Updated 3 years ago
- Official implementation of the paper ``Weakly Supervised Object Localization as Domain Adaption"☆50Updated 2 years ago
- Locally Enhanced Self-Attention: Rethinking Self-Attention as Local and Context Terms☆20Updated 3 years ago
- ☆33Updated 3 years ago
- ☆42Updated 3 years ago
- This project is the PyTorch implementation of our CVPR 2022 paper:☆26Updated 2 years ago
- ☆49Updated 3 years ago
- LoMaR (Efficient Self-supervised Vision Pretraining with Local Masked Reconstruction)☆63Updated 2 years ago
- Beyond Masking: Demystifying Token-Based Pre-Training for Vision Transformers☆26Updated 2 years ago
- This is code of paper "ScalableViT: Rethinking the Context-oriented Generalization of Vision Transformer"☆26Updated last year
- Teach-DETR: Better Training DETR with Teachers☆31Updated last year
- ☆21Updated 3 years ago
- ☆25Updated 3 years ago