YehLi / ImageNetModel
Official ImageNet Model repository
☆212 · Updated last year
Related projects:
- [NeurIPS 2022 Spotlight] This is the official PyTorch implementation of "Fast Vision Transformers with HiLo Attention" ☆239 · Updated 10 months ago
- TransXNet: Learning Both Global and Local Dynamics with a Dual Dynamic Token Mixer for Visual Recognition ☆109 · Updated 9 months ago
- Official repository of Slide-Transformer (CVPR2023) ☆157 · Updated 3 weeks ago
- This is an official implementation for "Scale-Aware Modulation Meet Transformer". ☆186 · Updated last year
- Lite Vision Transformer (CVPR 2022) ☆134 · Updated last year
- The official code of "Rethinking Local Perception in Lightweight Vision Transformer" ☆83 · Updated last year
- iFormer: Inception Transformer ☆239 · Updated last year
- CMT Pytorch implementation of our CVPR 2022 paper CMT: Convolutional Neural Networks Meet Vision Transformers (https://arxiv.org/pdf/2107… ☆92 · Updated 2 years ago
- Code Implementation of EfficientVMamba ☆172 · Updated 5 months ago
- Scattering Vision Transformer ☆45 · Updated 6 months ago
- ✨✨ Latest Papers on Vision Mamba and Related Areas ☆177 · Updated this week
- An unofficial implementation for Detecting Camouflaged Object in Frequency Domain, CVPR 2022 in PyTorch. ☆65 · Updated 2 years ago
- [TPAMI22] Pyramid Pooling Transformer for Scene Understanding ☆198 · Updated last year
- GroupMixAttention and GroupMixFormer ☆108 · Updated 9 months ago
- CMT: Convolutional Neural Networks Meet Vision Transformers ☆118 · Updated 2 years ago
- PyTorch reimplementation of the paper "MaxViT: Multi-Axis Vision Transformer" [ECCV 2022]. ☆157 · Updated last year
- (CVPR2024) RMT: Retentive Networks Meet Vision Transformer ☆272 · Updated last month
- InceptionNeXt: When Inception Meets ConvNeXt (CVPR 2024) ☆225 · Updated 10 months ago
- Wavelet Convolutions for Large Receptive Fields. ECCV 2024. ☆141 · Updated 2 months ago
- Official implementation of CrossViT. https://arxiv.org/abs/2103.14899 ☆343 · Updated 2 years ago
- Code for paper LocalMamba: Visual State Space Model with Windowed Selective Scan ☆191 · Updated 4 months ago
- Official implementation for paper "LightViT: Towards Light-Weight Convolution-Free Vision Transformers" ☆136 · Updated 2 years ago
- A Simple but Effective Downsampling Module For Semantic Segmentation ☆58 · Updated last year