badripatro / svt
Scattering Vision Transformer 
☆53Updated last year
Alternatives and similar repositories for svt
Users that are interested in svt are comparing it to the libraries listed below
- ☆85Updated 2 years ago
- ☆151Updated last year
- ☆147Updated last year
- ☆68Updated last year
- Official ImageNet Model repository☆255Updated 2 years ago
- ICLR2024 When Semantic Segmentation Meets Frequency Aliasing☆45Updated last year
- CVPR 2024 Highlight: Frequency-Adaptive Dilated Convolution for Semantic Segmentation☆159Updated 10 months ago
- Official code release of our paper "FViT: A Focal Vision Transformer with Gabor Filter"☆18Updated 2 months ago
- Trainable Highly-expressive Activation Functions. ECCV 2024☆38Updated 8 months ago
- ☆152Updated last year
- CMT Pytorch implementation of our CVPR 2022 paper CMT: Convolutional Neural Networks Meet Vision Transformers (https://arxiv.org/pdf/2107…☆99Updated 3 years ago
- ☆123Updated last year
- GroupMixAttention and GroupMixFormer☆116Updated last year
- The official implementation of Res-VMamba: Fine-Grained Food Category Visual Classification Using Selective State Space Models with…☆79Updated 2 months ago
- The official implementation for ALOFT (CVPR 2023).☆56Updated 2 years ago
- [ICCV2023] This is an official implementation for "Scale-Aware Modulation Meet Transformer".☆211Updated 2 years ago
- Vision Mamba: A Comprehensive Survey and Taxonomy☆96Updated last year
- [NeurIPS2024] Multi-Scale VMamba: Hierarchy in Hierarchy Visual State Space Model☆76Updated 10 months ago
- Orthogonal Channel Attentions Networks☆52Updated last year
- Official Pytorch implementations for "MetaSeg: MetaFormer-based Global Contexts-aware Network for Efficient Semantic Segmentation" (WACV …☆40Updated last year
- CVPR2024 Frequency-Adaptive Dilated Convolution☆34Updated last year
- [ICCV2025] Official Pytorch Implementation of TinyViM☆92Updated 3 months ago
- [NeurIPS 2022 Spotlight] This is the official PyTorch implementation of "Fast Vision Transformers with HiLo Attention"☆289Updated last year
- (ICCV'23) Learning to Upsample by Learning to Sample☆160Updated last year
- [TNNLS 2025] TransXNet: Learning Both Global and Local Dynamics with a Dual Dynamic Token Mixer for Visual Recognition☆219Updated 4 months ago
- GroupMamba: Parameter-Efficient and Accurate Group Visual State Space Model [CVPR 2025]☆122Updated 7 months ago
- The official repository of the paper "Learning Correlation Structures for Vision Transformers" accepted to CVPR 2024.☆47Updated last year
- Code for paper LocalMamba: Visual State Space Model with Windowed Selective Scan☆267Updated last year
- ☆237Updated last year
- Official repository of Slide-Transformer (CVPR2023)☆170Updated last year