[CVPR 2024] SHViT: Single-Head Vision Transformer with Memory Efficient Macro Design
☆136Jun 14, 2024Updated last year
Alternatives and similar repositories for SHViT
Users that are interested in SHViT are comparing it to the libraries listed below
Sorting:
- [ECCV 2024 Oral] Official implementation of the paper "PDiscoFormer: Relaxing Part Discovery Constraints with Vision Transformers"☆18Jul 3, 2025Updated 8 months ago
- [ICLR 2024] MogaNet: Efficient Multi-order Gated Aggregation Network☆266Jun 28, 2025Updated 8 months ago
- ☆21May 7, 2024Updated last year
- ☆37Oct 17, 2025Updated 4 months ago
- RepViT: Revisiting Mobile CNN From ViT Perspective [CVPR 2024] and RepViT-SAM: Towards Real-Time Segmenting Anything☆1,065Jun 14, 2024Updated last year
- [ICML 2024] Official PyTorch implementation of "SLAB: Efficient Transformers with Simplified Linear Attention and Progressive Re-paramete…☆108Aug 23, 2024Updated last year
- Official Pytorch implementation for "IFORMER: INTEGRATING CONVNET AND TRANSFORMER FOR MOBILE APPLICATION" [ICLR 2025]☆63Mar 22, 2025Updated 11 months ago
- FFNet: MetaMixer-based Efficient Convolutional Mixer Design☆31Mar 11, 2025Updated 11 months ago
- Orthogonal Channel Attentions Networks☆53Nov 7, 2023Updated 2 years ago
- [CVPR 2024] Rewrite the Stars☆446May 7, 2024Updated last year
- ☆65Jun 12, 2024Updated last year
- ☆50Mar 14, 2025Updated 11 months ago
- An unofficial implementation of MobileNetV4 in Pytorch☆212Mar 19, 2025Updated 11 months ago
- Official implementation of "ImagineFSL: Self-Supervised Pretraining Matters on Imagined Base Set for VLM-based Few-shot Learning" [CVPR 2…☆25Sep 1, 2025Updated 6 months ago
- Implementation of paper - RepVGG-GELAN: ENHANCED GELAN WITH VGG-STYLE CONVNETS FOR BRAIN TUMOR DETECTION☆10Jul 19, 2025Updated 7 months ago
- ☆12Jan 2, 2025Updated last year
- [ICCV 2025] Unbiased Region-Language Alignment for Open-Vocabulary Dense Prediction☆52Sep 22, 2025Updated 5 months ago
- [TGRS2025] Cross-Layer Feature Pyramid Transformer for Small Object Detection in Aerial Images☆73Aug 26, 2024Updated last year
- [CVPR25] Official implementation of `MobileMamba: Lightweight Multi-Receptive Visual Mamba Network.'☆355Mar 20, 2025Updated 11 months ago
- DAWN: Direction-aware Attention Wavelet Network for Image Deraining☆11Jan 7, 2024Updated 2 years ago
- HSViT: Horizontally Scalable Vision Transformer☆13Nov 6, 2024Updated last year
- Source code of our TNNLS paper "Boosting Convolutional Neural Networks with Middle Spectrum Grouped Convolution"☆12Apr 14, 2023Updated 2 years ago
- Try to export the ONNX QDQ model that conforms to the AXERA NPU quantization specification. Currently, only w8a8 is supported.☆11Sep 10, 2024Updated last year
- This repository presents a novel hybrid deep learning architecture that combines the strengths of both ResNet and Vision Transformer (ViT…☆11Sep 25, 2023Updated 2 years ago
- Official repository of paper titled "CAS-ViT: Convolutional Additive Self-attention Vision Transformers for Efficient Mobile Applications…☆87Jan 15, 2026Updated last month
- [ICCV - 2023] Official repository of paper SwiftFormer: Efficient Additive Attention for Transformer-based Real-time Mobile Vision Applic…☆311Jul 18, 2025Updated 7 months ago
- [ECCV 2024] Official repository of Agent Attention☆661Nov 17, 2024Updated last year
- [ICLR 2023] Selective Frequency Network for Image Restoration☆150Feb 5, 2025Updated last year
- This is Pytorch implementation of our paper "LF-ViT: Reducing Spatial Redundancy in Vision Transformer for Efficient Image Recognition".☆11Sep 23, 2024Updated last year
- Seeing the Unseen: A Frequency Prompt Guided Transformer for Image Restoration☆54Jul 13, 2025Updated 7 months ago
- 【ACM MM 2025】PyTorch code for our paper "Cross Paradigm Representation and Alignment Transformer for Image Deraining"☆70Dec 19, 2025Updated 2 months ago
- Cross Visual Prompt Tuning [ICCV 2025]☆13Aug 3, 2025Updated 7 months ago
- RepNeXt: A Fast Multi-Scale CNN using Structural Reparameterization☆48Oct 13, 2024Updated last year
- ☆12Jan 30, 2024Updated 2 years ago
- SEM-Net: Efficient Pixel Modelling for image inpainting with Spatially Enhanced SSM☆41Apr 23, 2025Updated 10 months ago
- [CVPR 2025 Highlight] SoMA: Singular Value Decomposed Minor Components Adaptation for Domain Generalizable Representation Learning☆61Jun 26, 2025Updated 8 months ago
- (CVPR2024)RMT: Retentive Networks Meet Vision Transformer☆382Jul 29, 2024Updated last year
- ☆16Mar 14, 2024Updated last year
- Fifty Years of SAR Automatic Target Recognition: The Road Forward (2025)☆44Nov 11, 2025Updated 3 months ago