KimManjin / StructViT
The official repository of the paper "Learning Correlation Structures for Vision Transformers" accepted to CVPR 2024.
☆47Updated 10 months ago
Alternatives and similar repositories for StructViT:
Users that are interested in StructViT are comparing it to the libraries listed below
- [NeurIPS2024] Multi-Scale VMamba: Hierarchy in Hierarchy Visual State Space Model☆58Updated last month
- Vision Mamba: A Comprehensive Survey and Taxonomy☆83Updated 5 months ago
- [ICLR2025] Spatial-Mamba: Effective Visual State Space Models via Structure-Aware State Fusion☆76Updated 3 months ago
- ☆132Updated 7 months ago
- ☆83Updated last year
- Official implementation of paper titled "GroupMamba: Parameter-Efficient and Accurate Group Visual State Space Model"☆65Updated this week
- [ICLR 2024 poster] Efficient Modulation for Vision Networks☆51Updated 7 months ago
- PEM: Prototype-based Efficient MaskFormer for Image Segmentation☆85Updated 2 months ago
- Scattering Vision Transformer☆50Updated 11 months ago
- Vision Mamba: Efficient Visual Representation Learning with Bidirectional State Space Model☆19Updated 11 months ago
- The official code of "Rethinking Local Perception in Lightweight Vision Transformer"☆85Updated last year
- ECCV24 "ReMamber: Referring Image Segmentation with Mamba Twister" official repository.☆33Updated 7 months ago
- [NeurIPS2024 Spotlight] The official implementation of GrootVL: Tree Topology is All You Need in State Space Model☆89Updated 8 months ago
- ☆40Updated last month
- ☆135Updated 5 months ago
- ☆55Updated 11 months ago
- Vision Mamba 2: More Efficient Visual Representation Learning with State Space Duality☆27Updated 8 months ago
- CVPR 2024 Highlight: Frequency-Adaptive Dilated Convolution for Semantic Segmentation☆129Updated last month
- [ECCV 2024] Context-Guided Spatial Feature Reconstruction for Efficient Semantic Segmentation☆78Updated 4 months ago
- GroupMixAttention and GroupMixFormer☆115Updated last year
- Hunting Attributes: Context Prototype-Aware Learning for Weakly Supervised Semantic Segmentation (CVPR 2024)☆43Updated 3 months ago
- [CVPR 2024] SHViT: Single-Head Vision Transformer with Memory Efficient Macro Design☆87Updated 8 months ago
- TransXNet: Learning Both Global and Local Dynamics with a Dual Dynamic Token Mixer for Visual Recognition☆170Updated last year
- [ICCV-2023] The official code of Bridging Vision and Language Encoders: Parameter-Efficient Tuning for Referring Image Segmentation☆104Updated last month
- Code Implementation of EfficientVMamba☆194Updated 10 months ago
- The official pytorch implementation of "SCSA: Exploring the Synergistic Effects Between Spatial and Channel Attention".☆60Updated 7 months ago
- Pan-Mamba: Effective Pan-Sharpening with State Space Model☆97Updated 11 months ago
- ☆57Updated 6 months ago
- (CVPR2023/TPAMI2024) Integrally Pre-Trained Transformer Pyramid Networks -- A Hierarchical Vision Transformer for Masked Image Modeling☆185Updated 6 months ago
- ☆20Updated 5 months ago