KimManjin / StructViT
The official repository of the paper "Learning Correlation Structures for Vision Transformers" accepted to CVPR 2024.
☆47Updated 9 months ago
Alternatives and similar repositories for StructViT:
Users that are interested in StructViT are comparing it to the libraries listed below
- Vision Mamba: A Comprehensive Survey and Taxonomy☆81Updated 4 months ago
- [NeurIPS2024] Multi-Scale VMamba: Hierarchy in Hierarchy Visual State Space Model☆58Updated 3 weeks ago
- ☆83Updated last year
- Spatial-Mamba: Effective Visual State Space Models via Structure-Aware State Fusion☆65Updated 2 months ago
- ☆132Updated 6 months ago
- Official implementation of paper titled "GroupMamba: Parameter-Efficient and Accurate Group Visual State Space Model"☆64Updated 5 months ago
- [ICLR 2024 poster] Efficient Modulation for Vision Networks☆50Updated 6 months ago
- ☆52Updated 10 months ago
- ☆38Updated 2 weeks ago
- Vision Mamba 2: More Efficient Visual Representation Learning with State Space Duality☆27Updated 7 months ago
- The official code of "Rethinking Local Perception in Lightweight Vision Transformer"☆85Updated last year
- [NeurIPS2024 Spotlight] The official implementation of GrootVL: Tree Topology is All You Need in State Space Model☆89Updated 7 months ago
- ☆52Updated last year
- PEM: Prototype-based Efficient MaskFormer for Image Segmentation☆83Updated last month
- SCSA: Exploring the Synergistic Effects Between Spatial and Channel Attention.☆59Updated 6 months ago
- CVPR 2024 Highlight: Frequency-Adaptive Dilated Convolution for Semantic Segmentation☆125Updated 3 weeks ago
- Official implementation of SPANet in ICCV2023☆23Updated 7 months ago
- Code for paper LocalMamba: Visual State Space Model with Windowed Selective Scan☆222Updated 8 months ago
- Project Page for "Multi-Task Dense Prediction via Mixture of Low-Rank Experts"☆60Updated 3 weeks ago
- GroupMixAttention and GroupMixFormer☆114Updated last year
- [ECCV 2024] Context-Guided Spatial Feature Reconstruction for Efficient Semantic Segmentation☆77Updated 3 months ago
- This repository is the official implementation of CamoDiffusion: Camouflaged Object Detection via Conditional Diffusion Models.☆55Updated last year
- [CVPR'23] A Simple Framework for Text-Supervised Semantic Segmentation☆58Updated last year
- Code release for "VSCode: General Visual Salient and Camouflaged Object Detection with 2D Prompt Learning"☆38Updated 6 months ago
- ☆16Updated 2 months ago
- The official implementation for ALOFT (CVPR 2023).☆52Updated last year
- This is the offical repository for "Multi-modal Gated Mixture of Local-to-Global Experts for Dynamic Image Fusion" (ICCV 2023).☆48Updated 8 months ago
- Scattering Vision Transformer☆50Updated 10 months ago
- (ECCV 2024) Open-Vocabulary Camouflaged Object Segmentation☆21Updated 2 months ago
- (ICML 2024) Spider: A Unified Framework for Context-dependent Concept Segmentation☆54Updated 3 months ago