scu-zjz / SparseViT
Official repository for the AAAI2025 paper (Can We Get Rid of Handcrafted Feature Extractors? SparseViT: Nonsemantics-Centered, Parameter-Efficient Image Manipulation Localization through Spare-Coding Transformer)
☆41Updated 3 months ago
Alternatives and similar repositories for SparseViT:
Users that are interested in SparseViT are comparing it to the libraries listed below
- The official implementation of "Adapter is All You Need for Tuning Visual Tasks".☆168Updated last week
- [AAAI 2025] SparX: A Sparse Cross-Layer Connection Mechanism for Hierarchical Vision Mamba and Transformer Networks☆31Updated 3 weeks ago
- [ICLR 2024 poster] Efficient Modulation for Vision Networks☆52Updated 9 months ago
- Vision Mamba: A Comprehensive Survey and Taxonomy☆90Updated 7 months ago
- [ICLR2025] Spatial-Mamba: Effective Visual State Space Models via Structure-Aware State Fusion☆129Updated last month
- The official repository of the paper "Learning Correlation Structures for Vision Transformers" accepted to CVPR 2024.☆48Updated last year
- [NeurIPS2024] Multi-Scale VMamba: Hierarchy in Hierarchy Visual State Space Model☆69Updated 4 months ago
- ☆75Updated last year
- [CVPR 25] Official Implementation (Pytorch) of "EfficientViM: Efficient Vision Mamba with Hidden State Mixer-based State Space Duality"☆52Updated this week
- Official repository of Polarity-aware Linear Attention for Vision Transformers (ICLR 2025)☆61Updated 2 months ago
- ☆83Updated last year
- [CVPR 2025] SegMAN: Omni-scale Context Modeling with State Space Models and Local Attention for Semantic Segmentation☆68Updated 3 weeks ago
- [BMVC 2024] PlainMamba: Improving Non-hierarchical Mamba in Visual Recognition☆76Updated 3 weeks ago
- [CVPR25] Official implementation of `MobileMamba: Lightweight Multi-Receptive Visual Mamba Network.'☆185Updated last month
- GroupMixAttention and GroupMixFormer☆115Updated last year
- ☆39Updated 5 months ago
- Rotated Multi-Scale Interaction Network for Referring Remote Sensing Image Segmentation☆115Updated last year
- ICLR2024 When Sementic Segmentation Meets Frequency Aliasing☆42Updated 10 months ago
- [ICCV2023] This is an official implementation for "Scale-Aware Modulation Meet Transformer".☆200Updated last year
- visualization:filter、feature map、attention map、image-mask、grad-cam、human keypoint、guided-backpro☆117Updated 2 years ago
- ☆85Updated 11 months ago
- ☆60Updated last year
- Official implementation of SPANet in ICCV2023☆23Updated 10 months ago
- [CVPR 2024] LAKE-RED: Camouflaged Images Generation by Latent Background Knowledge Retrieval-Augmented Diffusion.☆46Updated 3 months ago
- This is the offical repository for "Multi-modal Gated Mixture of Local-to-Global Experts for Dynamic Image Fusion" (ICCV 2023).☆52Updated 11 months ago
- [AAAI 2025] Official repository of paper “Mesoscopic Insights: Orchestrating Multi-scale & Hybrid Architecture for Image Manipulation Loc…☆34Updated last month
- Code for paper LocalMamba: Visual State Space Model with Windowed Selective Scan☆244Updated 11 months ago
- FusionMamba: Dynamic Feature Enhancement for Multimodal Image Fusion with Mamba☆157Updated last week
- The official code of "Rethinking Local Perception in Lightweight Vision Transformer"☆86Updated last year
- Official Pytorch implementations for "MetaSeg: MetaFormer-based Global Contexts-aware Network for Efficient Semantic Segmentation" (WACV …☆39Updated 7 months ago