scu-zjz / SparseViT
Official repository for the AAAI2025 paper (Can We Get Rid of Handcrafted Feature Extractors? SparseViT: Nonsemantics-Centered, Parameter-Efficient Image Manipulation Localization through Spare-Coding Transformer)
☆35 · Updated 2 months ago
Alternatives and similar repositories for SparseViT:
Users who are interested in SparseViT are comparing it to the repositories listed below:
- The official implementation of "Adapter is All You Need for Tuning Visual Tasks". ☆100 · Updated 3 weeks ago
- [CVPR25] Official implementation of "MobileMamba: Lightweight Multi-Receptive Visual Mamba Network." ☆151 · Updated 2 weeks ago
- [ICLR 2024 poster] Efficient Modulation for Vision Networks ☆52 · Updated 9 months ago
- Official repository of Polarity-aware Linear Attention for Vision Transformers (ICLR 2025) ☆56 · Updated last month
- Vision Mamba: A Comprehensive Survey and Taxonomy ☆88 · Updated 7 months ago
- ☆83 · Updated last year
- [ICLR2025] Spatial-Mamba: Effective Visual State Space Models via Structure-Aware State Fusion ☆120 · Updated 3 weeks ago
- CVPR 2025: Frequency Dynamic Convolution for Dense Image Prediction ☆44 · Updated last week
- [NeurIPS2024] Multi-Scale VMamba: Hierarchy in Hierarchy Visual State Space Model ☆69 · Updated 3 months ago
- GroupMixAttention and GroupMixFormer ☆115 · Updated last year
- [BMVC 2024] PlainMamba: Improving Non-hierarchical Mamba in Visual Recognition ☆74 · Updated 7 months ago
- [AAAI 2025] SparX: A Sparse Cross-Layer Connection Mechanism for Hierarchical Vision Mamba and Transformer Networks ☆27 · Updated this week
- ☆59 · Updated last year
- [NeurIPS2024 Spotlight] The official implementation of MambaTree: Tree Topology is All You Need in State Space Model ☆92 · Updated 9 months ago
- Code Implementation of EfficientVMamba ☆203 · Updated 11 months ago
- [CVPR 2025] SegMAN: Omni-scale Context Modeling with State Space Models and Local Attention for Semantic Segmentation ☆49 · Updated this week
- [CVPR 2025] OverLoCK: An Overview-first-Look-Closely-next ConvNet with Context-Mixing Dynamic Kernels ☆23 · Updated last month
- [ICCV2023] This is an official implementation for "Scale-Aware Modulation Meet Transformer". ☆199 · Updated last year
- [CVPR'24] Official implementation of paper "FreeKD: Knowledge Distillation via Semantic Frequency Prompt". ☆41 · Updated 11 months ago
- CVPR 2024 Highlight: Frequency-Adaptive Dilated Convolution for Semantic Segmentation ☆138 · Updated 3 months ago
- (ICML 2024) Spider: A Unified Framework for Context-dependent Concept Segmentation ☆57 · Updated last week
- [ICML 2024] Official PyTorch implementation of "SLAB: Efficient Transformers with Simplified Linear Attention and Progressive Re-paramete… ☆100 · Updated 7 months ago
- GroupMamba: Parameter-Efficient and Accurate Group Visual State Space Model [CVPR 2025] ☆82 · Updated last week
- The official repository of the paper "Learning Correlation Structures for Vision Transformers" accepted to CVPR 2024. ☆48 · Updated last year
- Code for paper LocalMamba: Visual State Space Model with Windowed Selective Scan ☆237 · Updated 10 months ago
- Rotated Multi-Scale Interaction Network for Referring Remote Sensing Image Segmentation ☆114 · Updated last year
- ☆63 · Updated last month
- ☆129 · Updated 2 years ago
- ☆71 · Updated 7 months ago
- (CVPR2023/TPAMI2024) Integrally Pre-Trained Transformer Pyramid Networks -- A Hierarchical Vision Transformer for Masked Image Modeling ☆189 · Updated 8 months ago