scu-zjz / SparseViT
Official repository for the AAAI2025 paper (Can We Get Rid of Handcrafted Feature Extractors? SparseViT: Nonsemantics-Centered, Parameter-Efficient Image Manipulation Localization through Spare-Coding Transformer)
☆41Updated 4 months ago
Alternatives and similar repositories for SparseViT
Users who are interested in SparseViT are comparing it to the repositories listed below
- [AAAI 2025] SparX: A Sparse Cross-Layer Connection Mechanism for Hierarchical Vision Mamba and Transformer Networks☆32Updated last month
- [AAAI 2025] Official repository of paper “Mesoscopic Insights: Orchestrating Multi-scale & Hybrid Architecture for Image Manipulation Loc…☆40Updated 2 months ago
- Official repository of Polarity-aware Linear Attention for Vision Transformers (ICLR 2025)☆62Updated this week
- The official implementation of [CVPR 2025] "5%>100%: Breaking Performance Shackles of Full Fine-Tuning on Visual Recognition Tasks".☆271Updated last week
- [CVPR25] Official implementation of "MobileMamba: Lightweight Multi-Receptive Visual Mamba Network".☆198Updated last month
- [ICLR2025] Spatial-Mamba: Effective Visual State Space Models via Structure-Aware State Fusion☆136Updated 2 months ago
- ☆86Updated last year
- [KDD2025] Improving Synthetic Image Detection Towards Generalization: An Image Transformation Perspective☆65Updated last month
- [ICLR 2024 poster] Efficient Modulation for Vision Networks☆53Updated 10 months ago
- [NeurIPS2024] Multi-Scale VMamba: Hierarchy in Hierarchy Visual State Space Model☆69Updated 4 months ago
- The official repository of Real Text Manipulation (RTM)☆35Updated 2 months ago
- The official repository of the paper "Learning Correlation Structures for Vision Transformers" accepted to CVPR 2024.☆48Updated last year
- [ACCV 2024] Official code for "DeBiFormer: Vision Transformer with Deformable Agent Bi-level Routing Attention"☆29Updated 4 months ago
- ☆84Updated last year
- This is the official repository for "Multi-modal Gated Mixture of Local-to-Global Experts for Dynamic Image Fusion" (ICCV 2023).☆56Updated last year
- Vision Mamba: A Comprehensive Survey and Taxonomy☆91Updated 8 months ago
- ☆41Updated 6 months ago
- [CVPR 2025] SegMAN: Omni-scale Context Modeling with State Space Models and Local Attention for Semantic Segmentation☆78Updated 2 weeks ago
- An unofficial implementation for Detecting Camouflaged Object in Frequency Domain, CVPR 2022 in PyTorch.☆81Updated 2 years ago
- CVPR 2025: Frequency Dynamic Convolution for Dense Image Prediction☆103Updated 3 weeks ago
- The official code of "Rethinking Local Perception in Lightweight Vision Transformer"☆87Updated 2 years ago
- [CVPR 2024] LAKE-RED: Camouflaged Images Generation by Latent Background Knowledge Retrieval-Augmented Diffusion.☆46Updated 3 months ago
- FusionMamba: Dynamic Feature Enhancement for Multimodal Image Fusion with Mamba☆162Updated last month
- LSNet: See Large, Focus Small [CVPR 2025]☆105Updated last month
- ☆60Updated last year
- GroupMixAttention and GroupMixFormer☆116Updated last year
- Implementation for Enhancing Tampered Text Detection Through Frequency Feature Fusion and Decomposition☆12Updated 2 months ago
- [CVPR 2024] SHViT: Single-Head Vision Transformer with Memory Efficient Macro Design☆101Updated 11 months ago
- [CVPR 2024] The official pytorch implementation of "A General and Efficient Training for Transformer via Token Expansion".☆44Updated last year
- CVPR 2024 Highlight: Frequency-Adaptive Dilated Convolution for Semantic Segmentation☆140Updated 4 months ago