xinghaochen / SLAB
[ICML 2024] Official PyTorch implementation of "SLAB: Efficient Transformers with Simplified Linear Attention and Progressive Re-parameterized Batch Normalization"
☆97Updated 6 months ago
Alternatives and similar repositories for SLAB:
Users that are interested in SLAB are comparing it to the libraries listed below
- ☆83Updated last year
- ☆59Updated last year
- [ICLR2025] Spatial-Mamba: Effective Visual State Space Models via Structure-Aware State Fusion☆111Updated 2 weeks ago
- [ICLR 2024 poster] Efficient Modulation for Vision Networks☆51Updated 8 months ago
- Code Implementation of EfficientVMamba☆203Updated 11 months ago
- [CVPR 2024] SHViT: Single-Head Vision Transformer with Memory Efficient Macro Design☆94Updated 9 months ago
- [BMVC 2024] PlainMamba: Improving Non-hierarchical Mamba in Visual Recognition☆74Updated 7 months ago
- [ICCV2023] This is an official implementation for "Scale-Aware Modulation Meet Transformer".☆198Updated last year
- Vision Mamba 2: More Efficient Visual Representation Learning with State Space Duality☆27Updated 9 months ago
- Official repository of Slide-Transformer (CVPR2023)☆167Updated 6 months ago
- Vision Mamba: A Comprehensive Survey and Taxonomy☆84Updated 6 months ago
- Code for paper LocalMamba: Visual State Space Model with Windowed Selective Scan☆235Updated 10 months ago
- An efficient pytorch implementation of selective scan in one file, works with both cpu and gpu, with corresponding mathematical derivatio…☆80Updated last year
- [CVPR25] Official implementation of `MobileMamba: Lightweight Multi-Receptive Visual Mamba Network.'☆140Updated 2 weeks ago
- The official code of "Rethinking Local Perception in Lightweight Vision Transformer"☆85Updated last year
- [NeurIPS2024] Multi-Scale VMamba: Hierarchy in Hierarchy Visual State Space Model☆67Updated 2 months ago
- Official repository of MLLA (NeurIPS 2024)☆286Updated 3 months ago
- RepNeXt: A Fast Multi-Scale CNN using Structural Reparameterization☆36Updated 5 months ago
- Official repository for the AAAI2025 paper ( Can We Get Rid of Handcrafted Feature Extractors? SparseViT: Nonsemantics-Centered, Paramete…☆32Updated 2 months ago
- PyTorch code and checkpoints release for VanillaKD: https://arxiv.org/abs/2305.15781☆75Updated last year
- GroupMixAttention and GroupMixFormer☆115Updated last year
- ☆170Updated 2 months ago
- ☆25Updated 2 weeks ago
- [CVPR 2024] Rewrite the Stars☆358Updated 10 months ago
- ☆63Updated 2 years ago
- [CVPR'24] Official implementation of paper "FreeKD: Knowledge Distillation via Semantic Frequency Prompt".☆39Updated 10 months ago
- ☆136Updated 8 months ago
- The official project website of "KernelWarehouse: Rethinking the Design of Dynamic Convolution" (KW for short, published in ICML 2024)☆97Updated 9 months ago
- [ICLR 2025] Official PyTorch implementation of "DECO: Query-Based End-to-End Object Detection with ConvNets"☆46Updated last month