[ICML 2024] Vision Mamba: Efficient Visual Representation Learning with Bidirectional State Space Model
☆3,805Feb 13, 2025Updated last year
Alternatives and similar repositories for Vim
Users that are interested in Vim are comparing it to the libraries listed below
Sorting:
- VMamba: Visual State Space Models,code is based on mamba☆3,054Mar 7, 2025Updated 11 months ago
- Mamba SSM architecture☆17,257Feb 18, 2026Updated 2 weeks ago
- [CVPR 2025] Official PyTorch Implementation of MambaVision: A Hybrid Mamba-Transformer Vision Backbone☆2,034Feb 9, 2026Updated 3 weeks ago
- Implementation of Vision Mamba from the paper: "Vision Mamba: Efficient Visual Representation Learning with Bidirectional State Space Mod…☆482Feb 10, 2026Updated 3 weeks ago
- [ECCV2024] VideoMamba: State Space Model for Efficient Video Understanding☆1,082Jul 6, 2024Updated last year
- Code for paper LocalMamba: Visual State Space Model with Windowed Selective Scan☆274May 6, 2024Updated last year
- U-Mamba: Enhancing Long-range Dependency for Biomedical Image Segmentation☆955Apr 4, 2024Updated last year
- Awesome Papers related to Mamba.☆1,390Oct 17, 2024Updated last year
- Simple, minimal implementation of the Mamba SSM in one file of PyTorch.☆2,921Mar 8, 2024Updated last year
- (ACM TOMM) This is the official code repository for "VM-UNet: Vision Mamba UNet for Medical Image Segmentation".☆799Sep 3, 2025Updated 6 months ago
- [Official Repo] Visual Mamba: A Survey and New Outlooks☆731Feb 18, 2025Updated last year
- Code Implementation of EfficientVMamba☆243Apr 16, 2024Updated last year
- [ICLR 2025 Spotlight] Vision-RWKV: Efficient and Scalable Visual Perception with RWKV-Like Architectures☆541Feb 18, 2025Updated last year
- [Mamba-Survey-2024] Paper list for State-Space-Model/Mamba and it's Applications☆745Jun 28, 2025Updated 8 months ago
- SegMamba: Long-range Sequential Modeling Mamba For 3D Medical Image Segmentation☆546Jun 28, 2025Updated 8 months ago
- Official PyTorch Implementation of "Scalable Diffusion Models with Transformers"☆8,382May 31, 2024Updated last year
- [ECCV2024, CVPR2025] MambaIR and MambaIRv2!☆1,029Apr 15, 2025Updated 10 months ago
- MambaOut: Do We Really Need Mamba for Vision? (CVPR 2025)☆2,656Mar 9, 2025Updated 11 months ago
- A simple and efficient Mamba implementation in pure PyTorch and MLX.☆1,433Jan 26, 2026Updated last month
- Causal depthwise conv1d in CUDA, with a PyTorch interface☆730Feb 18, 2026Updated 2 weeks ago
- [NeurIPS 2024 Best Paper Award][GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Mod…☆8,626Nov 10, 2025Updated 3 months ago
- Autoregressive Model Beats Diffusion: 🦙 Llama for Scalable Image Generation☆1,936Aug 15, 2024Updated last year
- Mamba-UNet Zoo☆781Sep 13, 2024Updated last year
- [IEEE TCSVT] Vivim: a Video Vision Mamba for Medical Video Segmentation☆185Jun 12, 2025Updated 8 months ago
- Simba☆218Mar 24, 2024Updated last year
- ✨✨Latest Papers on Vision Mamba and Related Areas☆382Apr 17, 2025Updated 10 months ago
- Efficient vision foundation models for high-resolution generation and perception.☆3,249Sep 5, 2025Updated 5 months ago
- Swin-UMamba: Mamba-based UNet with ImageNet-based pretraining☆372Mar 19, 2024Updated last year
- The suite of modeling video with Mamba☆290May 14, 2024Updated last year
- PyTorch code and models for the DINOv2 self-supervised learning method.☆12,427Feb 24, 2026Updated last week
- This is an official implementation for "Swin Transformer: Hierarchical Vision Transformer using Shifted Windows".☆15,721Jul 24, 2024Updated last year
- This is the official code repository for "MedMamba: Vision Mamba for Medical Image Classification"☆580Sep 10, 2024Updated last year
- The largest collection of PyTorch image encoders / backbones. Including train, eval, inference, export scripts, and pretrained weights --…☆36,420Updated this week
- A PyTorch implementation of the paper "ZigMa: A DiT-Style Mamba-based Diffusion Model" (ECCV 2024)☆343Mar 17, 2025Updated 11 months ago
- [NeurIPS 2024] Official repository of MLLA☆371Jul 11, 2025Updated 7 months ago
- ☆4,577Sep 14, 2025Updated 5 months ago
- [CVPR 2024] Depth Anything: Unleashing the Power of Large-Scale Unlabeled Data. Foundation Model for Monocular Depth Estimation☆8,006Jul 17, 2024Updated last year
- VisionLLaMA: A Unified LLaMA Backbone for Vision Tasks☆391Jul 9, 2024Updated last year
- [NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.☆24,478Aug 12, 2024Updated last year