AIoT-MLSys-Lab / Famba-V
[ECCV 2024 Workshop Best Paper Award] Famba-V: Fast Vision Mamba with Cross-Layer Token Fusion
☆34 Updated last year
Alternatives and similar repositories for Famba-V
Users who are interested in Famba-V are comparing it to the repositories listed below.
- ☆30 Updated last year
- [BMVC 2024] PlainMamba: Improving Non-hierarchical Mamba in Visual Recognition ☆87 Updated 9 months ago
- [NeurIPS2024 Spotlight] The official implementation of MambaTree: Tree Topology is All You Need in State Space Model ☆105 Updated last year
- Project Page for "Multi-Task Dense Prediction via Mixture of Low-Rank Experts" ☆86 Updated 7 months ago
- [ICLR2025] This repository is the official implementation of our Autoregressive Pretraining with Mamba in Vision ☆90 Updated 7 months ago
- Official implementation of CVPR 2024 paper "Multi-criteria Token Fusion with One-step-ahead Attention for Efficient Vision Transformers". ☆40 Updated 5 months ago
- [ICCV 2025] Unbiased Region-Language Alignment for Open-Vocabulary Dense Prediction ☆51 Updated 3 months ago
- [NeurIPS 2024] official code release for our paper "Revisiting the Integration of Convolution and Attention for Vision Backbone". ☆42 Updated 11 months ago
- Official implementation of "SViT: Revisiting Token Pruning for Object Detection and Instance Segmentation" ☆36 Updated 2 years ago
- ☆92 Updated last year
- ☆24 Updated last year
- [CVPR 2025 Highlight] Official code for paper "Mamba as a Bridge: Where Vision Foundation Models Meet Vision Language Models for Domain-G… ☆52 Updated 7 months ago
- ☆47 Updated 5 months ago
- Awesome video instance segmentation papers ☆50 Updated 3 weeks ago
- [AAAI 2025] Linear-complexity Visual Sequence Learning with Gated Linear Attention ☆115 Updated last year
- ☆27 Updated last year
- [ECCV2024] ClearCLIP: Decomposing CLIP Representations for Dense Vision-Language Inference ☆96 Updated 9 months ago
- Official repository of the paper "High-Quality Mask Tuning Matters for Open-Vocabulary Segmentation" ☆44 Updated 9 months ago
- Official implement of ICML2024 Cascade-CLIP: Cascaded Vision-Language Embeddings Alignment for Zero-Shot Semantic Segmentation ☆54 Updated last year
- ECCV24 "ReMamber: Referring Image Segmentation with Mamba Twister" official repository. ☆43 Updated last year
- [AAAI 2026 Oral] LENS: Learning to Segment Anything with Unified Reinforced Reasoning ☆96 Updated last month
- [IJCV 2024] ☆19 Updated last year
- [CVPR 2024] Depth-aware Test-Time Training for Zero-shot Video Object Segmentation ☆28 Updated 8 months ago
- [NIPS24] Official Implementation of Unsupervised Modality Adaptation with Text-to-Image Diffusion Models for Semantic Segmentation ☆20 Updated last year
- CAD - Memory Efficient Convolutional Adapter for Segment Anything ☆12 Updated last year
- [AAAI2025 selected as oral] - Multi-task Visual Grounding with Coarse-to-Fine Consistency Constraints ☆43 Updated 6 months ago
- ☆17 Updated 8 months ago
- The official implementation of "PixelThink: Towards Efficient Chain-of-Pixel Reasoning" (arXiv 2025) ☆39 Updated 7 months ago
- Advances in recent large vision language models (LVLMs) ☆15 Updated last year
- ☆79 Updated 10 months ago