caojiaolong / Awesome-Mamba
Collect papers about Mamba (a selective state space model).
☆13 Updated 3 months ago
Related projects
Alternatives and complementary repositories for Awesome-Mamba
- ☆22 Updated last year
- [EMNLP 2024] Official code for "Beyond Embeddings: The Promise of Visual Table in Multi-Modal Models" ☆14 Updated last month
- ☆18 Updated last month
- More dimensions = More fun ☆21 Updated 3 months ago
- Official repo of γ-MOD: Mixture-of-Depth Adaptation for Multimodal Large Language Models ☆18 Updated 3 weeks ago
- ☆41 Updated 7 months ago
- Official implementation of NeurIPS 2024 "Visual Fourier Prompt Tuning" ☆13 Updated last week
- ☆31 Updated last month
- This repository is the official implementation of our Autoregressive Pretraining with Mamba in Vision ☆64 Updated 5 months ago
- [NIPS2023] Implementation of Foundation Model is Efficient Multimodal Multitask Model Selector ☆34 Updated 8 months ago
- ☆19 Updated 3 months ago
- ☆27 Updated 2 weeks ago
- ☆20 Updated 7 months ago
- A PyTorch implementation of the paper "Revisiting Non-Autoregressive Transformers for Efficient Image Synthesis" ☆34 Updated 5 months ago
- ☆29 Updated 7 months ago
- ☆48 Updated 5 months ago
- PyTorch implementation for CVPR 2024 paper: Learn to Rectify the Bias of CLIP for Unsupervised Semantic Segmentation ☆23 Updated last month
- Official implementation of CVPR 2024 paper "Retrieval-Augmented Open-Vocabulary Object Detection". ☆31 Updated 2 months ago
- [ECCV2024] ProxyCLIP: Proxy Attention Improves CLIP for Open-Vocabulary Segmentation ☆56 Updated 2 months ago
- ☆52 Updated last year
- Code for ICML 2023 "Learning Dynamic Query Combinations for Transformer-based Object Detection and Segmentation" ☆35 Updated last year
- The official implementation of "Adapter is All You Need for Tuning Visual Tasks". ☆72 Updated 2 months ago
- Generative Multi-modal Models are Good Class Incremental Learners, CVPR 2024 [PyTorch Code] ☆35 Updated this week
- Adapting LLaMA Decoder to Vision Transformer ☆27 Updated 6 months ago
- A simpler PyTorch + Zeta implementation of the paper: "SiMBA: Simplified Mamba-based Architecture for Vision and Multivariate Time series… ☆27 Updated last week
- The official implementation of the paper "MMFuser: Multimodal Multi-Layer Feature Fuser for Fine-Grained Vision-Language Understanding". … ☆39 Updated 2 weeks ago
- The efficient tuning method for VLMs ☆76 Updated 8 months ago
- Learning 1D Causal Visual Representation with De-focus Attention Networks ☆30 Updated 5 months ago
- ☆33 Updated 4 months ago
- Meta-Prompting for Automating Zero-shot Visual Recognition with LLMs (ECCV 2024) ☆15 Updated 4 months ago