yeruoforever / Awesome-Mamba
Awsome works based on SSM and Mamba
☆14Updated 5 months ago
Related projects: ⓘ
- [CVPR'24] Multimodal Pathway: Improve Transformers with Irrelevant Data from Other Modalities☆85Updated 6 months ago
- Implementation of "VL-Mamba: Exploring State Space Models for Multimodal Learning"☆75Updated 6 months ago
- Implementation of ViTaR: ViTAR: Vision Transformer with Any Resolution in PyTorch☆22Updated last week
- ☆104Updated 3 months ago
- [CVPR 2024] The official pytorch implementation of "A General and Efficient Training for Transformer via Token Expansion".☆36Updated 4 months ago
- ☆41Updated 5 months ago
- ☆12Updated 2 months ago
- ☆24Updated last month
- ☆49Updated 11 months ago
- ☆40Updated 3 months ago
- The official implementation of "Adapter is All You Need for Tuning Visual Tasks".☆67Updated 3 weeks ago
- This repository contains the pytorch code for our ISBI 2024 paper "ConvLoRA and AdaBN Based Domain Adaptation via Self-Training".☆45Updated 6 months ago
- Repo for the paper `ControlMLLM: Training-Free Visual Prompt Learning for Multimodal Large Language Models'☆44Updated 3 weeks ago
- Visual self-questioning for large vision-language assistant.☆22Updated 3 weeks ago
- This repository is the official implementation of our Autoregressive Pretraining with Mamba in Vision☆53Updated 3 months ago
- Introduce Mamba2 to Vision.☆70Updated 3 weeks ago
- [CVPR 2024] This is the official implementation of "ExtDM: Distribution Extrapolation Diffusion Model for Video Prediction"☆25Updated 2 months ago
- Second Generation of the MAMBA Software☆27Updated last year
- CLIP-Mamba: CLIP Pretrained Mamba Models with OOD and Hessian Evaluation☆56Updated last month
- Official Pytorch Implementation of Paper "A Semantic Space is Worth 256 Language Descriptions: Make Stronger Segmentation Models with Des…☆47Updated 2 months ago
- ☆16Updated this week
- Public code repo for paper "A Single Transformer for Scalable Vision-Language Modeling"☆103Updated last month
- [CVPR 2024] FairCLIP: Harnessing Fairness in Vision-Language Learning☆46Updated last month
- Simple PyTorch implementation of "Libra: Building Decoupled Vision System on Large Language Models" (accepted by ICML 2024)☆41Updated 3 months ago
- ☆54Updated 2 months ago
- Official implementation of TagAlign☆31Updated 5 months ago
- The official implementation of GrootVL: Tree Topology is All You Need in State Space Model☆58Updated 3 months ago
- ☆34Updated 11 months ago
- ☆19Updated 11 months ago
- [NeurIPS2023] Parameter-efficient Tuning of Large-scale Multimodal Foundation Model☆80Updated 9 months ago