NKU-MetautoAI / awesome-large-vision-language-modelsLinks
Advances in recent large vision language models (LVLMs)
☆15Updated 11 months ago
Alternatives and similar repositories for awesome-large-vision-language-models
Users that are interested in awesome-large-vision-language-models are comparing it to the libraries listed below
Sorting:
- [ECCV 2024 Workshop Best Paper Award] Famba-V: Fast Vision Mamba with Cross-Layer Token Fusion☆34Updated 11 months ago
- CVPR2024: Dual Memory Networks: A Versatile Adaptation Approach for Vision-Language Models☆83Updated last year
- ☆35Updated last year
- ECCV24 "ReMamber: Referring Image Segmentation with Mamba Twister" official repository.☆41Updated last year
- [CVPR'23] A Simple Framework for Text-Supervised Semantic Segmentation☆60Updated 7 months ago
- Project Page for "Multi-Task Dense Prediction via Mixture of Low-Rank Experts"☆82Updated 3 months ago
- [NeurIPS 2024] official code release for our paper "Revisiting the Integration of Convolution and Attention for Vision Backbone".☆41Updated 7 months ago
- ☆30Updated last year
- [BMVC 2024] PlainMamba: Improving Non-hierarchical Mamba in Visual Recognition☆80Updated 5 months ago
- Implementation of ''VPUFormer: Visual Prompt Unified Transformer for Interactive Image Segmentation''☆13Updated last month
- [ECCV2024] ClearCLIP: Decomposing CLIP Representations for Dense Vision-Language Inference☆89Updated 5 months ago
- Official code for CVPR 2024 paper, "Audio-Visual Segmentation via Unlabeled Frame Exploitation""☆14Updated last year
- Adapters Strike Back (CVPR 2024)☆38Updated last year
- ☆36Updated 2 years ago
- cliptrase☆46Updated last year
- [CVPR 2024] Open-Set Domain Adaptation for Semantic Segmentation☆45Updated last year
- CAD - Memory Efficient Convolutional Adapter for Segment Anything☆12Updated 11 months ago
- [ECCV' 24] CLIFF: Continual Latent Diffusion for Open-Vocabulary Object Detection☆27Updated 11 months ago
- [ECCV 2024] Soft Prompt Generation for Domain Generalization☆26Updated 11 months ago
- ☆17Updated 10 months ago
- (CVPR 2024) ECLIPSE: Efficient Continual Learning in Panoptic Segmentation with Visual Prompt Tuning☆45Updated 9 months ago
- Official implement of ICML2024 Cascade-CLIP: Cascaded Vision-Language Embeddings Alignment for Zero-Shot Semantic Segmentation☆52Updated last year
- This repo is the official pytorch implementation of the paper: CLIPer: Hierarchically Improving Spatial Representation of CLIP for Open-V…☆36Updated this week
- This is the official code of "Uncovering Prototypical Knowledge for Weakly Open-Vocabulary Semantic Segmentation, NeurIPS 23"☆26Updated last year
- [NeurIPS2024 Spotlight] The official implementation of MambaTree: Tree Topology is All You Need in State Space Model☆101Updated last year
- [ICLR 2024] ViDA: Homeostatic Visual Domain Adapter for Continual Test Time Adaptation☆68Updated last year
- Official Pytorch implementation of "E2VPT: An Effective and Efficient Approach for Visual Prompt Tuning". (ICCV2023)☆71Updated last year
- [NIPS24] Official Implementation of Unsupervised Modality Adaptation with Text-to-Image Diffusion Models for Semantic Segmentation☆19Updated 10 months ago
- [CVPR 2024] The official pytorch implementation of "A General and Efficient Training for Transformer via Token Expansion".☆45Updated last year
- ☆33Updated 11 months ago