NKU-MetautoAI / awesome-large-vision-language-models
Advances in recent large vision language models (LVLMs)
☆15 Updated 10 months ago
Alternatives and similar repositories for awesome-large-vision-language-models
Users who are interested in awesome-large-vision-language-models are comparing it to the repositories listed below.
- [BMVC 2024] PlainMamba: Improving Non-hierarchical Mamba in Visual Recognition ☆80 Updated 4 months ago
- Implementation of "VPUFormer: Visual Prompt Unified Transformer for Interactive Image Segmentation" ☆13 Updated 2 years ago
- ☆35 Updated last year
- Project Page for "Multi-Task Dense Prediction via Mixture of Low-Rank Experts" ☆82 Updated last month
- ☆36 Updated 2 years ago
- [CVPR 2023] Bridging Precision and Confidence: A Train-Time Loss for Calibrating Object Detection ☆31 Updated 2 years ago
- [ECCV 2024 Workshop Best Paper Award] Famba-V: Fast Vision Mamba with Cross-Layer Token Fusion ☆34 Updated 10 months ago
- CVPR2024: Dual Memory Networks: A Versatile Adaptation Approach for Vision-Language Models ☆80 Updated last year
- ☆30 Updated last year
- ECCV24 "ReMamber: Referring Image Segmentation with Mamba Twister" official repository. ☆41 Updated last year
- [CVPR'23] A Simple Framework for Text-Supervised Semantic Segmentation ☆60 Updated 6 months ago
- [ECCV2024] ClearCLIP: Decomposing CLIP Representations for Dense Vision-Language Inference ☆86 Updated 4 months ago
- [NeurIPS 2024] official code release for our paper "Revisiting the Integration of Convolution and Attention for Vision Backbone". ☆40 Updated 6 months ago
- ☆77 Updated 2 years ago
- Segment Anything with Deictic Prompting ☆27 Updated 2 months ago
- [ECCV 2024] Official project of CoDA: Instructive Chain-of-Domain Adaptation with Severity-Aware Visual Prompt Tuning ☆40 Updated last year
- ☆59 Updated last year
- [CVPR 2024] Depth-aware Test-Time Training for Zero-shot Video Object Segmentation ☆26 Updated 3 months ago
- Code of our CVPR2024 paper - DiffusionMTL: Learning Multi-Task Denoising Diffusion Model from Partially Annotated Data ☆58 Updated last year
- Official Pytorch implementation of "E2VPT: An Effective and Efficient Approach for Visual Prompt Tuning". (ICCV2023) ☆72 Updated last year
- [ECCV'24 Oral] Anytime Continual Learning for Open Vocabulary Classification ☆21 Updated 9 months ago
- [ECCV' 24] CLIFF: Continual Latent Diffusion for Open-Vocabulary Object Detection ☆26 Updated 10 months ago
- [ICCV25 Oral] Token Activation Map to Visually Explain Multimodal LLMs ☆51 Updated 2 weeks ago
- [ECCV 2024] Soft Prompt Generation for Domain Generalization ☆25 Updated 10 months ago
- [CVPR 2025 Highlight] Official code for paper "Mamba as a Bridge: Where Vision Foundation Models Meet Vision Language Models for Domain-G… ☆39 Updated 2 months ago
- Official implement of ICML2024 Cascade-CLIP: Cascaded Vision-Language Embeddings Alignment for Zero-Shot Semantic Segmentation ☆51 Updated 11 months ago
- (CVPR 2024) ECLIPSE: Efficient Continual Learning in Panoptic Segmentation with Visual Prompt Tuning ☆45 Updated 7 months ago
- ☆23 Updated 9 months ago
- This repo is the official pytorch implementation of the paper: CLIPer: Hierarchically Improving Spatial Representation of CLIP for Open-V… ☆34 Updated 7 months ago
- ☆21 Updated last week