PKU-YuanGroup / Chat-UniViLinks
[CVPR 2024 Highlightπ₯] Chat-UniVi: Unified Visual Representation Empowers Large Language Models with Image and Video Understanding
β943Updated 9 months ago
Alternatives and similar repositories for Chat-UniVi
Users that are interested in Chat-UniVi are comparing it to the libraries listed below
Sorting:
- β411Updated last year
- LLaVA-Mini is a unified large multimodal model (LMM) that can support the understanding of images, high-resolution images, and videos in β¦β517Updated last month
- Official PyTorch implementation of "Multi-modal Queried Object Detection in the Wild" (accepted by NeurIPS 2023)β341Updated last year
- [CVPR 2024 Highlight] MIGC and [TPAMI 2024] MIGC++ (Official Implementation)β606Updated 2 months ago
- LLaMA-VID: An Image is Worth 2 Tokens in Large Language Models (ECCV 2024)β829Updated last year
- [ACL 2024] GroundingGPT: Language-Enhanced Multi-modal Grounding Modelβ333Updated 9 months ago
- [CVPR 2024] MovieChat: From Dense Token to Sparse Memory for Long Video Understandingβ636Updated 6 months ago
- A curated list of research based on CLIP.