SliMM-X / CoMP-MM
Official repository of "CoMP: Continual Multimodal Pre-training for Vision Foundation Models"
☆24Updated 3 weeks ago
Alternatives and similar repositories for CoMP-MM:
Users that are interested in CoMP-MM are comparing it to the libraries listed below
- [NeurIPS-24] This is the official implementation of the paper "DeepStack: Deeply Stacking Visual Tokens is Surprisingly Simple and Effect…☆35Updated 10 months ago
- This is the official repo for ByteVideoLLM/Dynamic-VLM☆20Updated 4 months ago
- The official repository for paper "PruneVid: Visual Token Pruning for Efficient Video Large Language Models".☆36Updated 2 months ago
- [ECCV 2024] ControlCap: Controllable Region-level Captioning☆74Updated 6 months ago
- Official implementation of TagAlign☆34Updated 4 months ago
- [ICLR 2025] IDA-VLM: Towards Movie Understanding via ID-Aware Large Vision-Language Model