cg1177 / VideoLLM
VideoLLM: Modeling Video Sequence with Large Language Models
☆158 · Updated 2 years ago
Alternatives and similar repositories for VideoLLM
Users who are interested in VideoLLM are comparing it to the repositories listed below.
- PG-Video-LLaVA: Pixel Grounding in Large Multimodal Video Models ☆259 · Updated 2 months ago
- [ECCV2024] Official code implementation of Merlin: Empowering Multimodal LLMs with Foresight Minds ☆94 · Updated last year
- ☆188 · Updated last year
- [NeurIPS 2023 D&B] VidChapters-7M: Video Chapters at Scale ☆198 · Updated last year
- The official repository of "Video assistant towards large language model makes everything easy"