VideoLLM: Modeling Video Sequence with Large Language Models
☆158Aug 18, 2023Updated 2 years ago
Alternatives and similar repositories for VideoLLM
Users that are interested in VideoLLM are comparing it to the libraries listed below
Sorting:
- Official code for the paper, "TaCA: Upgrading Your Visual Foundation Model with Task-agnostic Compatible Adapter".☆16Jun 20, 2023Updated 2 years ago
- [NeurIPS 2022] Egocentric Video-Language Pretraining☆254May 9, 2024Updated last year
- [ACL 2024 🔥] Video-ChatGPT is a video conversation model capable of generating meaningful conversation about videos. It combines the cap…☆1,492Aug 5, 2025Updated 6 months ago
- COLA: Evaluate how well your vision-language model can Compose Objects Localized with Attributes!☆25Nov 23, 2024Updated last year
- team Doggeee's solution to Ego4D LTA challenge@CVPRW23'☆13Nov 4, 2023Updated 2 years ago
- [CVPR 2024] Data and benchmark code for the EgoExoLearn dataset☆80Aug 26, 2025Updated 6 months ago
- Code and datasets for "Text encoders are performance bottlenecks in contrastive vision-language models". Coming soon!☆11May 24, 2023Updated 2 years ago
- BasicTAD: an Astounding RGB-Only Baselinefor Temporal Action Detection