nguyentthong / video-language-understanding
[ACL’24 Findings] Video-Language Understanding: A Survey from Model Architecture, Model Training, and Data Perspectives
☆37Updated 5 months ago
Alternatives and similar repositories for video-language-understanding:
Users that are interested in video-language-understanding are comparing it to the libraries listed below
- Official implementation of HawkEye: Training Video-Text LLMs for Grounding Text in Videos☆37Updated 9 months ago
- [CVPR 2022] A large-scale public benchmark dataset for video question-answering, especially about evidence and commonsense reasoning. The…☆53Updated 7 months ago
- Can I Trust Your Answer? Visually Grounded Video Question Answering (CVPR'24, Highlight)☆63Updated 7 months ago
- Official repository for the A-OKVQA dataset