mbzuai-oryx / Video-ChatGPTLinks
[ACL 2024 🔥] Video-ChatGPT is a video conversation model capable of generating meaningful conversation about videos. It combines the capabilities of LLMs with a pretrained visual encoder adapted for spatiotemporal video representation. We also introduce a rigorous 'Quantitative Evaluation Benchmarking' for video-based conversational models.
☆1,396Updated 3 months ago
Alternatives and similar repositories for Video-ChatGPT
Users that are interested in Video-ChatGPT are comparing it to the libraries listed below
Sorting:
- VideoLLaMA 2: Advancing Spatial-Temporal Modeling and Audio Understanding in Video-LLMs