DAMO-NLP-SG / Video-LLaMA

[EMNLP 2023 Demo] Video-LLaMA: An Instruction-tuned Audio-Visual Language Model for Video Understanding
2,883Updated 7 months ago

Alternatives and similar repositories for Video-LLaMA:

Users that are interested in Video-LLaMA are comparing it to the libraries listed below