dynamic-superb / multimodal-llama

The official implementation of ImageBind-LLM and Whisper-LLM from the paper "Dynamic-SUPERB: Towards A Dynamic, Collaborative, and Comprehensive Instruction-Tuning Benchmark for Speech".
21 · Oct 30, 2023 · Updated 2 years ago

Alternatives and similar repositories for multimodal-llama

Users who are interested in multimodal-llama compare it to the libraries listed below.