dynamic-superb / multimodal-llamaLinks

The official implementation of ImageBind-LLM and Whisper-LLM from the paper "Dynamic-SUPERB: Towards A Dynamic, Collaborative, and Comprehensive Instruction-Tuning Benchmark for Speech".
21Updated last year

Alternatives and similar repositories for multimodal-llama

Users that are interested in multimodal-llama are comparing it to the libraries listed below

Sorting: