DragonLiu1995 / multimodal-llm-for-audio-genLinks

Code, Dataset, Samples for the NeurIPS paper “ Tell What You Hear From What You See -- Video to Audio Generation Through Text”
8Updated last week

Alternatives and similar repositories for multimodal-llm-for-audio-gen

Users that are interested in multimodal-llm-for-audio-gen are comparing it to the libraries listed below

Sorting: