Daria8976 / MMADView on GitHub
We propose MMAD, a novel automated pipeline for precise AD generation. MMAD introduces ambient music alongside visual and linguistic, enhancing the model's multimodal representation learning through modality encoders and alignment.
16Dec 31, 2024Updated last year

Alternatives and similar repositories for MMAD

Users that are interested in MMAD are comparing it to the libraries listed below

Sorting:

Are these results useful?