mugen-org / MUGEN_baseline

multimodal video-audio-text generation and retrieval between every pair of modalities on the MUGEN dataset. The repo. contains the training, evaluation and inference codes for these baselines.
38Updated last year

Related projects: