atosystem / SpeechCLIP

SpeechCLIP: Integrating Speech with Pre-Trained Vision and Language Model, Accepted to IEEE SLT 2022
109Updated last year

Related projects

Alternatives and complementary repositories for SpeechCLIP