ms-dot-k / TMT

TMT: Tri-Modal Translation between Speech, Image, and Text by Processing Different Modalities as Different Languages
14Updated 6 months ago

Related projects

Alternatives and complementary repositories for TMT