GenjiB / LAVISH

Vision Transformers are Parameter-Efficient Audio-Visual Learners
89Updated last year

Related projects

Alternatives and complementary repositories for LAVISH