GenjiB / LAVISH

Vision Transformers are Parameter-Efficient Audio-Visual Learners
85Updated last year

Related projects

Alternatives and complementary repositories for LAVISH