JustinYuu / MM_Pyramid
[ACM MM 2022] MM_Pyramid: Multimodal Pyramid Attentional Network for Audio-Visual Event Localization and Video Parsing
☆13Updated 2 years ago
Alternatives and similar repositories for MM_Pyramid:
Users that are interested in MM_Pyramid are comparing it to the libraries listed below
- Vision Transformers are Parameter-Efficient Audio-Visual Learners