ms-dot-k / AVSR
PyTorch implementation of "Watch or Listen: Robust Audio-Visual Speech Recognition with Visual Corruption Modeling and Reliability Scoring" (CVPR 2023) and "Visual Context-driven Audio Feature Enhancement for Robust End-to-End Audio-Visual Speech Recognition" (Interspeech 2022)
☆18 Updated last year
Alternatives and similar repositories for AVSR
Users who are interested in AVSR are comparing it to the repositories listed below.