ms-dot-k / AVSRView on GitHub
PyTorch implementation of "Watch or Listen: Robust Audio-Visual Speech Recognition with Visual Corruption Modeling and Reliability Scoring" (CVPR2023) and "Visual Context-driven Audio Feature Enhancement for Robust End-to-End Audio-Visual Speech Recognition" (Interspeech 2022)
20Apr 3, 2024Updated last year

Alternatives and similar repositories for AVSR

Users that are interested in AVSR are comparing it to the libraries listed below

Sorting:

Are these results useful?