ms-dot-k / AVSR

PyTorch implementation of "Watch or Listen: Robust Audio-Visual Speech Recognition with Visual Corruption Modeling and Reliability Scoring" (CVPR 2023) and "Visual Context-driven Audio Feature Enhancement for Robust End-to-End Audio-Visual Speech Recognition" (Interspeech 2022).
14 · Updated 10 months ago

Alternatives and similar repositories for AVSR:

Users who are interested in AVSR are comparing it to the libraries listed below.