umbertocappellazzo / Llama-AVSR

[ICASSP 2025] Official Pytorch implementation of "Large Language Models are Strong Audio-Visual Speech Recognition Learners".
14Updated 2 weeks ago

Alternatives and similar repositories for Llama-AVSR:

Users that are interested in Llama-AVSR are comparing it to the libraries listed below