umbertocappellazzo / Llama-AVSR

Official PyTorch implementation of "Large Language Models are Strong Audio-Visual Speech Recognition Learners" [ICASSP 2025] and "Mitigating Attention Sinks and Massive Activations in Audio-Visual Speech Recognition with LLMs".
48 · Updated this week

Alternatives and similar repositories for Llama-AVSR

Users who are interested in Llama-AVSR are comparing it to the repositories listed below.
