LeeYongHyeok / DCM_vgg_transformer

Dual cross-modality attention audio-visual speech recognition model based on a VGG Transformer with a hybrid CTC/attention architecture, implemented with fairseq
12 · Updated 4 years ago

Related projects: