SuchismitaSahu1993 / Lipreading-Using-Mutimodal-Speech-Recognition
Multimodal Speech Recognition for phoneme level prediction using Audio-Visual data from TCDTIMIT dataset implementing RNNs with LSTMs for the audio subnetwork and CNN-LSTMs for the video subnetwork.
☆14Updated last year
Alternatives and similar repositories for Lipreading-Using-Mutimodal-Speech-Recognition
Users that are interested in Lipreading-Using-Mutimodal-Speech-Recognition are comparing it to the libraries listed below
Sorting: