matthijsvk / multimodalSR

Multimodal speech recognition using lipreading (with CNNs) and audio (using LSTMs). Sensor fusion is done with an attention network.
67 · Updated 2 years ago
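
The attention-based sensor fusion named in the description can be illustrated with a minimal sketch. This is not the repository's code: the function names, the single shared scoring vector, and the modality-level softmax are assumptions chosen for illustration. In the setup described, the two feature vectors would come from the lipreading CNN and the audio LSTM; here they are plain lists of floats.

```python
import math


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def attention_fuse(audio_feat, video_feat, score_weights):
    """Fuse two equal-length feature vectors with modality-level attention.

    Hypothetical helper: each modality gets a scalar score (dot product with
    score_weights), the scores are normalized with softmax, and the fused
    vector is the attention-weighted sum of the two inputs.
    """
    if not (len(audio_feat) == len(video_feat) == len(score_weights)):
        raise ValueError("all vectors must have the same length")

    # One relevance score per modality.
    scores = [
        sum(w * x for w, x in zip(score_weights, feat))
        for feat in (audio_feat, video_feat)
    ]
    alpha_audio, alpha_video = softmax(scores)

    # Weighted sum of the two modality features.
    fused = [
        alpha_audio * a + alpha_video * v
        for a, v in zip(audio_feat, video_feat)
    ]
    return fused, (alpha_audio, alpha_video)


if __name__ == "__main__":
    audio = [0.2, 0.9, 0.1]   # stand-in for an audio (LSTM) feature vector
    video = [0.7, 0.1, 0.4]   # stand-in for a lipreading (CNN) feature vector
    weights = [1.0, 0.5, -0.5]  # stand-in for learned scoring parameters
    fused, alphas = attention_fuse(audio, video, weights)
    print(fused, alphas)
```

In a trained model the scoring parameters would be learned, so the network could lean on the video stream when the audio is noisy and on the audio stream when the lips are occluded.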

Alternatives and similar repositories for multimodalSR:

Users who are interested in multimodalSR are comparing it to the libraries listed below.