matthijsvk / multimodalSR

Multimodal speech recognition using lipreading (with CNNs) and audio (using LSTMs). Sensor fusion is done with an attention network.
66Updated last year

Related projects

Alternatives and complementary repositories for multimodalSR