matthijsvk / multimodalSR

Multimodal speech recognition using lipreading (with CNNs) and audio (using LSTMs). Sensor fusion is done with an attention network.
66 · Updated 2 years ago
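
As a rough illustration of what attention-based fusion of two modalities can look like, the sketch below weights a visual feature vector and an audio feature vector by softmax-normalized relevance scores and sums them. This is not taken from the multimodalSR code: the function names, the `score_weights` vector, and the choice of one scalar score per modality are all assumptions made for the example. In a trained model the features would come from the CNN and LSTM branches and the scoring parameters would be learned.

```python
import math


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def attention_fuse(visual, audio, score_weights):
    """Fuse two equal-length feature vectors with one attention weight each.

    score_weights is a hypothetical scoring vector; a real network would
    learn it (or replace it with a small scoring sub-network).
    """
    if not (len(visual) == len(audio) == len(score_weights)):
        raise ValueError("all vectors must have the same length")
    modalities = [visual, audio]
    # One scalar relevance score per modality: dot product with the scoring vector.
    scores = [sum(w * x for w, x in zip(score_weights, m)) for m in modalities]
    # Normalize the scores so the two attention weights sum to 1.
    weights = softmax(scores)
    # Weighted sum of the modality features, element by element.
    fused = [
        sum(a * m[i] for a, m in zip(weights, modalities))
        for i in range(len(visual))
    ]
    return fused, weights


if __name__ == "__main__":
    # Toy features standing in for the lipreading and audio branch outputs.
    fused, weights = attention_fuse([1.0, 0.0], [0.0, 1.0], [1.0, 0.0])
    print("attention weights:", weights)
    print("fused features:", fused)
```

With a scoring vector that favors the first feature dimension, the visual vector in this toy input receives the larger weight, so the fused output leans toward it. A zero scoring vector gives both modalities equal weight.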

Related projects

Alternatives and complementary repositories for multimodalSR