carykh / videoToVoice

takes in a sequence of lip images, and predicts the phonemes being said.
122Updated 11 months ago

Related projects

Alternatives and complementary repositories for videoToVoice