astorfi / lip-reading-deeplearning
Lip Reading - Cross Audio-Visual Recognition using 3D Architectures
☆1,833Updated 2 years ago
Related projects ⓘ
Alternatives and complementary repositories for lip-reading-deeplearning
- Keras implementation of 'LipNet: End-to-End Sentence-level Lipreading'☆643Updated last year
- Speech-to-Text-WaveNet : End-to-end sentence level English speech recognition based on DeepMind's WaveNet and tensorflow☆3,954Updated 3 years ago
- Deep Learning & 3D Convolutional Neural Networks for Speaker Verification☆782Updated 4 years ago
- SpeechPy - A Library for Speech Processing and Recognition: http://speechpy.readthedocs.io/en/latest/☆880Updated 3 years ago
- PyTorch implementation of convolutional neural networks-based text-to-speech synthesis models☆1,969Updated 11 months ago
- Real-time face detection and emotion/gender classification using fer2013/imdb datasets with a keras CNN model and openCV.☆5,607Updated 8 months ago
- A Flow-based Generative Network for Speech Synthesis☆2,288Updated last year
- A method to generate speech across multiple speakers☆872Updated 5 years ago
- Deep neural networks for voice conversion (voice style transfer) in Tensorflow☆3,923Updated 2 years ago
- End-to-end Automatic Speech Recognition for Madarian and English in Tensorflow☆2,845Updated last year
- A TensorFlow Implementation of Tacotron: A Fully End-to-End Text-To-Speech Synthesis Model☆1,828Updated 2 years ago
- A TensorFlow implementation of Google's Tacotron speech synthesis with pre-trained model (unofficial)☆2,957Updated last year
- Code and models for evaluating a state-of-the-art lip reading network☆189Updated last year
- DeepMind's Tacotron-2 Tensorflow implementation☆2,276Updated last year
- RNN-based generative models for speech.☆611Updated 7 years ago
- StarGAN - Official PyTorch Implementation (CVPR 2018)☆5,230Updated 3 years ago
- A TensorFlow Implementation of DC-TTS: yet another text-to-speech model☆1,158Updated last year
- 2D and 3D Face alignment library build using pytorch☆7,093Updated 2 months ago
- Face detection, tracking and clustering in videos☆443Updated 7 months ago
- A general-purpose encoder-decoder framework for Tensorflow☆5,605Updated 4 years ago
- This is the library for the Unbounded Interleaved-State Recurrent Neural Network (UIS-RNN) algorithm, corresponding to the paper Fully Su…☆1,560Updated last month
- Tensorflow Implementation of Deep Voice 3☆453Updated 6 years ago
- Automatically "block" people in images (like Black Mirror) using a pretrained neural network.☆2,020Updated 2 years ago
- The Open Images dataset☆4,265Updated 3 years ago
- Audio samples accompanying publications related to Tacotron, an end-to-end speech synthesis model.☆535Updated 2 weeks ago
- WaveNet vocoder☆2,327Updated last year
- Toolkit for efficient experimentation with Speech Recognition, Text2Speech and NLP☆1,550Updated 3 years ago
- An implementation of iPhone X's FaceID using face embeddings and siamese networks on RGBD images.☆907Updated 4 years ago
- 🎙Speech recognition using the tensorflow deep learning framework, sequence-to-sequence neural networks☆2,166Updated 10 months ago