astorfi / lip-reading-deeplearning
Lip Reading - Cross Audio-Visual Recognition using 3D Architectures
☆1,844Updated 2 years ago
Alternatives and similar repositories for lip-reading-deeplearning:
Users who are interested in lip-reading-deeplearning are comparing it to the repositories listed below
- Deep neural networks for voice conversion (voice style transfer) in Tensorflow☆3,929Updated 2 years ago
- SpeechPy - A Library for Speech Processing and Recognition: http://speechpy.readthedocs.io/en/latest/☆882Updated last month
- Speech-to-Text-WaveNet : End-to-end sentence level English speech recognition based on DeepMind's WaveNet and tensorflow☆3,965Updated 3 years ago
- End-to-end Automatic Speech Recognition for Mandarin and English in Tensorflow☆2,841Updated last year
- Keras implementation of 'LipNet: End-to-End Sentence-level Lipreading'☆651Updated 2 years ago
- Deep Learning & 3D Convolutional Neural Networks for Speaker Verification☆782Updated 4 years ago
- PyTorch implementation of convolutional neural networks-based text-to-speech synthesis models☆1,972Updated last year
- Use supervised learning to illuminate the latent space of GAN for controlled generation and edit☆1,972Updated 4 years ago
- Joint 3D Face Reconstruction and Dense Alignment with Position Map Regression Network (ECCV 2018)☆4,978Updated 2 years ago
- Tensorflow Implementation of Deep Voice 3☆453Updated 6 years ago
- A TensorFlow Implementation of Tacotron: A Fully End-to-End Text-To-Speech Synthesis Model☆1,827Updated 3 years ago
- speech to text benchmark framework☆625Updated last month
- Code for Talking Face Generation by Adversarially Disentangled Audio-Visual Representation (AAAI 2019)☆818Updated 3 years ago
- Code and models for evaluating a state-of-the-art lip reading network☆193Updated last year
- 🎙Speech recognition using the tensorflow deep learning framework, sequence-to-sequence neural networks☆2,163Updated last year
- The PyTorch improved version of TPAMI 2017 paper: Face Alignment in Full Pose Range: A 3D Total Solution.☆3,623Updated 2 years ago
- Facebook AI Research's Automatic Speech Recognition Toolkit☆6,401Updated last month
- A method to generate speech across multiple speakers☆872Updated 5 years ago
- Minimalist and powerful Web Crawler.☆879Updated 4 years ago
- Speech Recognition without audio input☆137Updated 6 years ago
- This is the library for the Unbounded Interleaved-State Recurrent Neural Network (UIS-RNN) algorithm, corresponding to the paper Fully Su…☆1,566Updated 3 months ago
- This is the code for "Neural Network Voices" by Siraj Raval on Youtube☆359Updated 6 years ago
- pix2pix demo that learns from facial landmarks and translates this into a face☆1,441Updated last year
- Automated Lip reading from real-time videos in tensorflow in python☆159Updated 6 years ago
- A denoising autoencoder + adversarial losses and attention mechanisms for face swapping.☆3,397Updated 2 years ago
- A Flow-based Generative Network for Speech Synthesis☆2,300Updated last year
- A TensorFlow implementation of Google's Tacotron speech synthesis with pre-trained model (unofficial)☆2,965Updated last year
- Tensorflow port of Image-to-Image Translation with Conditional Adversarial Nets https://phillipi.github.io/pix2pix/☆5,078Updated 3 years ago
- This is the repository containing codes for our CVPR, 2020 paper titled "Learning Individual Speaking Styles for Accurate Lip to Speech S…☆700Updated last year
- A TensorFlow Implementation of DC-TTS: yet another text-to-speech model☆1,159Updated last year