astorfi / lip-reading-deeplearning
Lip Reading - Cross Audio-Visual Recognition using 3D Architectures
☆1,867Updated 2 years ago
Alternatives and similar repositories for lip-reading-deeplearning:
Users who are interested in lip-reading-deeplearning are comparing it to the libraries listed below:
- Deep Learning & 3D Convolutional Neural Networks for Speaker Verification☆784Updated 5 years ago
- Joint 3D Face Reconstruction and Dense Alignment with Position Map Regression Network (ECCV 2018)☆4,996Updated 2 years ago
- SpeechPy - A Library for Speech Processing and Recognition: http://speechpy.readthedocs.io/en/latest/☆884Updated 4 months ago
- Keras implementation of 'LipNet: End-to-End Sentence-level Lipreading'☆659Updated 2 years ago
- End-to-end Automatic Speech Recognition for Mandarin and English in Tensorflow☆2,842Updated 2 years ago
- The PyTorch improved version of TPAMI 2017 paper: Face Alignment in Full Pose Range: A 3D Total Solution.☆3,647Updated 2 years ago
- Facebook AI Research's Automatic Speech Recognition Toolkit☆6,421Updated 5 months ago
- A TensorFlow Implementation of Tacotron: A Fully End-to-End Text-To-Speech Synthesis Model☆1,831Updated 3 years ago
- Real-time face detection and emotion/gender classification using fer2013/imdb datasets with a keras CNN model and openCV.☆5,660Updated last year
- Speech-to-Text-WaveNet : End-to-end sentence level English speech recognition based on DeepMind's WaveNet and tensorflow☆3,980Updated 3 years ago
- Pytorch implementation of our method for high-resolution (e.g. 2048x1024) photorealistic video-to-video translation.☆8,663Updated 2 years ago
- Code and models for evaluating a state-of-the-art lip reading network☆196Updated 2 years ago
- A real-time approach for mapping all human pixels of 2D RGB images to a 3D surface-based model of the body☆7,040Updated 2 years ago
- Automated lip reading from real-time videos, using TensorFlow in Python☆162Updated 7 years ago
- Pytorch-based tools for visualizing and understanding the neurons of a GAN. https://gandissect.csail.mit.edu/☆1,770Updated 3 years ago
- This is the library for the Unbounded Interleaved-State Recurrent Neural Network (UIS-RNN) algorithm, corresponding to the paper Fully Su…☆1,571Updated 7 months ago
- A TensorFlow implementation of Google's Tacotron speech synthesis with pre-trained model (unofficial)☆2,975Updated last year
- A denoising autoencoder + adversarial losses and attention mechanisms for face swapping.☆3,412Updated 3 years ago
- PyTorch implementation of convolutional neural networks-based text-to-speech synthesis models☆1,976Updated last year
- Progressive Growing of GANs for Improved Quality, Stability, and Variation☆6,133Updated 3 years ago
- A method to generate speech across multiple speakers☆873Updated 6 years ago
- Interactive Image Generation via Generative Adversarial Networks☆3,994Updated 4 years ago
- Attempt at tracking states of the arts and recent results (bibliography) on speech recognition.☆1,864Updated 2 years ago
- WaveNet vocoder☆2,356Updated last year
- A Flow-based Generative Network for Speech Synthesis☆2,328Updated last year
- A powerful and intuitive WYSIWYG interface that allows anyone to create Machine Learning models!☆1,863Updated 2 years ago
- demo code for lip reading☆21Updated 8 years ago
- 🏖 Keras Implementation of Painting outside the box☆1,144Updated 2 years ago
- Code for Talking Face Generation by Adversarially Disentangled Audio-Visual Representation (AAAI 2019)☆818Updated 3 years ago
- DeepMind's Tacotron-2 Tensorflow implementation☆2,310Updated last year