changil / facevoice
Learning associations between human faces and voices
☆12 Updated 5 years ago
Related projects
Alternatives and complementary repositories for facevoice
- CN-Celeb, a large-scale Chinese celebrities dataset published by Center for Speech and Language Technology (CSLT) at Tsinghua University. ☆71 Updated 5 years ago
- This repository is an extension of GAN based speech enhancement called SEGAN, and we present two modifications to make model training mor… ☆37 Updated last year
- Tensorflow 2 implementation of Speech Separation Methods ☆24 Updated 4 years ago
- Develop speaker recognition model based on i-vector using TIMIT database ☆16 Updated 5 years ago
- CycleGAN-VC2: Improved CycleGAN-based Non-parallel Voice Conversion ☆41 Updated 4 years ago
- Development Toolkit for the VoxCeleb Speaker Recognition Challenge 2020 ☆42 Updated 4 years ago
- Robust Speech Activity Detection (SAD) in movie audio ☆25 Updated 3 years ago
- This Repository includes four different implementations of the Speaker Verification task including the GMM_UBM, Ivector, Deep-Speaker, an… ☆32 Updated 6 years ago
- Coordinate-wise meta-learner for speaker adaptation of ASR models. ☆20 Updated 4 years ago
- MTGAN: Speaker Verification through Multitasking Triplet Generative Adversarial Networks ☆19 Updated 4 years ago
- PyTorch implementation for Deep Griffin-Lim Iteration paper (https://arxiv.org/abs/1903.03971) ☆37 Updated 5 years ago
- processing and extracting of face and mouth image files out of the TCDTIMIT database ☆44 Updated 4 years ago
- Gaussian Mixture VAE Tacotron ☆53 Updated last year
- about Speech enhancement ☆33 Updated 6 years ago
- speaker recognition using keras ☆36 Updated last year
- Region proposal network based small-footprint keyword spotting (Pytorch) ☆52 Updated last year
- py-webrtcvad wrapper for trimming speech clips ☆47 Updated 2 years ago
- AVSpeech downloader ☆66 Updated 5 years ago
- Pytorch implementation of "Group Latent Embedding for Vector Quantized Variational Autoencoder in Non-Parallel Voice Conversion" [Intersp… ☆28 Updated 5 years ago
- an Audio-Visual Voice Activity Detection using Deep Learning ☆48 Updated 5 years ago
- SE-Resnet+AMSoftmax for Speaker Verification ☆48 Updated 6 years ago
- Tensorflow implementation of "Speaker-independent Speech Separation with Deep Attractor Network" ☆89 Updated 3 years ago
- Supplementary information and code for INTERSPEECH 2018 paper: Singing voice phoneme segmentation by hierarchically inferring syllable an… ☆46 Updated 6 years ago
- Seeing Wake Words: Audio-visual Keyword Spotting ☆62 Updated 4 years ago
- Tacotron2 with Global Style Tokens ☆63 Updated 5 years ago