Sindhu-Hegde / pseudo-visual-speech-denoising
Official code for the paper "Visual Speech Enhancement Without A Real Visual Stream" published at WACV 2021
☆103Updated 5 months ago
Related projects
Alternatives and complementary repositories for pseudo-visual-speech-denoising
- This is the official implementation of the paper AGAIN-VC: A One-shot Voice Conversion using Activation Guidance and Adaptive Instance No…☆111Updated 3 years ago
- Mirror of the VoxCeleb dataset - a large-scale speaker identification dataset☆68Updated 5 years ago
- Disentangled Speech Embeddings using Cross-Modal Self-Supervision☆154Updated 4 years ago
- Face Landmark-based Speaker-Independent Audio-Visual Speech Enhancement in Multi-Talker Environments☆106Updated 8 months ago
- Official implementation of VQMIVC: One-shot (any-to-any) Voice Conversion @ Interspeech 2021 + Online playing demo!☆340Updated 2 years ago
- Phoneme Boundary Detection using Learnable Segmental Features (ICASSP 2020)☆79Updated 3 years ago
- Implementation of "Duration Informed Attention Network for Multimodal Synthesis" paper in PyTorch.☆182Updated 4 years ago
- Deep-Learning-Based Audio-Visual Speech Enhancement and Separation☆203Updated last year
- Tools for downloading VoxCeleb2 dataset☆26Updated 8 months ago
- PyTorch Implementation of Meta-StyleSpeech : Multi-Speaker Adaptive Text-to-Speech Generation☆190Updated 2 years ago
- Audio-Visual Speech Separation with Cross-Modal Consistency☆221Updated last year
- This is the implementation of our Interspeech 2021 paper: Limited data emotional voice conversion leveraging text-to-speech: two-stage se…☆81Updated last year
- AVSpeech downloader☆66Updated 5 years ago
- Unofficial PyTorch Implementation of WaveGrad2☆111Updated 3 years ago
- Any-to-any voice conversion by end-to-end extracting and fusing fine-grained voice fragments with attention☆199Updated 3 years ago
- Code for the Active Speakers in Context Paper (CVPR2020)☆53Updated 3 years ago
- This is the implementation of our Interspeech 2020 paper "Converting anyone's emotion: towards speaker-independent emotional voice conver…☆87Updated 4 years ago
- Implementation of Kaneko et al.'s MaskCycleGAN-VC model for non-parallel voice conversion.☆110Updated 3 years ago
- Includes some core functions and models for handling speech separation☆154Updated 3 years ago
- PyTorch implementation of "Lip to Speech Synthesis with Visual Context Attentional GAN" (NeurIPS2021)☆22Updated 8 months ago
- Audio-Visual Speech Recognition using Sequence to Sequence Models☆81Updated 4 years ago
- Official implementation of SpeechSplit2☆128Updated 2 years ago
- Self-Supervised Contrastive Learning for Unsupervised Phoneme Segmentation (INTERSPEECH 2020)☆137Updated 2 years ago
- A PyTorch implementation of StarGAN-VC2☆146Updated 4 years ago
- A PyTorch implementation of Lip2Wav☆49Updated 2 years ago
- Emotional Speech Conversion using Style Transfer and MUNIT☆33Updated 5 years ago
- Autoregressive Predictive Coding: An unsupervised autoregressive model for speech representation learning☆184Updated 4 years ago
- Official Implementation of Visual Transformer Pooling for Lip reading☆36Updated 2 years ago