BAI-Yeqi / SF2F_PyTorchLinks
☆16Updated 9 months ago
Alternatives and similar repositories for SF2F_PyTorch
Users that are interested in SF2F_PyTorch are comparing it to the libraries listed below
Sorting:
- Disentangled Speech Embeddings using Cross-Modal Self-Supervision☆166Updated 5 years ago
- AVSpeech downloader☆68Updated 7 years ago
- Audio-Visual Speech Separation with Cross-Modal Consistency☆246Updated 2 years ago
- Our submission to the ASVspoof 2019: Automatic Speaker Verification Spoofing and Countermeasures Challenge☆102Updated 5 years ago
- Code for the Active Speakers in Context Paper (CVPR2020)☆56Updated 4 years ago
- Fully reproduce the paper of StarGAN-VC. Stable training and Better audio quality .☆247Updated last year
- A PyTorch implementation of the Deep Audio-Visual Speech Recognition paper.☆240Updated last year
- Pytorch code for End-to-End Audiovisual Speech Recognition☆183Updated 3 years ago
- Autoregressive Predictive Coding: An unsupervised autoregressive model for speech representation learning☆189Updated 6 years ago
- PPG-Based Voice Conversion☆347Updated 3 years ago
- Deep-Learning-Based Audio-Visual Speech Enhancement and Separation☆219Updated 2 years ago
- Official implementation of VQMIVC: One-shot (any-to-any) Voice Conversion @ Interspeech 2021 + Online playing demo!☆359Updated 3 years ago
- This repository includes the code to reproduce our paper "End-to-End Spectro-Temporal Graph Attention Networks for Speaker Verification A…☆92Updated 2 years ago
- ☆244Updated 6 years ago
- ☆482Updated 5 years ago
- Implementation of "Duration Informed Attention Network for Multimodal Synthesis" paper in PyTorch.☆184Updated 5 years ago
- JHU's system submission to the ASVspoof 2019 Challenge: Anti-Spoofing with Squeeze-Excitation and Residual neTworks (ASSERT).☆57Updated 3 years ago
- A ResNet Speaker Recognition&Verification Demo☆26Updated 4 years ago
- Deep speaker embeddings in PyTorch, including x-vectors. Code used in this work: https://arxiv.org/abs/2007.16196☆320Updated 5 years ago
- Implementation of the paper: Replay and Synthetic Speech Detection with Res2Net architecture (ICASSP 2021) https://arxiv.org/abs/2010.150…☆83Updated 4 years ago
- An attempt to replicate the results of [1706.08612] VoxCeleb: a large-scale speaker identification dataset☆12Updated 6 years ago
- Audio-Visual Speech Recognition using Sequence to Sequence Models☆83Updated 5 years ago
- processing and extracting of face and mouth image files out of the TCDTIMIT database☆46Updated 5 years ago
- Executable code based on Google articles☆166Updated 3 years ago
- ☆49Updated 6 years ago
- Tools for downloading VoxCeleb2 dataset☆33Updated last year
- The PyTorch Code and Model In "Learn an Effective Lip Reading Model without Pains", (https://arxiv.org/abs/2011.07557), which reaches the…☆165Updated 4 months ago
- Augmentation adversarial training for self-supervised speaker recognition☆78Updated 4 years ago
- Include some core functions and model to handle speech separation