BAI-Yeqi / SF2F_PyTorch
☆15 Updated last month
Alternatives and similar repositories for SF2F_PyTorch
Users who are interested in SF2F_PyTorch are comparing it to the libraries listed below.
- Disentangled Speech Embeddings using Cross-Modal Self-Supervision ☆160 Updated 5 years ago
- ☆18 Updated 4 years ago
- Our submission to the ASVspoof 2019: Automatic Speaker Verification Spoofing and Countermeasures Challenge ☆99 Updated 5 years ago
- Audio-Visual Speech Separation with Cross-Modal Consistency ☆231 Updated last year
- ☆23 Updated 5 years ago
- Implementation of the paper: Replay and Synthetic Speech Detection with Res2Net architecture (ICASSP 2021) https://arxiv.org/abs/2010.150… ☆82 Updated 3 years ago
- Pytorch implementation of Meta-Learning for Short Utterance Speaker Recognition with Imbalance Length Pairs (Interspeech, 2020) ☆74 Updated 4 years ago
- Tools for downloading VoxCeleb2 dataset ☆30 Updated last year
- A ResNet Speaker Recognition&Verification Demo ☆26 Updated 3 years ago
- This is the code for controllable EVC framework for seen and unseen emotion generation. ☆44 Updated 3 years ago
- JHU's system submission to the ASVspoof 2019 Challenge: Anti-Spoofing with Squeeze-Excitation and Residual neTworks (ASSERT). ☆57 Updated 2 years ago
- Deep-Learning-Based Audio-Visual Speech Enhancement and Separation ☆209 Updated 2 years ago
- AVSpeech downloader ☆67 Updated 6 years ago
- ☆17 Updated 6 months ago
- This repository includes the code to reproduce our paper "End-to-end anti-spoofing with RawNet2" (https://arxiv.org/abs/2011.01108) publi… ☆59 Updated last year
- Self-Supervised Contrastive Learning for Unsupervised Phoneme Segmentation (INTERSPEECH 2020) ☆141 Updated 2 years ago
- Speech Representation Disentanglement with Adversarial Mutual Information Learning for One-shot Voice Conversion (Interspeech 2022) ☆117 Updated last year
- ICASSP 2022: 'Self-supervised Speaker Recognition with Loss-gated Learning' ☆89 Updated 2 years ago
- Code for the Active Speakers in Context Paper (CVPR2020) ☆54 Updated 4 years ago
- ICASSP 2023: 'Speaker recognition with two-step multi-modal deep cleansing' ☆41 Updated 2 years ago
- Spoofing Speaker Verification Systems with Multi-speaker Text-to-speech Synthesis ☆11 Updated 2 years ago
- Implementation of "Duration Informed Attention Network for Multimodal Synthesis" paper in PyTorch. ☆183 Updated 4 years ago
- A summary of speech data augment algorithms ☆68 Updated 4 years ago
- This repository includes the code to reproduce our paper "RawBoost: A Raw Data Boosting and Augmentation Method applied to Automatic Spea… ☆61 Updated last year
- This is the implementation of the paper "Emotion Intensity and its Control for Emotional Voice Conversion". ☆91 Updated 3 years ago
- Phoneme Boundary Detection using Learnable Segmental Features (ICASSP 2020) ☆81 Updated 3 years ago
- Official repository for the paper VocaLiST: An Audio-Visual Synchronisation Model for Lips and Voices ☆66 Updated last year
- PyTorch implementation of Densely Connected Time Delay Neural Network ☆88 Updated 2 years ago
- Official PyTorch implementation of paper Leveraging Unimodal Self Supervised Learning for Multimodal Audio-Visual Speech Recognition (ACL… ☆65 Updated 2 years ago
- This is the implementation of the paper "Adversarial Attacks on Spoofing Countermeasures of automatic speaker verification". ☆43 Updated 2 years ago