BAI-Yeqi / SF2F_PyTorch
☆16Updated 2 months ago
Alternatives and similar repositories for SF2F_PyTorch
Users who are interested in SF2F_PyTorch are comparing it to the repositories listed below.
- Disentangled Speech Embeddings using Cross-Modal Self-Supervision☆160Updated 5 years ago
- Audio-Visual Speech Separation with Cross-Modal Consistency☆232Updated last year
- A PyTorch implementation of the Deep Audio-Visual Speech Recognition paper.☆234Updated last year
- Deep-Learning-Based Audio-Visual Speech Enhancement and Separation☆210Updated 2 years ago
- A ResNet Speaker Recognition & Verification Demo☆26Updated 3 years ago
- Our submission to the ASVspoof 2019: Automatic Speaker Verification Spoofing and Countermeasures Challenge☆99Updated 5 years ago
- JHU's system submission to the ASVspoof 2019 Challenge: Anti-Spoofing with Squeeze-Excitation and Residual neTworks (ASSERT).☆57Updated 2 years ago
- AVSpeech downloader☆67Updated 6 years ago
- ☆23Updated 6 years ago
- PyTorch code for End-to-End Audiovisual Speech Recognition☆177Updated 2 years ago
- ☆477Updated 4 years ago
- This repository includes the code to reproduce our paper "End-to-End Spectro-Temporal Graph Attention Networks for Speaker Verification A…☆90Updated last year
- Official implementation of VQMIVC: One-shot (any-to-any) Voice Conversion @ Interspeech 2021 + Online playing demo!☆351Updated 3 years ago
- Implementation of "Duration Informed Attention Network for Multimodal Synthesis" paper in PyTorch.☆183Updated 4 years ago
- Speech Representation Disentanglement with Adversarial Mutual Information Learning for One-shot Voice Conversion (Interspeech 2022)☆116Updated last year
- This is the GitHub page for publicly available emotional speech data.☆357Updated 3 years ago
- Implementation of the paper: Replay and Synthetic Speech Detection with Res2Net architecture (ICASSP 2021) https://arxiv.org/abs/2010.150…☆82Updated 3 years ago
- Audio-Visual Speech Recognition using Sequence to Sequence Models☆82Updated 5 years ago
- Mel cepstral distortion (MCD) computations in python.☆224Updated 8 years ago
- 3-D Convolutional Recurrent Neural Networks With Attention Model for Speech Emotion Recognition.☆40Updated 4 years ago
- PyTorch implementation of Densely Connected Time Delay Neural Network☆88Updated 2 years ago
- This repository includes the code to reproduce our paper "Automatic speaker verification spoofing and deepfake detection using wav2vec 2.…☆133Updated last year
- Baseline system for CNVSRC2023 (Chinese Continuous Visual Speech Recognition Challenge 2023)☆22Updated last year
- Implementation of the paper: Channel-wise Gated Res2Net: Towards Robust Detection of Synthetic Speech Attacks (INTERSPEECH 2021)☆31Updated 3 years ago
- ☆169Updated last year
- Self-Supervised Contrastive Learning for Unsupervised Phoneme Segmentation (INTERSPEECH 2020)☆141Updated 2 years ago
- PPG-Based Voice Conversion☆341Updated 2 years ago
- Deep speaker embeddings in PyTorch, including x-vectors. Code used in this work: https://arxiv.org/abs/2007.16196☆317Updated 4 years ago
- ☆12Updated 4 years ago
- PyTorch implementation of the model proposed in the paper: Double Multi-Head Attention for Speaker Verification☆20Updated 11 months ago