Joovvhan / ECAPA-TDNNLinks
Unofficial implementation of ECAPA-TDNN
☆29Updated 4 years ago
Alternatives and similar repositories for ECAPA-TDNN
Users that are interested in ECAPA-TDNN are comparing it to the libraries listed below
Sorting:
- The code for the Interspeech paper "Speaker Embedding Extraction with Phonetic Information"☆45Updated 6 years ago
- Neural network-based forced alignment with bidirectional attention mechanism☆77Updated 6 months ago
- ☆32Updated 2 years ago
- ☆69Updated 4 years ago
- ☆36Updated 4 years ago
- Fre-GAN: Adversarial Frequency-consistent Audio Synthesis☆106Updated 3 years ago
- Linear Prediction Coefficients estimation from mel-spectrogram implemented in Python based on Levinson-Durbin algorithm.☆71Updated 4 years ago
- The official PyTorch implementation of "Inter-SubNet: Speech Enhancement with Subband Interaction", accepted by ICASSP 2023.☆96Updated 2 years ago
- ☆61Updated 2 years ago
- Speech Representation Disentanglement with Adversarial Mutual Information Learning for One-shot Voice Conversion (Interspeech 2022)☆118Updated last year
- A pytorch implementation of MBNET: MOS PREDICTION FOR SYNTHESIZED SPEECH WITH MEAN-BIAS NETWORK☆60Updated 3 years ago
- The official source code of UniAudio☆95Updated last year
- Objective metrics used in several text-to-speech (TTS) papers.☆49Updated last month
- A simple package for Guided source separation (GSS)☆127Updated last year
- ☆35Updated 2 years ago
- [InterSpeech 2020] "Improving the Speaker Identity of Non-Parallel Many-to-Many VoiceConversion with Adversarial Speaker Recognition" by …☆39Updated 2 years ago
- Include Basis-MelGAN, MelGAN, HifiGAN and Multiband-HifiGAN, maybe NHV in the future.☆155Updated 4 years ago
- ☆98Updated 2 years ago
- HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis☆43Updated 4 years ago
- Materials accompanying the paper "Phonological features for 0-shot multilingual speech synthesis"☆33Updated 5 years ago
- This repo contains conv-tasnet for basis-melgan. If you want to get code of basis-melgan, please refer to FastVocoder.☆21Updated 4 years ago
- Reference-aware automatic speech evaluation toolkit☆159Updated 8 months ago
- MANNER: Multi-view Attention Network for Noise ERasure (Speech enhancement in time-domain)☆63Updated 2 years ago
- ☆30Updated 2 years ago
- Reproduction of paper: Disentangling Correlated Speaker and Noise for Speech Synthesis via Data Augmentation and Adversarial Factorizatio…☆17Updated 5 years ago
- Attention Backend for Aotumatic Speaker Verification with Multiple Enrollment Utterances☆50Updated 2 years ago
- This github repo is for Neurips 2021 and Interspeech 2022 papers on Non-Matching Reference based estimation of speech quality assessment.…☆102Updated 2 years ago
- ☆76Updated 3 years ago
- An implementation of Microsoft's "AdaSpeech: Adaptive Text to Speech for Custom Voice"☆98Updated 3 years ago
- A Python library for computing the Mel-Cepstral Distance (Mel-Cepstral Distortion, MCD) between two inputs. This implementation is based …☆55Updated 2 months ago