Joovvhan / ECAPA-TDNN
Unofficial implementation of ECAPA-TDNN
☆30 Updated 4 years ago
Alternatives and similar repositories for ECAPA-TDNN
Users who are interested in ECAPA-TDNN are comparing it to the repositories listed below
- The code for the Interspeech paper "Speaker Embedding Extraction with Phonetic Information" ☆45 Updated 6 years ago
- ☆32 Updated 3 years ago
- Neural network-based forced alignment with bidirectional attention mechanism ☆77 Updated 7 months ago
- A simple package for Guided source separation (GSS) ☆128 Updated last year
- MANNER: Multi-view Attention Network for Noise ERasure (Speech enhancement in time-domain) ☆63 Updated 3 years ago
- ☆69 Updated 4 years ago
- ☆103 Updated 2 years ago
- The official PyTorch implementation of "Inter-SubNet: Speech Enhancement with Subband Interaction", accepted by ICASSP 2023. ☆96 Updated 2 years ago
- ☆36 Updated 4 years ago
- Fre-GAN: Adversarial Frequency-consistent Audio Synthesis ☆107 Updated 4 years ago
- Speaker verification using ResnetSE (EER=0.0093) and ECAPA-TDNN ☆95 Updated 4 years ago
- An implementation of Microsoft's "AdaSpeech: Adaptive Text to Speech for Custom Voice" ☆98 Updated 3 years ago
- LLaSE-G1: Incentivizing Generalization Capability for LLaMA-based Speech Enhancement ☆82 Updated 5 months ago
- A Python library for computing the Mel-Cepstral Distance (Mel-Cepstral Distortion, MCD) between two inputs. This implementation is based … ☆55 Updated 3 weeks ago
- INTERSPEECH 2023: "DPHuBERT: Joint Distillation and Pruning of Self-Supervised Speech Models" ☆114 Updated last year
- Linear Prediction Coefficients estimation from mel-spectrogram implemented in Python based on Levinson-Durbin algorithm. ☆71 Updated 4 years ago
- ☆86 Updated 4 months ago
- CHiME-7/8 diarization champion system: neural speaker diarization using memory-aware multi-speaker embedding with sequence-to-sequence ar… ☆78 Updated 2 months ago
- A PyTorch implementation of MBNet: MOS Prediction for Synthesized Speech with Mean-Bias Network ☆60 Updated 3 years ago
- Speech Representation Disentanglement with Adversarial Mutual Information Learning for One-shot Voice Conversion (Interspeech 2022) ☆119 Updated last year
- HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis ☆44 Updated 4 years ago
- Reference-aware automatic speech evaluation toolkit ☆164 Updated 9 months ago
- This repository contains a set of codes to run (i.e., train, perform inference with, evaluate) a diarization method called EEND-vector-cl… ☆76 Updated 2 years ago
- A toolkit for any-to-any encoder-decoder voice conversion systems ☆84 Updated 2 years ago
- Production first, nn-based on-device signal processing toolkit. ☆64 Updated 2 years ago
- The official repo: "McNet: Fuse Multiple Cues for Multichannel Speech Enhancement", ICASSP 2023 ☆120 Updated 2 years ago
- Includes Basis-MelGAN, MelGAN, HifiGAN and Multiband-HifiGAN, maybe NHV in the future. ☆156 Updated 4 years ago
- Multi-Task Audio Source Separation, Two-Stage Model, Complex Domain. ☆94 Updated 2 years ago
- Baseline Recipe for VoicePrivacy Challenge 2024: anonymization systems and evaluation software ☆57 Updated 7 months ago
- ☆62 Updated 2 years ago