Joovvhan / ECAPA-TDNN
Unofficial implementation of ECAPA-TDNN
☆27Updated 3 years ago
Related projects:
- ☆31Updated 3 years ago
- [ICASSP2021] Data preparation scripts, training pipeline and baseline experiment results for the Interspeech 2020 Accented English Speech…☆53Updated 3 years ago
- ☆70Updated last month
- Official implementation of "PhonMatchNet: Phoneme-Guided Zero-Shot Keyword Spotting for User-Defined Keywords" (INTERSPEECH 2023)☆35Updated 3 months ago
- A PyTorch implementation of the paper "ANSD-MA-MSE: Adaptive Neural Speaker Diarization Using Memory-Aware Multi-Speaker Embedding"☆42Updated this week
- This repository contains a set of codes to run (i.e., train, perform inference with, evaluate) a diarization method called EEND-vector-cl…☆71Updated last year
- ☆32Updated 2 years ago
- The code for the Interspeech paper "Speaker Embedding Extraction with Phonetic Information"☆43Updated 5 years ago
- ☆69Updated 3 years ago
- The baseline system for the ICASSP2024 ICMC-ASR Challenge.☆42Updated 9 months ago
- Y-vector: Multiscale Waveform Encoder for Speaker Embedding☆24Updated 2 months ago
- [Interspeech 2020] "Improving the Speaker Identity of Non-Parallel Many-to-Many Voice Conversion with Adversarial Speaker Recognition" by …☆39Updated last year
- A PyTorch implementation of MBNet: MOS Prediction for Synthesized Speech with Mean-Bias Network☆60Updated 2 years ago
- Linear Prediction Coefficients estimation from mel-spectrogram implemented in Python based on Levinson-Durbin algorithm.☆67Updated 3 years ago
- Computes the Mel-Cepstral Distance of two WAV files based on the paper "Mel-Cepstral Distance Measure for Objective Speech Quality Assess…☆46Updated 7 months ago
- ☆29Updated 2 years ago
- MultiSV: scripts for data preparation☆24Updated 3 months ago
- MANNER: Multi-view Attention Network for Noise ERasure (Speech enhancement in time-domain)☆57Updated 2 years ago
- PyTorch Implementation of Generalized End-to-End Loss for Speaker Verification☆28Updated 3 years ago
- A simple package for Guided source separation (GSS)☆104Updated 4 months ago
- Adapting Self-Supervised Models to Multi-Talker Speech Recognition Using Speaker Embeddings☆26Updated last year
- SpEx+(tied) source code☆72Updated last year
- Python package for combining diarization system outputs.☆73Updated 11 months ago
- Uformer: A Unet based dilated complex & real dual-path conformer network for simultaneous speech enhancement and dereverberation☆94Updated 2 years ago
- HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis☆37Updated 3 years ago
- Production first, nn-based on-device signal processing toolkit.☆63Updated last year
- Materials accompanying the paper "Phonological features for 0-shot multilingual speech synthesis"☆31Updated 4 years ago
- Boosting Self-Supervised Embeddings for Speech Enhancement☆42Updated 2 years ago
- Non-intrusive Objective Speech Quality Assessment (NISQA) Challenge in Online Conferencing Applications☆40Updated 2 years ago
- Discriminative Training of VBx Diarization☆17Updated 7 months ago