Y-vector: Multiscale Waveform Encoder for Speaker Embedding
☆23Jul 16, 2024Updated last year
Alternatives and similar repositories for Y-vector
Users that are interested in Y-vector are comparing it to the libraries listed below
Sorting:
- ☆22Jun 30, 2021Updated 4 years ago
- Please visit: https://thuhcsi.github.io/icassp2021-emotion-tts/☆34Mar 17, 2023Updated 2 years ago
- acnn for text-independent speaker recognition☆10Feb 8, 2022Updated 4 years ago
- MMM 2021: Crossed-Time Delay Neural Network for Speaker Recognition☆11Dec 4, 2021Updated 4 years ago
- ☆10Apr 17, 2024Updated last year
- Code for the paper: "Leveraging speaker attribute information using multi task learning for speaker verification and diarization" present…☆26Oct 5, 2022Updated 3 years ago
- ☆26Aug 8, 2024Updated last year
- Evaluation of STT models for german language☆15Jan 22, 2022Updated 4 years ago
- Speech recognition module for Python, supporting several engines and APIs, online and offline.☆13Mar 9, 2022Updated 3 years ago
- ☆13Jan 14, 2025Updated last year
- Collection of self-supervised models for speaker and language recognition tasks.☆19Jan 18, 2022Updated 4 years ago
- Audio Research in US. US-based professors who work on audio (music, speech, acoustics). For students who would like to apply for RA, PhD,…☆27Updated this week
- An High-resolution implementation of HiFi-GAN Vocoder for Voice Conversion.☆32Apr 10, 2023Updated 2 years ago
- ☆157Jan 9, 2023Updated 3 years ago
- a repository for trainabale tts multi speaker☆14Nov 28, 2021Updated 4 years ago
- A robust pitch tracker using synchro-squeezed fft and frequency domain autocorrelation☆36Jan 17, 2024Updated 2 years ago
- ☆14Jun 12, 2015Updated 10 years ago
- ☆14Aug 19, 2024Updated last year
- Speaker verification using ResnetSE (EER=0.0093) and ECAPA-TDNN☆97Sep 15, 2021Updated 4 years ago
- Official Repository for "Training-Free Multi-Step Audio Source Separation"☆54May 26, 2025Updated 9 months ago
- The Multi-band Excited WaveNet☆15Feb 2, 2023Updated 3 years ago
- [INTERSPEECH 2024] Official pytorch code for the paper "Disentangled Representation Learning for Environment-agnostic Speaker Recognition…☆18Jul 23, 2024Updated last year
- ☆15Aug 25, 2022Updated 3 years ago
- PyTorch implementation of simplified neural source filter model (s-nsf)☆14Aug 4, 2021Updated 4 years ago
- phone inventory library☆17May 15, 2023Updated 2 years ago
- Open source cross-platform implementation of MRCP protocol☆20Mar 3, 2022Updated 4 years ago
- Simple inference for Vits2 TTS Using ONNXRUNTIME and espeak-ng on C++☆18Apr 17, 2024Updated last year
- ☆16Dec 23, 2021Updated 4 years ago
- WarpRNNT loss ported in Numba CPU/CUDA for Pytorch☆17Mar 11, 2022Updated 3 years ago
- AsoSoft Speech Corpus for Central-Kurdish Text-To-Speech☆19Jun 24, 2022Updated 3 years ago
- Python package of MP-SENet from Explicit Estimation of Magnitude and Phase Spectra in Parallel for High-Quality Speech Enhancement.☆21Nov 1, 2024Updated last year
- Lite Voice Terminal, an "offline smart speaker" solution powered by on-premise ASR server (vosk API / kaldi engine)☆17Feb 29, 2024Updated 2 years ago
- wake-up word emotion recognition [APSIPA 2022]☆17Nov 11, 2022Updated 3 years ago
- Code for the winning solution in the SE&R 2022 Challenge - SER track.☆16Mar 28, 2023Updated 2 years ago
- One-shot TTS with Improved Unseen Speaker and Style Transfer☆37Mar 2, 2022Updated 4 years ago
- ☆15May 8, 2021Updated 4 years ago
- Torch implementation of Whisper-guided DDPM based Voice Conversion☆49Mar 7, 2023Updated 2 years ago
- This is the code of the ICASSP 2020 paper "Joint phoneme alignment and text-informed speech separation on highly corrupted speech"☆15Apr 8, 2024Updated last year
- FluentTTS: Text-dependent Fine-grained Style Control for Multi-style TTS☆20Nov 15, 2022Updated 3 years ago