ranchlai / awesome-speaker-embeddingView external linksLinks
A curated list of speaker-embedding speaker-verification, speaker-identification resources.
☆52Aug 12, 2021Updated 4 years ago
Alternatives and similar repositories for awesome-speaker-embedding
Users that are interested in awesome-speaker-embedding are comparing it to the libraries listed below
Sorting:
- 语音增强领域的相关数据仿真工具和方法汇总--持续更新☆45Jul 11, 2024Updated last year
- Automatically Update LLM Papers Daily using Github Actions. Ref: https://github.com/Vincentqyw/cv-arxiv-daily☆10Updated this week
- Attention Backend for Aotumatic Speaker Verification with Multiple Enrollment Utterances☆50Oct 27, 2022Updated 3 years ago
- Digital Speech Processing in PyTorch.☆15Aug 12, 2022Updated 3 years ago
- Official repository for "Structure-Enhanced Pop Music Generation via Harmony-Aware Learning", ACM MM 2022.☆15Mar 22, 2023Updated 2 years ago
- ☆10May 15, 2021Updated 4 years ago
- ☆55Jan 13, 2023Updated 3 years ago
- Multispeaker Community Vocoder Model for DiffSinger☆39Aug 11, 2025Updated 6 months ago
- a guide to grapheme-to-phoneme conversion and phoneme list for ace singing voice synthesis engine☆41Jan 17, 2025Updated last year
- NISQA - Non-Intrusive Speech Quality and TTS Naturalness Assessment☆16Apr 13, 2022Updated 3 years ago
- Ichigo Whisper is a compact (22M parameters), open-source speech tokenizer for the Whisper-medium, designed to enhance performance on mul…☆17Jan 20, 2025Updated last year
- Speaker verification using ResnetSE (EER=0.0093) and ECAPA-TDNN☆96Sep 15, 2021Updated 4 years ago
- Speaker embedding (d-vector) trained with GE2E loss☆286Jan 8, 2024Updated 2 years ago
- MFA acoustic model training based on Opencpop☆15Sep 23, 2022Updated 3 years ago
- A curated list of awesome speaker recognition/verification papers, projects, datasets, and competition.☆15Aug 29, 2021Updated 4 years ago
- MeanAudio: Fast and Faithful Text-to-Audio Generation with Mean Flows☆123Sep 2, 2025Updated 5 months ago
- Some script for helping using Montreal Forced Aligner, maily for transforming Hanzi character to pinyin and extrat pause time from .textg…☆14Feb 9, 2024Updated 2 years ago
- Please visit https://thuhcsi.github.io/SnakeGAN/☆37Apr 25, 2023Updated 2 years ago
- In defence of metric learning for speaker recognition☆1,161Mar 26, 2024Updated last year
- Learning differentiable temporal resolution on time-series data.☆36Nov 12, 2022Updated 3 years ago
- ☆12Jun 14, 2022Updated 3 years ago
- Deep speaker embeddings in PyTorch, including x-vectors. Code used in this work: https://arxiv.org/abs/2007.16196☆320Nov 11, 2020Updated 5 years ago
- Variational Bayes HMM over x-vectors diarization☆283Jan 15, 2024Updated 2 years ago
- The demo page for ALMTokenizer☆58Apr 14, 2025Updated 10 months ago
- FREECODEC: A DISENTANGLED NEURAL SPEECH CODEC WITH FEWER TOKENS☆24Sep 9, 2024Updated last year
- Estimating the Age, Height, and Gender of a speaker with their speech signal.☆14Sep 19, 2022Updated 3 years ago
- Implementation of DCComix TTS: An End-to-End Expressive TTS with Discrete Code Collaborated with Mixer☆75Aug 21, 2023Updated 2 years ago
- NMT model with BERT in tensorflow 2.0☆20Jul 24, 2019Updated 6 years ago
- Supplementary Materials of ISMIR 2022 paper "Analysis and detection of singing techniques in repertoires of J-POP solo singers" by Yuya Y…☆23Apr 23, 2024Updated last year
- Ultrafast GAN based Vocoder for Text to Speech☆50Jul 16, 2022Updated 3 years ago
- Pytorch implementation of subband decomposition☆92Jul 26, 2022Updated 3 years ago
- torch version of LPCNet☆22Jul 8, 2020Updated 5 years ago
- The YouTube Text-To-Speech dataset is comprised of waveform audio extracted from YouTube videos alongside their English transcriptions☆52Apr 1, 2021Updated 4 years ago
- The code for the Interspeech paper "Speaker Embedding Extraction with Phonetic Information"☆45Jul 10, 2019Updated 6 years ago
- The project is associated with the recently-launched ICASSP 2022 Multi-channel Multi-party Meeting Transcription Challenge (M2MeT) to pro…☆133Jun 10, 2022Updated 3 years ago
- This is the implementation for "ControlVC: Zero-Shot Voice Conversion with Time-Varying Controls on Pitch and Rhythm"☆134Nov 29, 2023Updated 2 years ago
- Development Toolkit for the VoxCeleb Speaker Recognition Challenge 2020☆43Jul 17, 2020Updated 5 years ago
- Adaptive Vocoder for Custom Voice☆61Sep 22, 2022Updated 3 years ago
- ARCH: Audio Representations benCHmark☆53Aug 26, 2024Updated last year