mycrazycracy / speaker-embedding-with-phonetic-information
The code for the Interspeech paper "Speaker Embedding Extraction with Phonetic Information"
☆45 · Jul 10, 2019 · Updated 6 years ago
Alternatives and similar repositories for speaker-embedding-with-phonetic-information
Users who are interested in speaker-embedding-with-phonetic-information are comparing it to the libraries listed below.
- Implementation of the paper "BERTphone: Phonetically-aware Encoder Representations for Utterance-level Speaker and Language Recognition" ☆17 · Dec 10, 2020 · Updated 5 years ago
- Neural speaker recognition/verification system based on Kaldi and Tensorflow ☆32 · Jun 30, 2020 · Updated 5 years ago
- ☆10 · Apr 8, 2024 · Updated last year
- Real-time melgan based on cpu !!! ☆13 · Dec 3, 2019 · Updated 6 years ago
- Probabilistic Spherical Discriminant Analysis ☆12 · Oct 29, 2022 · Updated 3 years ago
- University of Edinburgh-Johns Hopkins University's system for ASVspoof 2017 Version 2.0 dataset. ☆50 · May 1, 2019 · Updated 6 years ago
- Meta-embeddings are a probabilistic generalization of embeddings in machine learning. ☆23 · Nov 23, 2018 · Updated 7 years ago
- Tensorflow implementation of x-vector topology on top of Kaldi recipe ☆120 · Nov 5, 2019 · Updated 6 years ago
- Gaussian Mixture VAE Tacotron ☆53 · Jul 6, 2023 · Updated 2 years ago
- MultiSV: scripts for data preparation ☆30 · Jan 18, 2025 · Updated last year
- A toolset for easy formant extraction and visualization from wav files and TTS models ☆33 · Sep 2, 2022 · Updated 3 years ago
- Official implementation of the paper: "LDNet: Unified Listener Dependent Modeling in MOS Prediction for Synthetic Speech" ☆68 · Dec 13, 2021 · Updated 4 years ago
- Include Basis-MelGAN, MelGAN, HifiGAN and Multiband-HifiGAN, maybe NHV in the future. ☆157 · Jul 2, 2021 · Updated 4 years ago
- TPSE-GST Tacotron2 ☆14 · May 1, 2019 · Updated 6 years ago
- Share some recent speaker recognition papers and their implementations. ☆90 · Sep 26, 2019 · Updated 6 years ago
- ☆34 · Jul 16, 2019 · Updated 6 years ago
- Implementation of the DIVA model of speech acquisition and production using PyTorch ☆21 · Jan 18, 2023 · Updated 3 years ago
- In defence of metric learning for speaker recognition ☆1,161 · Mar 26, 2024 · Updated last year
- An Open Source Tools for Speaker Recognition ☆634 · Aug 5, 2024 · Updated last year
- ICASSP 2021 accepted papers in term of voice conversion (VC) ☆18 · Apr 11, 2021 · Updated 4 years ago
- A light weight neural speaker embeddings extraction based on Kaldi and PyTorch. ☆136 · Jan 27, 2020 · Updated 6 years ago
- ☆37 · May 8, 2021 · Updated 4 years ago
- The Official Implementation of “Content-Dependent Fine-Grained Speaker Embedding for Zero-Shot Speaker Adaptation in Text-to-Speech Synth… ☆87 · Dec 20, 2022 · Updated 3 years ago
- Torch implementation of Whisper-guided DDPM based Voice Conversion ☆49 · Mar 7, 2023 · Updated 2 years ago
- Research_speech_speaker_verification_nist_sre2010 ☆12 · Mar 1, 2016 · Updated 9 years ago
- [InterSpeech 2020] "AutoSpeech: Neural Architecture Search for Speaker Recognition" by Shaojin Ding*, Tianlong Chen*, Xinyu Gong, Weiwei … ☆209 · Dec 8, 2022 · Updated 3 years ago
- Speech Parameter Estimation Using Differentiable Speech Synthesizer ☆44 · May 9, 2023 · Updated 2 years ago
- Based on https://github.com/fatchord/WaveRNN ☆24 · May 3, 2020 · Updated 5 years ago
- Unofficial PyTorch implementation of Masked Autoencoders that Listen ☆71 · Aug 8, 2022 · Updated 3 years ago
- Acoustic models for: A Comparison of Discrete and Soft Speech Units for Improved Voice Conversion ☆104 · Jul 12, 2023 · Updated 2 years ago
- DiffSinger: Singing Voice Synthesis via Shallow Diffusion Mechanism (SVS & TTS); AAAI 2022; Official code ☆10 · Mar 8, 2022 · Updated 3 years ago
- acnn for text-independent speaker recognition ☆10 · Feb 8, 2022 · Updated 4 years ago
- PyTorch implementation of Retriever: Learning Content-Style Representation ☆12 · Jan 27, 2023 · Updated 3 years ago
- Official repository for RawNet, RawNet2, and RawNet3 ☆396 · Mar 21, 2024 · Updated last year
- Code for the paper: "Leveraging speaker attribute information using multi task learning for speaker verification and diarization" present… ☆26 · Oct 5, 2022 · Updated 3 years ago
- Audio samples for the paper "TinyLSTMs: Efficient Neural Speech Enhancement for Hearing Aids" ☆48 · Jun 3, 2020 · Updated 5 years ago
- Adaptive Vocoder for Custom Voice ☆61 · Sep 22, 2022 · Updated 3 years ago
- Score Normalization for NIST 2019 Speaker Recognition Evaluation ☆10 · Nov 8, 2019 · Updated 6 years ago
- Tensorflow and kaldi implementation of our paper "VAE-based regularization for deep speaker embedding" ☆11 · Mar 24, 2023 · Updated 2 years ago