A light weight neural speaker embeddings extraction based on Kaldi and PyTorch.
☆136Jan 27, 2020Updated 6 years ago
Alternatives and similar repositories for pytorch-kaldi-neural-speaker-embeddings
Users that are interested in pytorch-kaldi-neural-speaker-embeddings are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- DropClass and DropAdapt - repository for the paper accepted to Speaker Odyssey 2020☆22Oct 29, 2020Updated 5 years ago
- VCTK multi-speaker tacotron for ICASSP 2020☆266Mar 29, 2022Updated 4 years ago
- Neural speaker recognition/verification system based on Kaldi and Tensorflow☆31Jun 30, 2020Updated 5 years ago
- Pytorch implementation of "Generalized End-to-End Loss for Speaker Verification"☆103Mar 18, 2019Updated 7 years ago
- Deep speaker embeddings in PyTorch, including x-vectors. Code used in this work: https://arxiv.org/abs/2007.16196☆321Nov 11, 2020Updated 5 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Implementation of Neural PLDA (NPLDA) model (A discriminative backend for Speaker Verification)☆100Apr 20, 2020Updated 6 years ago
- Tensorflow implementation of x-vector topology on top of Kaldi recipe☆119Nov 5, 2019Updated 6 years ago
- ☆35Apr 8, 2019Updated 7 years ago
- ☆37May 8, 2021Updated 4 years ago
- ☆45Oct 24, 2020Updated 5 years ago
- An Open Source Tools for Speaker Recognition☆636Aug 5, 2024Updated last year
- Speaker embedding(verification and recognition) using Pytorch☆369Jul 24, 2020Updated 5 years ago
- GPU accelerated implementation of i-vector extractor training using PyTorch. Requires Kaldi for feature extraction and UBM training. An e…☆63Oct 15, 2019Updated 6 years ago
- Interface for Controllable Expressive Talking Machine☆40Sep 20, 2025Updated 7 months ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- D3M - Dynamic Data Discrepancy Mitigation for Anti-spoofing - Implementation of work Dynamically Mitigating Data Discrepancy with Balance…☆30Feb 15, 2023Updated 3 years ago
- Deep Speaker: an End-to-End Neural Speaker Embedding System.☆940Apr 13, 2024Updated 2 years ago
- An implementation of "Investigation of enhanced Tacotron text-to-speech synthesis systems with self-attention for pitch accent language" …☆114Jun 19, 2020Updated 5 years ago
- Speaker embedding (d-vector) trained with GE2E loss☆287Jan 8, 2024Updated 2 years ago
- Collection of self-supervised models for speaker and language recognition tasks.☆19Jan 18, 2022Updated 4 years ago
- Simple d-vector based Speaker Recognition (verification and identification) using Pytorch☆213Jul 17, 2020Updated 5 years ago
- Time delay neural network (TDNN) implementation in Pytorch using unfold method☆204Nov 21, 2019Updated 6 years ago
- Keras implementation of SincNet (https://github.com/mravanelli/SincNet, https://arxiv.org/abs/1808.00158)☆12Aug 5, 2018Updated 7 years ago
- PyTorch implementation of a self-attentive speaker embedding☆17Sep 24, 2019Updated 6 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- PyTorch implementation of the Factorized TDNN (TDNN-F) from "Semi-Orthogonal Low-Rank Matrix Factorization for Deep Neural Networks" and …☆149Jan 6, 2020Updated 6 years ago
- In defence of metric learning for speaker recognition☆1,165Updated this week
- PyTorch based speaker embedding model☆16Apr 13, 2024Updated 2 years ago
- ☆51Feb 15, 2019Updated 7 years ago
- [InterSpeech 2020] "AutoSpeech: Neural Architecture Search for Speaker Recognition" by Shaojin Ding*, Tianlong Chen*, Xinyu Gong, Weiwei …☆208Dec 8, 2022Updated 3 years ago
- The code for the Interspeech paper "Speaker Embedding Extraction with Phonetic Information"☆45Jul 10, 2019Updated 6 years ago
- Problem Agnostic Speech Encoder☆447Jul 6, 2023Updated 2 years ago
- A WaveRNN implementation☆201Oct 14, 2019Updated 6 years ago
- ERISHA is a mulitilingual multispeaker expressive speech synthesis framework. It can transfer the expressivity to the speaker's voice for…☆44Dec 17, 2020Updated 5 years ago
- Open source password manager - Proton Pass • AdSecurely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
- PyTorch implementation of "Generalized End-to-End Loss for Speaker Verification" by Wan, Li et al.☆598Jan 20, 2022Updated 4 years ago
- the Tensorflow version of multi-speaker TTS training with feedback constraint☆40Oct 12, 2020Updated 5 years ago
- A pure python module for reading and writing kaldi ark files☆268Mar 6, 2025Updated last year
- Implementation of "Learning Latent Representations for Style Control and Transfer in End-to-end Speech Synthesis"☆169Jul 6, 2023Updated 2 years ago
- Code for DeCoAR (ICASSP 2020) and BERTphone (Odyssey 2020)☆104Nov 26, 2022Updated 3 years ago
- ☆23Jun 28, 2019Updated 6 years ago
- VoxSRC Challenge☆31Jun 11, 2019Updated 6 years ago