Implementation of the paper "Spoken Language Recognition using X-vectors" in Pytorch
☆107Jul 20, 2020Updated 5 years ago
Alternatives and similar repositories for x-vector-pytorch
Users that are interested in x-vector-pytorch are comparing it to the libraries listed below.
- Deep speaker embeddings in PyTorch, including x-vectors. Code used in this work: https://arxiv.org/abs/2007.16196 ☆321Nov 11, 2020Updated 5 years ago
- Time delay neural network (TDNN) implementation in Pytorch using unfold method☆204Nov 21, 2019Updated 6 years ago
- Unofficial reimplementation of ECAPA-TDNN for speaker recognition (EER=0.86 for Vox1_O when trained only on Vox2)☆794Apr 11, 2024Updated last year
- A pytorch implementation of xvector embedding☆79Mar 28, 2020Updated 5 years ago
- ☆99Dec 20, 2017Updated 8 years ago
- Voice conversion training with 109 speakers with limited training samples☆35Dec 21, 2020Updated 5 years ago
- MSR Identity Toolkit v1.0☆17Aug 18, 2017Updated 8 years ago
- Zalo AI Challenge 2020 - Top 2 @ Voice Verification☆15Oct 4, 2022Updated 3 years ago
- Experiments on speech recognition robustness to accents and dialects☆12Apr 2, 2019Updated 6 years ago
- Open-source tools for speaker recognition☆636Aug 5, 2024Updated last year
- Code for paper "Using Phonetic Posteriorgram Based Frame Pairing for Segmental Accent Conversion"☆36Jan 15, 2020Updated 6 years ago
- Voice conversion model for real-time speech synthesis using PPG (Phonetic PosteriorGram) as an intermediate feature, written in Pytorch.☆29Mar 3, 2022Updated 4 years ago
- xvector model on jtubespeech☆47Nov 5, 2023Updated 2 years ago
- This repo contains my attempt to create a Speaker Recognition and Verification system using SideKit-1.3.1☆114May 22, 2019Updated 6 years ago
- A collection of utilities for handling IPA phones.☆26Sep 24, 2023Updated 2 years ago
- This repository creates speaker diarization recipes to be used within the egs folder of kaldi.☆17Aug 12, 2024Updated last year
- Google's TPGST reimplementation.☆34Dec 11, 2019Updated 6 years ago
- Adversarial attack and defense strategies for deep speaker recognition systems☆41Feb 18, 2021Updated 5 years ago
- In defence of metric learning for speaker recognition☆1,164Mar 26, 2024Updated last year
- Source code for paper "Breaking Security-Critical Voice Authentication".☆13Jul 10, 2023Updated 2 years ago
- Extract xvector and ivector under kaldi☆110Nov 22, 2018Updated 7 years ago
- GPU accelerated implementation of i-vector extractor training using PyTorch. Requires Kaldi for feature extraction and UBM training. An e…☆63Oct 15, 2019Updated 6 years ago
- Taiwanese Translation with BERT based model and RNN. Collection of Taiwanese text corpus☆13Oct 15, 2022Updated 3 years ago
- DUSTED: Spoken-Term Discovery using Discrete Speech Units☆18Oct 2, 2024Updated last year
- Y-vector: Multiscale Waveform Encoder for Speaker Embedding☆24Jul 16, 2024Updated last year
- Predicts the level of noise and reverberation on your audiofiles☆179Jun 17, 2025Updated 9 months ago
- ToneNet: A CNN Model of Tone Classification of Mandarin Chinese☆20Nov 27, 2019Updated 6 years ago
- ☆13Sep 25, 2024Updated last year
- Foreign Accent Conversion by Synthesizing Speech from Phonetic Posteriorgrams (Interspeech'19)☆148Jul 6, 2023Updated 2 years ago
- Using Kaldi x-vector method to train speaker recognition model on aishell database.☆17Aug 19, 2021Updated 4 years ago
- Probabilistic Linear Discriminant Analysis & classification, written in Python.☆131Mar 28, 2022Updated 3 years ago
- PyTorch implementation of a Time Delay Neural Network (TDNN)☆41Jun 6, 2019Updated 6 years ago
- A speaker recognition system which uses GMM-UBM for use in an Android application which helps in monitoring patients suffering from Schiz…☆55Jun 13, 2018Updated 7 years ago
- Taiwanese Speech Synthesis with Tacotron2☆25Oct 2, 2022Updated 3 years ago
- I-Vector Speaker recognition system implemented with MSRIT in matlab☆15Jan 12, 2016Updated 10 years ago
- Prompting Whisper for Audio-Visual Speech Recognition, Code-Switched Speech Recognition, and Zero-Shot Speech Translation☆150Jan 16, 2024Updated 2 years ago
- ☆12Aug 16, 2018Updated 7 years ago
- Transfer Learning from Monolingual ASR to Transcription-free Cross-lingual Voice Conversion☆40Oct 22, 2022Updated 3 years ago
- Implement Wave-U-Net by PyTorch, and migrate it to the speech enhancement.☆347Oct 4, 2022Updated 3 years ago