Implementation of the paper "Spoken Language Recognition using X-vectors" in Pytorch
☆109Jul 20, 2020Updated 5 years ago
Alternatives and similar repositories for x-vector-pytorch
Users that are interested in x-vector-pytorch are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Deep speaker embeddings in PyTorch, including x-vectors. Code used in this work: https://arxiv.org/abs/2007.16196☆321Nov 11, 2020Updated 5 years ago
- Time delay neural network (TDNN) implementation in Pytorch using unfold method☆204Nov 21, 2019Updated 6 years ago
- Unofficial reimplementation of ECAPA-TDNN for speaker recognition (EER=0.86 for Vox1_O when train only in Vox2)☆810Apr 11, 2024Updated 2 years ago
- A pytorch implementation of xvector embedding☆79Mar 28, 2020Updated 6 years ago
- ☆99Dec 20, 2017Updated 8 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Voice conversion training with 109 speakers with limited training samples☆35Dec 21, 2020Updated 5 years ago
- MSR Identity Toolkit v1.0☆16Aug 18, 2017Updated 8 years ago
- Zalo AI Challenge 2020 - Top 2 @ Voice Verification☆15Oct 4, 2022Updated 3 years ago
- Experiments on speech recognition robustness to accents and dialects☆12Apr 2, 2019Updated 7 years ago
- An Open Source Tools for Speaker Recognition☆637Aug 5, 2024Updated last year
- Code for paper "Using Phonetic Posteriorgram Based Frame Pairing for Segmental Accent Conversion"☆36Jan 15, 2020Updated 6 years ago
- Voice conversion model for real-time speech synthesis using PPG (Phonetic PosteriorGram) as an intermediate feature, written in Pytorch.☆29Mar 3, 2022Updated 4 years ago
- xvector model on jtubespeech☆47Nov 5, 2023Updated 2 years ago
- This repo contains my attempt to create a Speaker Recognition and Verification system using SideKit-1.3.1☆115May 22, 2019Updated 7 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- A collection of utilities for handling IPA phones.☆27Sep 24, 2023Updated 2 years ago
- This repository creates speaker diarization recipes to be used within the egs folder of kaldi.☆17Aug 12, 2024Updated last year
- Google's TPGST reimplementation.☆34Dec 11, 2019Updated 6 years ago
- Adversarial attack and defense strategies for deep speaker recognition systems☆41Feb 18, 2021Updated 5 years ago
- In defence of metric learning for speaker recognition☆1,166Apr 22, 2026Updated last month
- Extract xvector and ivector under kaldi☆110Nov 22, 2018Updated 7 years ago
- Source code for paper "Breaking Security-Critical Voice Authentication".☆13Jul 10, 2023Updated 2 years ago
- GPU accelerated implementation of i-vector extractor training using PyTorch. Requires Kaldi for feature extraction and UBM training. An e…☆63Oct 15, 2019Updated 6 years ago
- DUSTED: Spoken-Term Discovery using Discrete Speech Units☆18Oct 2, 2024Updated last year
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Taiwanese Translation with BERT based model and RNN. Collection of Taiwanese text corpus☆13Oct 15, 2022Updated 3 years ago
- Y-vector: Multiscale Waveform Encoder for Speaker Embedding☆24Jul 16, 2024Updated last year
- Predicts the level of noise and reverberation on your audiofiles☆186May 18, 2026Updated last week
- ToneNet: A CNN Model of Tone Classification of Mandarin Chinese☆20Nov 27, 2019Updated 6 years ago
- ☆13Sep 25, 2024Updated last year
- Foreign Accent Conversion by Synthesizing Speech from Phonetic Posteriorgrams (Interspeech'19)☆147Jul 6, 2023Updated 2 years ago
- Probabilistic Linear Discriminant Analysis & classification, written in Python.☆129Mar 28, 2022Updated 4 years ago
- PyTorch implementation of a Time Delay Neural Network (TDNN)☆41Jun 6, 2019Updated 6 years ago
- A speaker recognition system which uses GMM-UBM for use in an Android application which helps in monitoring patients suffering from Schiz…☆55Jun 13, 2018Updated 7 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Taiwanese Speech Synthesis with Tacotron2☆26Oct 2, 2022Updated 3 years ago
- I-Vector Speaker recognition system implemented with MSRIT in matlab☆15Jan 12, 2016Updated 10 years ago
- Promting Whisper for Audio-Visual Speech Recognition, Code-Switched Speech Recognition, and Zero-Shot Speech Translation☆152Jan 16, 2024Updated 2 years ago
- ☆12Aug 16, 2018Updated 7 years ago
- Transfer Learning from Monolingual ASR to Transcription-free Cross-lingual Voice Conversion☆40Oct 22, 2022Updated 3 years ago
- Basic Tools☆13Dec 18, 2021Updated 4 years ago
- Implement Wave-U-Net by PyTorch, and migrate it to the speech enhancement.☆349Oct 4, 2022Updated 3 years ago