KrishnaDN/x-vector-pytorch

Implementation of the paper "Spoken Language Recognition using X-vectors" in Pytorch

☆110

Alternatives and similar repositories for x-vector-pytorch

Users that are interested in x-vector-pytorch are comparing it to the libraries listed below.

manojpamk / pytorch_xvectors
Deep speaker embeddings in PyTorch, including x-vectors. Code used in this work: https://arxiv.org/abs/2007.16196
☆321 · Nov 11, 2020 · Updated 5 years ago
cvqluu / TDNN
Time delay neural network (TDNN) implementation in Pytorch using unfold method
☆204 · Nov 21, 2019 · Updated 6 years ago
TaoRuijie / ECAPA-TDNN
Unofficial reimplementation of ECAPA-TDNN for speaker recognition (EER=0.86 for Vox1_O when train only in Vox2)
☆823 · Apr 11, 2024 · Updated 2 years ago
Dannynis / xvector_pytorch
A pytorch implementation of xvector embedding
☆79 · Mar 28, 2020 · Updated 6 years ago
SiddGururani / Pytorch-TDNN
☆99 · Dec 20, 2017 · Updated 8 years ago
MingjieChen / LowResourceVC
Voice conversion training with 109 speakers with limited training samples
☆35 · Dec 21, 2020 · Updated 5 years ago
wangwei2009 / MSR-Identity-Toolkit-v1.0
MSR Identity Toolkit v1.0
☆16 · Aug 18, 2017 · Updated 8 years ago
nghiapq77 / voice-verification
Zalo AI Challenge 2020 - Top 2 @ Voice Verification
☆15 · Oct 4, 2022 · Updated 3 years ago
SilvrDuck / AccentedSpeechRecognition
Experiments on speech recognition robustness to accents and dialects
☆12 · Apr 2, 2019 · Updated 7 years ago
guanlongzhao / ppg-gmm
Code for paper "Using Phonetic Posteriorgram Based Frame Pairing for Segmental Accent Conversion"
☆36 · Jan 15, 2020 · Updated 6 years ago
Snowdar / asv-subtools
Open Source Tools for Speaker Recognition
☆638 · Aug 5, 2024 · Updated last year
gteu / realtime-ppg-vc
Voice conversion model for real-time speech synthesis using PPG (Phonetic PosteriorGram) as an intermediate feature, written in Pytorch.
☆29 · Mar 3, 2022 · Updated 4 years ago
Anwarvic / Speaker-Recognition
This repo contains my attempt to create a Speaker Recognition and Verification system using SideKit-1.3.1
☆116 · May 22, 2019 · Updated 7 years ago
sarulab-speech / xvector_jtubespeech
xvector model on jtubespeech
☆47 · Nov 5, 2023 · Updated 2 years ago
MiniXC / phones
A collection of utilities for handling IPA phones.
☆27 · Sep 24, 2023 · Updated 2 years ago
cadia-lvl / kaldi-speaker-diarization
This repository creates speaker diarization recipes to be used within the egs folder of kaldi.
☆17 · Aug 12, 2024 · Updated last year
Yangyangii / TPGST-Tacotron
Google's TPGST reimplementation.
☆34 · Dec 11, 2019 · Updated 6 years ago
usc-sail / gard-adversarial-speaker-id
Adversarial attack and defense strategies for deep speaker recognition systems
☆41 · Feb 18, 2021 · Updated 5 years ago
vvestman / pytorch-ivectors
GPU accelerated implementation of i-vector extractor training using PyTorch. Requires Kaldi for feature extraction and UBM training. An e…
☆63 · Oct 15, 2019 · Updated 6 years ago
zeroQiaoba / ivector-xvector
Extract xvector and ivector under kaldi
☆110 · Nov 22, 2018 · Updated 7 years ago
clovaai / voxceleb_trainer
In defence of metric learning for speaker recognition
☆1,170 · Apr 22, 2026 · Updated 3 months ago
gzhu06 / Y-vector
Y-vector: Multiscale Waveform Encoder for Speaker Embedding
☆24 · Jul 16, 2024 · Updated 2 years ago
ga642381 / Taiwanese-Translation
Taiwanese Translation with BERT based model and RNN. Collection of Taiwanese text corpus
☆13 · Oct 15, 2022 · Updated 3 years ago
bshall / dusted
DUSTED: Spoken-Term Discovery using Discrete Speech Units
☆17 · Oct 2, 2024 · Updated last year
andrekassis / Breaking-Security-Critical-Voice-Authentication
Source code for paper "Breaking Security-Critical Voice Authentication".
☆13 · Jul 10, 2023 · Updated 3 years ago
saber5433 / ToneNet
ToneNet: A CNN Model of Tone Classification of Mandarin Chinese
☆20 · Nov 27, 2019 · Updated 6 years ago
marianne-m / brouhaha-vad
Predicts the level of noise and reverberation in your audio files
☆190 · May 23, 2026 · Updated 2 months ago
guanlongzhao / fac-via-ppg
Foreign Accent Conversion by Synthesizing Speech from Phonetic Posteriorgrams (Interspeech'19)
☆147 · Jul 6, 2023 · Updated 3 years ago
nervjack2 / Speech2Unit
☆13 · Sep 25, 2024 · Updated last year
RaviSoji / plda
Probabilistic Linear Discriminant Analysis & classification, written in Python.
☆129 · Mar 28, 2022 · Updated 4 years ago
scelesticsiva / speaker_recognition_GMM_UBM
A speaker recognition system which uses GMM-UBM for use in an Android application which helps in monitoring patients suffering from Schiz…
☆55 · Jun 13, 2018 · Updated 8 years ago
ga642381 / Taiwanese-Speech-Synthesis
Taiwanese Speech Synthesis with Tacotron2
☆26 · Oct 2, 2022 · Updated 3 years ago
jonasvdd / TDNN
PyTorch implementation of a Time Delay Neural Network (TDNN)
☆41 · Jun 6, 2019 · Updated 7 years ago
pedrocolon93 / ivectormatlabmsrit
I-Vector Speaker recognition system implemented with MSRIT in matlab
☆15 · Jan 12, 2016 · Updated 10 years ago
abccaba2000 / discourse-parser
☆12 · Aug 16, 2018 · Updated 7 years ago
jasonppy / PromptingWhisper
Prompting Whisper for Audio-Visual Speech Recognition, Code-Switched Speech Recognition, and Zero-Shot Speech Translation
☆151 · Jan 16, 2024 · Updated 2 years ago
GrantL10 / My-Python-Codes-for-Acoustics
Basic Tools
☆13 · Dec 18, 2021 · Updated 4 years ago
fedderrico / ubm_map_diarization
Speaker diarization with GMM-UBM and MAP Adaptation
☆31 · Sep 13, 2018 · Updated 7 years ago
haoxiangsnr / Wave-U-Net-for-Speech-Enhancement
Wave-U-Net implemented in PyTorch and adapted for speech enhancement.
☆349 · Oct 4, 2022 · Updated 3 years ago