mycrazycracy/speaker-embedding-with-phonetic-information

The code for the Interspeech paper "Speaker Embedding Extraction with Phonetic Information"

☆45

Alternatives and similar repositories for speaker-embedding-with-phonetic-information

Users who are interested in speaker-embedding-with-phonetic-information are comparing it to the repositories listed below.

mycrazycracy / tf-kaldi-speaker
Neural speaker recognition/verification system based on Kaldi and TensorFlow
☆31 · Jun 30, 2020 · Updated 6 years ago
KrishnaDN / BERTphone
Implementation of the paper "BERTphone: Phonetically-aware Encoder Representations for Utterance-level Speaker and Language Recognition"
☆17 · Dec 10, 2020 · Updated 5 years ago
jefflai108 / Attentive-Filtering-Network
University of Edinburgh-Johns Hopkins University's system for the ASVspoof 2017 Version 2.0 dataset.
☆50 · May 1, 2019 · Updated 7 years ago
Snowdar / asv-subtools
Open-source tools for speaker recognition
☆638 · Aug 5, 2024 · Updated last year
BUTSpeechFIT / x-vector-kaldi-tf
TensorFlow implementation of x-vector topology on top of Kaldi recipe
☆118 · Nov 5, 2019 · Updated 6 years ago
bsxfan / meta-embeddings
Meta-embeddings are a probabilistic generalization of embeddings in machine learning.
☆23 · Nov 23, 2018 · Updated 7 years ago
bjfu-ai-institute / speaker-recognition-papers
A collection of recent speaker recognition papers and their implementations.
☆89 · Sep 26, 2019 · Updated 6 years ago
clovaai / voxceleb_trainer
In defence of metric learning for speaker recognition
☆1,170 · Apr 22, 2026 · Updated 3 months ago
One-Shot-Voice-Conversion-with-WIN / WINVC
Official implementation of "WINVC: One-Shot Voice Conversion with Weight Adaptive Instance Normalization".
☆30 · Nov 13, 2021 · Updated 4 years ago
jefflai108 / pytorch-kaldi-neural-speaker-embeddings
Lightweight neural speaker embedding extraction based on Kaldi and PyTorch.
☆136 · Jan 27, 2020 · Updated 6 years ago
rgzn-aiyun / melgan-cpu
Real-time MelGAN running on CPU
☆13 · Dec 3, 2019 · Updated 6 years ago
cschaefer26 / StyleMelGAN
☆10 · Apr 8, 2024 · Updated 2 years ago
xcmyz / FastVocoder
Includes Basis-MelGAN, MelGAN, HifiGAN and Multiband-HifiGAN; NHV may be added in the future.
☆157 · Jul 2, 2021 · Updated 5 years ago
the-anonymous-bs / av-SALMONN
av-SALMONN: Speech-Enhanced Audio-Visual Large Language Models
☆13 · May 8, 2024 · Updated 2 years ago
unilight / LDNet
Official implementation of the paper: "LDNet: Unified Listener Dependent Modeling in MOS Prediction for Synthetic Speech"
☆68 · Dec 13, 2021 · Updated 4 years ago
VITA-Group / AutoSpeech
[InterSpeech 2020] "AutoSpeech: Neural Architecture Search for Speaker Recognition" by Shaojin Ding*, Tianlong Chen*, Xinyu Gong, Weiwei …
☆206 · Dec 8, 2022 · Updated 3 years ago
shkim816 / acnn_speaker_recog
ACNN for text-independent speaker recognition
☆10 · Feb 8, 2022 · Updated 4 years ago
fangfm / lcnn
A TensorFlow implementation of light convolutional neural network (LCNN)
☆12 · Dec 27, 2018 · Updated 7 years ago
WeidiXie / VGG-Speaker-Recognition
Utterance-level Aggregation For Speaker Recognition In The Wild
☆371 · Mar 24, 2023 · Updated 3 years ago
Labmem-Zhouyx / CDFSE_FastSpeech2
The Official Implementation of “Content-Dependent Fine-Grained Speaker Embedding for Zero-Shot Speaker Adaptation in Text-to-Speech Synth…
☆86 · Dec 20, 2022 · Updated 3 years ago
zjumml / DiffSinger
DiffSinger: Singing Voice Synthesis via Shallow Diffusion Mechanism (SVS & TTS); AAAI 2022; Official code
☆10 · Mar 8, 2022 · Updated 4 years ago
babe269 / performant
A toolset for easy formant extraction and visualization from wav files and TTS models
☆33 · Sep 2, 2022 · Updated 3 years ago
hhguo / FastGriffinLim_Pytorch
☆13 · Nov 16, 2020 · Updated 5 years ago
zdyshine / beat_track_mgtv_baseline
☆16 · Jul 20, 2021 · Updated 5 years ago
mycrazycracy / Backends-for-SRE19
This repository illustrates the use of several different backends on NIST SRE 2019.
☆21 · Apr 25, 2020 · Updated 6 years ago
zjlww / zjlww.github.io
☆12 · Feb 26, 2023 · Updated 3 years ago
Jungjee / RawNet
Official repository for RawNet, RawNet2, and RawNet3
☆407 · Mar 21, 2024 · Updated 2 years ago
hhguo / WaveRNN
Based on https://github.com/fatchord/WaveRNN
☆24 · May 3, 2020 · Updated 6 years ago
lawlict / ECAPA-TDNN
☆106 · Sep 2, 2021 · Updated 4 years ago
qqueing / SR_with_kaldi
Speaker embedding (verification and recognition) using TensorFlow with Kaldi
☆41 · Sep 18, 2017 · Updated 8 years ago
bsxfan / PSDA
Probabilistic Spherical Discriminant Analysis
☆12 · Oct 29, 2022 · Updated 3 years ago
azraelkuan / asvspoof2017
An implementation of ASVspoof 2017 using PyTorch
☆21 · Jan 8, 2018 · Updated 8 years ago
RicherMans / PLDA
An LDA/PLDA estimator using Kaldi in Python for speaker verification tasks
☆102 · Apr 15, 2017 · Updated 9 years ago
wq2012 / SpeakerRecognitionFromScratch
Final project for the Speaker Recognition course on Udemy, 机器之心, 深蓝学院 and 语音之家
☆47 · May 7, 2024 · Updated 2 years ago
BUTSpeechFIT / MultiSV
MultiSV: scripts for data preparation
☆31 · Jan 18, 2025 · Updated last year
cnlinxi / tpse_tacotron2
TPSE-GST Tacotron2
☆14 · May 1, 2019 · Updated 7 years ago
awslabs / speech-representations
Code for DeCoAR (ICASSP 2020) and BERTphone (Odyssey 2020)
☆104 · Nov 26, 2022 · Updated 3 years ago
PHJhjpeng1992 / awesome-asv-antispoofing
A curated list of awesome ASV (Automatic Speaker Verification) anti-spoofing papers, libraries, datasets, and other resources.
☆22 · May 21, 2021 · Updated 5 years ago
Lhx94As / PHO-LID
PHO-LID: A Unified Model to Incorporate Acoustic-Phonetic and Phonotactic Information for Language Identification
☆21 · Aug 24, 2023 · Updated 2 years ago