RF5/simple-speaker-embedding

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/RF5/simple-speaker-embedding)

RF5 / simple-speaker-embedding

A speaker embedding network in Pytorch that is very quick to set up and use for whatever purposes.

☆91

Alternatives and similar repositories for simple-speaker-embedding

Users that are interested in simple-speaker-embedding are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

RF5 / simple-autovc
View on GitHub
A simple, performant re-implementation of AutoVC
☆22Jul 6, 2023Updated 3 years ago
Akella17 / speaker-embedding
View on GitHub
A deep neural network for finding text-independent speaker embedding written in tensorflow and tensorpack
☆10Feb 19, 2018Updated 8 years ago
nii-yamagishilab / Attention_Backend_for_ASV
View on GitHub
Attention Backend for Aotumatic Speaker Verification with Multiple Enrollment Utterances
☆50Oct 27, 2022Updated 3 years ago
taylorlu / ghostvlad-speaker
View on GitHub
An tensorflow implementation of ghostvlad for speaker recognition
☆15May 2, 2019Updated 7 years ago
EricWilbanks / faseAlign
View on GitHub
Command line tool for forced-alignment of Spanish speech data
☆13Dec 31, 2025Updated 6 months ago
Open source password manager - Proton Pass • Ad
Securely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
oscarknagg / raw-audio-gender-classification
View on GitHub
Machine learning experiment to perform gender classification from raw audio.
☆23Sep 1, 2018Updated 7 years ago
yistLin / dvector
View on GitHub
Speaker embedding (d-vector) trained with GE2E loss
☆289Jan 8, 2024Updated 2 years ago
RF5 / simple-asgan
View on GitHub
Training code and trained checkpoints for ASGAN.
☆62Dec 27, 2023Updated 2 years ago
thuhcsi / VAENAR-TTS
View on GitHub
The official implementation of VAENAR-TTS, a VAE based non-autoregressive TTS model.
☆144Jul 8, 2021Updated 5 years ago
huutuongtu / Lightvoc
View on GitHub
LIGHTVOC AN UPSAMPLING-FREE GAN VOCODER BASED ON CONFORMER AND INVERSE SHORT-TIME FOURIER TRANSFORM
☆18May 17, 2024Updated 2 years ago
zzw922cn / wesinger2
View on GitHub
Synthesized singing voice demos of WeSinger 2 paper.
☆26Feb 20, 2023Updated 3 years ago
cvqluu / MTL-Speaker-Embeddings
View on GitHub
Code for the paper: "Leveraging speaker attribute information using multi task learning for speaker verification and diarization" present…
☆26Oct 5, 2022Updated 3 years ago
innnky / FreeSVC
View on GitHub
基于FreeVC的歌声转换
☆21Dec 16, 2022Updated 3 years ago
yl4579 / PitchExtractor
View on GitHub
Deep Neural Pitch Extractor for Voice Conversion and TTS Training
☆151Aug 22, 2022Updated 3 years ago
Serverless GPU API endpoints on Runpod - Get Bonus Credits • Ad
Skip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
CODEJIN / XiaoiceSing2
View on GitHub
☆19Feb 2, 2023Updated 3 years ago
nii-yamagishilab / vctk-silence-labels
View on GitHub
☆25Oct 4, 2022Updated 3 years ago
rrlyman / phase-reconstruction
View on GitHub
phase reconstruction from magnitude terms of an STFT
☆13May 18, 2025Updated last year
v-nhandt21 / MusicVoiceConversion
View on GitHub
Sing any popular song with your voice
☆11Jul 10, 2022Updated 4 years ago
Labmem-Zhouyx / CDFSE_FastSpeech2
View on GitHub
The Official Implementation of “Content-Dependent Fine-Grained Speaker Embedding for Zero-Shot Speaker Adaptation in Text-to-Speech Synth…
☆86Dec 20, 2022Updated 3 years ago
dihardchallenge / dihard3_baseline
View on GitHub
☆30Jul 21, 2022Updated 4 years ago
aqtq314 / VogenSVS
View on GitHub
☆15Apr 16, 2026Updated 3 months ago
redmist328 / APNet2
View on GitHub
Source code of APNet2, a vocoder
☆60Nov 23, 2023Updated 2 years ago
rishikksh20 / HiFiplusplus-pytorch
View on GitHub
HiFi++: a Unified Framework for Neural Vocoding, Bandwidth Extension and Speech Enhancement
☆160Jul 16, 2022Updated 4 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
maum-ai / assem-vc
View on GitHub
Official Code for Assem-VC @ICASSP2022
☆269May 16, 2022Updated 4 years ago
acetylSv / non-parallel-rhythm-flexible-VC
View on GitHub
PyTorch implementation of: Rhythm-Flexible Voice Conversion without Parallel Data Using Cycle-GAN over Phoneme Posteriorgram Sequences
☆11Jul 18, 2019Updated 7 years ago
yl4579 / AuxiliaryASR
View on GitHub
Joint CTC-S2S Phoneme-level ASR for Voice Conversion and TTS (Text-Mel Alignment)
☆127Jun 16, 2022Updated 4 years ago
cyhuang-tw / AutoVC
View on GitHub
An unofficial implementation of the paper "AutoVC: Zero-Shot Voice Style Transfer with Only Autoencoder Loss".
☆34Apr 26, 2021Updated 5 years ago
manojpamk / pytorch_xvectors
View on GitHub
Deep speaker embeddings in PyTorch, including x-vectors. Code used in this work: https://arxiv.org/abs/2007.16196
☆321Nov 11, 2020Updated 5 years ago
shangeth / wavencoder
View on GitHub
WavEncoder is a Python library for encoding audio signals, transforms for audio augmentation, and training audio classification models wi…
☆92Jun 6, 2021Updated 5 years ago
scf4 / OpenMusicLM
View on GitHub
Update: Ignore this repo, check out @lucidrains' implementation https://github.com/lucidrains/musiclm-pytorch
☆15Jan 27, 2023Updated 3 years ago
seahore / PPG-GradVC
View on GitHub
A diffusion-based cross-lingual voice conversion model, as my bachelor's thesis
☆45Jul 24, 2023Updated 3 years ago
RF5 / transfusion-asr
View on GitHub
Transcribing Speech with Multinomial Diffusion, training code and models.
☆80Sep 27, 2023Updated 2 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
dendisuhubdy / ADMMGLA
View on GitHub
Griffin-Lim Like Phase Recovery via Alternating Direction Method of Multipliers (Yoshiki Masuyama et al., 2018)
☆13Dec 17, 2018Updated 7 years ago
zengchang233 / xiaoicesing2
View on GitHub
The source code for the paper XiaoiceSing2 (interspeech2023)
☆49Jan 15, 2024Updated 2 years ago
ndkgit339 / spe-dss
View on GitHub
Speech Parameter Estimation Using Differentiable Speech Synthesizer
☆43May 9, 2023Updated 3 years ago
suhitaghosh10 / emo-stargan
View on GitHub
Implementation of Emo-StarGAN
☆48Dec 19, 2023Updated 2 years ago
Edresson / VoiceSplit
View on GitHub
VoiceSplit: Targeted Voice Separation by Speaker-Conditioned Spectrogram
☆271Jul 25, 2024Updated 2 years ago
chomeyama / SiFiGAN
View on GitHub
Official implementation of the source-filter HiFiGAN vocoder
☆275Jul 29, 2023Updated 2 years ago
ujscjj / DPTNet
View on GitHub
☆119Jan 8, 2021Updated 5 years ago