CODEJIN/Speaker_Embedding_Torch

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/CODEJIN/Speaker_Embedding_Torch)

CODEJIN / Speaker_Embedding_Torch

PyTorch based speaker embedding model

☆16

Alternatives and similar repositories for Speaker_Embedding_Torch

Users that are interested in Speaker_Embedding_Torch are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

hrnoh / f0-autovc
View on GitHub
Pytorch implementation of "f0-consistent many-to-many non-parallel voice conversion via conditional autoencoder"
☆29Nov 6, 2020Updated 5 years ago
cyhuang-tw / AutoVC
View on GitHub
An unofficial implementation of the paper "AutoVC: Zero-Shot Voice Style Transfer with Only Autoencoder Loss".
☆34Apr 26, 2021Updated 5 years ago
CODEJIN / AutoVC
View on GitHub
☆30Jun 30, 2020Updated 6 years ago
Jackson-Kang / Prosody-augmentation-for-Text-to-speech
View on GitHub
Simple tool for speech dataset augmentation for modeling various prosodies.
☆14Jan 14, 2021Updated 5 years ago
XierHacker / ChineseWordSegment
View on GitHub
Tensorflow Implements Chinese Word Segment use LSTM+CRF and Dilated CNN+CRF
☆15Jul 16, 2018Updated 8 years ago
Deploy open-source AI quickly and easily - Special Bonus Offer • Ad
Runpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
shkim816 / acnn_speaker_recog
View on GitHub
acnn for text-independent speaker recognition
☆10Feb 8, 2022Updated 4 years ago
freenowill / AutoVC-WavRNN
View on GitHub
voice conversion system
☆25Jun 10, 2020Updated 6 years ago
MingjieChen / VoiceConversionGANs
View on GitHub
GAN series for voice conversion on VCC2018 dataset
☆17Aug 27, 2020Updated 5 years ago
Akella17 / speaker-embedding
View on GitHub
A deep neural network for finding text-independent speaker embedding written in tensorflow and tensorpack
☆10Feb 19, 2018Updated 8 years ago
revsic / torch-retriever-vc
View on GitHub
PyTorch implementation of Retriever: Learning Content-Style Representation
☆12Jan 27, 2023Updated 3 years ago
cschaefer26 / StyleMelGAN
View on GitHub
☆10Apr 8, 2024Updated 2 years ago
peisuke / AutoVC.pytorch
View on GitHub
☆23Jul 4, 2020Updated 6 years ago
tli725 / JL-Corpus
View on GitHub
For further understanding the wide array of emotions embedded in human speech, we are introducing an emotional speech corpus. In contrast…
☆11Oct 29, 2018Updated 7 years ago
rgzn-aiyun / melgan-cpu
View on GitHub
Real-time melgan based on cpu ！！！
☆13Dec 3, 2019Updated 6 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
ajinkyakulkarni14 / ERISHA
View on GitHub
ERISHA is a mulitilingual multispeaker expressive speech synthesis framework. It can transfer the expressivity to the speaker's voice for…
☆44Dec 17, 2020Updated 5 years ago
rhoposit / multilingual_VQVAE
View on GitHub
☆37May 8, 2021Updated 5 years ago
jefflai108 / pytorch-kaldi-neural-speaker-embeddings
View on GitHub
A light weight neural speaker embeddings extraction based on Kaldi and PyTorch.
☆136Jan 27, 2020Updated 6 years ago
shaojinding / Adversarial-Many-to-Many-VC
View on GitHub
[InterSpeech 2020] "Improving the Speaker Identity of Non-Parallel Many-to-Many VoiceConversion with Adversarial Speaker Recognition" by …
☆39Mar 24, 2023Updated 3 years ago
CODEJIN / Glow_TTS
View on GitHub
An implement of GlowTTS model. Several modes are added: speaker embedding, prosody encoder(GST), and gradient reversal.
☆55Sep 14, 2022Updated 3 years ago
KimythAnly / AGAIN-VC
View on GitHub
This is the official implementation of the paper AGAIN-VC: A One-shot Voice Conversion using Activation Guidance and Adaptive Instance No…
☆114Dec 7, 2020Updated 5 years ago
ttaoREtw / semi-tts
View on GitHub
Semi-supervised Learning for Multi-speaker Text-to-speech Synthesis Using Discrete Speech Representation
☆39Jul 16, 2020Updated 6 years ago
bagustris / SER_ICSigSys2019
View on GitHub
Repository of code for Speech emotion recognition using voiced speech and attention model, submitted to ICSigSys 2019
☆13Jan 6, 2020Updated 6 years ago
LouisBearing / UnconditionalHeadMotion
View on GitHub
Code & demo for the animation of still facial landmarks from an initial pose.
☆15Jan 19, 2023Updated 3 years ago
Deploy open-source AI quickly and easily - Special Bonus Offer • Ad
Runpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
yistLin / FragmentVC
View on GitHub
Any-to-any voice conversion by end-to-end extracting and fusing fine-grained voice fragments with attention
☆204Nov 30, 2020Updated 5 years ago
ertug / Weak_Class_Source_Separation
View on GitHub
Source code and audio demos for the paper "Audio Source Separation Using Variational Autoencoders and Weak Class Supervision"
☆11Jun 21, 2026Updated last month
skgusrb12 / voice_activity_detection
View on GitHub
Pytorch version of Voice Activity Detection (VAD) based on Deep Learning (https://github.com/filippogiruzzi)
☆27Mar 20, 2021Updated 5 years ago
rhoposit / icassp2021
View on GitHub
☆15May 8, 2021Updated 5 years ago
xcmyz / Tacotron2-Pytorch
View on GitHub
follow NVIDIA, simplify it and support data parallel.
☆13Sep 26, 2019Updated 6 years ago
narVidhai / Speech-Transcription-Benchmarking
View on GitHub
Example python scripts to evaluate various ASR methods
☆11Dec 22, 2021Updated 4 years ago
JunhoKim94 / ASR_project
View on GitHub
This repository created for the NHN ASR hackathon competition.
☆11Sep 20, 2023Updated 2 years ago
hash2430 / pitchtron
View on GitHub
TTS for pitch-accented language. Korean dialect DB.
☆155May 12, 2023Updated 3 years ago
KnurpsBram / AutoVC_WavenetVocoder_GriffinLim_experiments
View on GitHub
Experiments on AutoVC and WaveNet vocoder, compared against the Griffin Lim spectrogram inversion algorithm
☆11Jun 18, 2020Updated 6 years ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
KunZhou9646 / seq2seq-EVC
View on GitHub
This is the implementation of our Interspeech 2021 paper: Limited data emotional voice conversion leveraging text-to-speech: two-stage se…
☆87Dec 31, 2022Updated 3 years ago
KrishnaDN / BERTphone
View on GitHub
Implementation of the paper "BERTphone: Phonetically-aware Encoder Representations for Utterance-level Speaker and Language Recognition"
☆17Dec 10, 2020Updated 5 years ago
warnikchow / kosp2e
View on GitHub
Korean Speech to English Translation Corpus
☆45Sep 3, 2021Updated 4 years ago
nc-ai / speech
View on GitHub
☆17Aug 27, 2025Updated 10 months ago
cnlinxi / tpse_tacotron2
View on GitHub
TPSE-GST Tacotron2
☆14May 1, 2019Updated 7 years ago
foamliu / Speaker-Embeddings
View on GitHub
PyTorch implementation of a self-attentive speaker embedding
☆17Sep 24, 2019Updated 6 years ago
yeongseon / PyCon-KR-2019
View on GitHub
Tutorial session material of Pytest in PyCon KR 2019
☆10Updated this week