azraelkuan/voice-conversion

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/azraelkuan/voice-conversion)

azraelkuan / voice-conversion

an tutorial implement of voice conversion using pytorch

☆34

Alternatives and similar repositories for voice-conversion

Users that are interested in voice-conversion are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

TanUkkii007 / deepvoice3-tensorflow
View on GitHub
A tensorflow based implementation of DeepVoice3 https://arxiv.org/abs/1710.07654
☆13Jun 5, 2018Updated 8 years ago
MlWoo / sentence2pinyin
View on GitHub
tts fronted-end
☆11Dec 19, 2018Updated 7 years ago
MingjieChen / LowResourceVC
View on GitHub
Voice conversion training with 109 speakers with limited training samples
☆35Dec 21, 2020Updated 5 years ago
jxzhanggg / nonparaSeq2seqVC_code
View on GitHub
Implementation code of non-parallel sequence-to-sequence VC
☆248Mar 24, 2023Updated 3 years ago
JeremyCCHsu / vc-vawgan
View on GitHub
Network specification and demo
☆35Jun 5, 2017Updated 9 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
huiw39 / ExtensibleTTS-PyTorch
View on GitHub
An extensible speech synthesis system, build with PyTorch and the original code is from r9y9's https://github.com/r9y9/nnmnkwii_gallery
☆26Jun 24, 2019Updated 7 years ago
CSTR-Edinburgh / ophelia
View on GitHub
Sequence-to-sequence TTS based on Kyubyong's dc_tts
☆61Feb 2, 2023Updated 3 years ago
candlewill / RawNet
View on GitHub
RawNet: Fast End-to-End Neural Vocoder
☆43May 29, 2019Updated 7 years ago
carl-robinson / voice-emotion-seq2seq
View on GitHub
Voice emotion conversion model for DS/ML master's thesis. F0 contour mapping in sequence-to-sequence RNN-LSTM architecture in Tensorflow.
☆27Oct 30, 2018Updated 7 years ago
makerjackie / MTTS
View on GitHub
A Demo of Mandarin/Chinese TTS frontend
☆284Apr 18, 2022Updated 4 years ago
entn-at / DurIAN-1
View on GitHub
Implementation of "DurIAN: Duration Informed Attention Network For Multimodal Synthesis".
☆15Jul 6, 2020Updated 6 years ago
JeremyCCHsu / vae-npvc
View on GitHub
Re-implementation the code used in Voice Conversion from Non-parallel Corpora Using Variational Auto-encoder
☆149Aug 11, 2019Updated 6 years ago
nii-yamagishilab / tacotron2
View on GitHub
An implementation of Tacotron and Tacotron2
☆80Aug 4, 2021Updated 4 years ago
k2kobayashi / sprocket
View on GitHub
Voice Conversion Tool Kit
☆608Feb 27, 2023Updated 3 years ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
guanlongzhao / ppg-gmm
View on GitHub
Code for paper "Using Phonetic Posteriorgram Based Frame Pairing for Segmental Accent Conversion"
☆36Jan 15, 2020Updated 6 years ago
qianjr2002 / WavToolKit
View on GitHub
音频处理小工具
☆14Jun 4, 2026Updated last month
timmahrt / pyJuliusAlign
View on GitHub
One-button-press forced aligner for Japanese, using Julius.
☆48Jul 15, 2023Updated 3 years ago
Dystopiaz / wake-up-android
View on GitHub
语音唤醒
☆13Dec 12, 2018Updated 7 years ago
yoyolicoris / pytorch_FFTNet
View on GitHub
A pytorch implementation of FFTNet.
☆37Aug 31, 2018Updated 7 years ago
KunZhou9646 / Speaker-independent-emotional-voice-conversion-based-on-conditional-VAW-GAN-and-CWT
View on GitHub
This is the implementation of our Interspeech 2020 paper "Converting anyone's emotion: towards speaker-independent emotional voice conver…
☆90Nov 13, 2020Updated 5 years ago
patriceguyot / Yin
View on GitHub
Fast Python implementation of the Yin algorithm: a fundamental frequency estimator
☆108Oct 19, 2022Updated 3 years ago
MingjieChen / DYGANVC
View on GitHub
demo page https://MingjieChen.github.io/dygan-vc
☆66Apr 13, 2022Updated 4 years ago
chaiyujin / dctts-pytorch
View on GitHub
The pytorch implementation of DC-TTS
☆76Jun 20, 2018Updated 8 years ago
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
wnhsu / ScalableFHVAE
View on GitHub
This repository contains the code to reproduce the core results from the paper "Scalable Factorized Hierarchical Variational Autoencoders…
☆53Apr 11, 2018Updated 8 years ago
liusongxiang / StarGAN-Voice-Conversion
View on GitHub
This is a pytorch implementation of the paper: StarGAN-VC: Non-parallel many-to-many voice conversion with star generative adversarial ne…
☆523Oct 11, 2019Updated 6 years ago
njellinas / GAN-Voice-Conversion
View on GitHub
Implementation of GAN architectures for Voice Conversion
☆51May 13, 2019Updated 7 years ago
matrix-io / matrix-malos-wakeword
View on GitHub
MALOS wake word service (Deprecated)
☆16May 10, 2019Updated 7 years ago
itsuki8914 / Voice-morphing-RelGAN
View on GitHub
A implementation voice morphing using relgan with tensorflow
☆25Mar 24, 2023Updated 3 years ago
auspicious3000 / autovc
View on GitHub
AutoVC: Zero-Shot Voice Style Transfer with Only Autoencoder Loss
☆1,100Oct 23, 2024Updated last year
ryuryukke / japanese_summarizer
View on GitHub
A summarizer for Japanese articles (but ChatGPT is better)
☆10Aug 1, 2022Updated 3 years ago
candlewill / CNTN
View on GitHub
ChiNese Text Normalization (CNTN) tool for Text-to-speech system
☆37Apr 12, 2018Updated 8 years ago
stoneMo / ASVspoof
View on GitHub
☆19Dec 8, 2020Updated 5 years ago
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
makerjackie / tts-frontend-dataset
View on GitHub
TTS FrontEnd DataSet: Polyphone / Prosody / TextNormalization
☆104Feb 5, 2024Updated 2 years ago
keonlee9420 / Cross-Speaker-Emotion-Transfer
View on GitHub
PyTorch Implementation of ByteDance's Cross-speaker Emotion Transfer Based on Speaker Condition Layer Normalization and Semi-Supervised T…
☆194Nov 9, 2022Updated 3 years ago
yoyolicoris / wavenet-like-vocoder
View on GitHub
Basic wavenet and fftnet vocoder model.
☆19Feb 7, 2022Updated 4 years ago
CODEJIN / Glow_TTS
View on GitHub
An implement of GlowTTS model. Several modes are added: speaker embedding, prosody encoder(GST), and gradient reversal.
☆55Sep 14, 2022Updated 3 years ago
MattShannon / mcd
View on GitHub
Mel cepstral distortion (MCD) computations in python.
☆231Jun 13, 2017Updated 9 years ago
jdvala / zoom_audio_transcribe
View on GitHub
Zoom Audio Transcription offline
☆34Sep 30, 2020Updated 5 years ago
yanggeng1995 / WaveGlow
View on GitHub
A tensorflow implementation of the WaveGlow: A Flow-based Generative Network for Speech Synthesis
☆20Oct 23, 2019Updated 6 years ago