mmorise / World

A high-quality speech analysis, manipulation and synthesis system

☆1,250

Alternatives and similar repositories for World

Users who are interested in World are comparing it to the libraries listed below.

JeremyCCHsu / Python-Wrapper-for-World-Vocoder
A Python wrapper for the high-quality vocoder "World"
☆763 Updated 6 months ago
k2kobayashi / sprocket
Voice Conversion Tool Kit
☆603 Updated 2 years ago
r9y9 / nnmnkwii
Library to build speech synthesis systems designed for easy and fast prototyping.
☆398 Updated last year
nnsvs / nnsvs
Neural network-based singing voice synthesis library for research
☆724 Updated last year
kan-bayashi / ParallelWaveGAN
Unofficial Parallel WaveGAN (+ MelGAN & Multi-band MelGAN & HiFi-GAN & StyleMelGAN) with PyTorch
☆1,611 Updated last year
r9y9 / pysptk
A Python wrapper for Speech Signal Processing Toolkit (SPTK).
☆445 Updated last year
google / REAPER
☆405 Updated 3 years ago
r9y9 / gantts
PyTorch implementation of GAN-based text-to-speech synthesis and voice conversion (VC)
☆516 Updated 4 years ago
NVIDIA / mellotron
Mellotron: a multispeaker voice synthesis model based on Tacotron 2 GST that can make a voice emote and sing without emotive or singing t…
☆861 Updated 2 years ago
CSTR-Edinburgh / merlin
This is now the official location of the Merlin project.
☆1,316 Updated 5 years ago
f90 / Wave-U-Net
Implementation of the Wave-U-Net for audio source separation
☆902 Updated 2 years ago
wenet-e2e / speech-synthesis-paper
List of speech synthesis papers.
☆1,052 Updated 2 years ago
xiph / LPCNet
Efficient neural speech synthesis
☆1,182 Updated 10 months ago
kan-bayashi / PytorchWaveNetVocoder
WaveNet-Vocoder implementation with PyTorch.
☆300 Updated 5 years ago
ina-foss / inaSpeechSegmenter
CNN-based audio segmentation toolkit. Allows detection of speech, music, noise and speaker gender. Has been designed for large scale gender …
☆827 Updated 6 months ago
r9y9 / wavenet_vocoder
WaveNet vocoder
☆2,360 Updated 2 years ago
marl / crepe
CREPE: A Convolutional REpresentation for Pitch Estimation -- pre-trained model (ICASSP 2018)
☆1,255 Updated 11 months ago
tuanad121 / Python-WORLD
☆152 Updated last year
aliutkus / speechmetrics
A wrapper around speech quality metrics MOSNet, BSSEval, STOI, PESQ, SRMR, SISDR
☆988 Updated 2 years ago
drethage / speech-denoising-wavenet
A neural network for end-to-end speech denoising
☆700 Updated 2 years ago
auspicious3000 / autovc
AutoVC: Zero-Shot Voice Style Transfer with Only Autoencoder Loss
☆1,071 Updated 9 months ago
csteinmetz1 / pyloudnorm
Flexible audio loudness meter in Python with implementation of ITU-R BS.1770-4 loudness algorithm
☆713 Updated last year
HidekiKawahara / legacy_STRAIGHT
A vocoder framework that has been widely used in the research community since 1999.
☆181 Updated 6 years ago
santi-pdp / segan
Speech Enhancement Generative Adversarial Network in TensorFlow
☆844 Updated 2 years ago
xcmyz / FastSpeech
An implementation of FastSpeech based on PyTorch.
☆873 Updated 2 years ago
descriptinc / melgan-neurips
GAN-based Mel-Spectrogram Inversion Network for Text-to-Speech Synthesis
☆1,015 Updated last year
fgnt / nara_wpe
Different implementations of "Weighted Prediction Error" for speech dereverberation
☆528 Updated 4 months ago
sp-nitech / SPTK
A suite of speech signal processing tools
☆238 Updated last week
hujinsen / StarGAN-Voice-Conversion
Full TensorFlow implementation of the paper: StarGAN-VC: Non-parallel many-to-many voice conversion with star generative adversarial netw…
☆273 Updated last year
Kyubyong / css10
CSS10: A Collection of Single Speaker Speech Datasets for 10 Languages
☆475 Updated 5 years ago