mmorise / World
A high-quality speech analysis, manipulation and synthesis system
☆1,226 Updated 2 months ago
Alternatives and similar repositories for World:
Users who are interested in World are comparing it to the libraries listed below.
- A Python wrapper for the high-quality vocoder "World" ☆747 Updated 3 months ago
- Library to build speech synthesis systems designed for easy and fast prototyping. ☆397 Updated 9 months ago
- Neural network-based singing voice synthesis library for research ☆715 Updated last year
- A Python wrapper for Speech Signal Processing Toolkit (SPTK). ☆441 Updated 9 months ago
- Unofficial Parallel WaveGAN (+ MelGAN & Multi-band MelGAN & HiFi-GAN & StyleMelGAN) with PyTorch ☆1,603 Updated last year
- Voice Conversion Tool Kit ☆600 Updated 2 years ago
- ☆399 Updated 3 years ago
- Mellotron: a multispeaker voice synthesis model based on Tacotron 2 GST that can make a voice emote and sing without emotive or singing t… ☆860 Updated last year
- This is now the official location of the Merlin project. ☆1,313 Updated 5 years ago
- Efficient neural speech synthesis ☆1,168 Updated 7 months ago
- WaveNet-Vocoder implementation with PyTorch. ☆298 Updated 4 years ago
- Different implementations of "Weighted Prediction Error" for speech dereverberation ☆515 Updated last month
- List of speech synthesis papers. ☆1,037 Updated last year
- MelGAN vocoder (compatible with NVIDIA/tacotron2) ☆644 Updated 4 years ago
- The implementation of FastSpeech based on PyTorch. ☆871 Updated last year
- A suite of speech signal processing tools ☆232 Updated last month
- Flexible audio loudness meter in Python with implementation of ITU-R BS.1770-4 loudness algorithm ☆695 Updated 9 months ago
- WaveNet vocoder ☆2,357 Updated last year
- A wrapper around speech quality metrics MOSNet, BSSEval, STOI, PESQ, SRMR, SISDR ☆959 Updated last year
- A Python library for working with Praat, TextGrids, time-aligned audio transcripts, and audio files. It is primarily used for extracting … ☆328 Updated last year
- ☆152 Updated last year
- AutoVC: Zero-Shot Voice Style Transfer with Only Autoencoder Loss ☆1,056 Updated 6 months ago
- GAN-based Mel-Spectrogram Inversion Network for Text-to-Speech Synthesis ☆1,005 Updated last year
- Unsupervised Speech Decomposition Via Triple Information Bottleneck ☆676 Updated 6 months ago
- End-to-end speech synthesis with recurrent neural networks ☆226 Updated last year
- A vocoder framework that has been widely used in the research community since 1999. ☆180 Updated 6 years ago
- 💎 A list of accessible speech corpora for ASR, TTS, and other Speech Technologies ☆1,323 Updated 10 months ago
- PyTorch implementation of GAN-based text-to-speech synthesis and voice conversion (VC) ☆516 Updated 4 years ago
- CREPE: A Convolutional REpresentation for Pitch Estimation -- pre-trained model (ICASSP 2018) ☆1,208 Updated 8 months ago
- Speech Enhancement Generative Adversarial Network in TensorFlow ☆840 Updated 2 years ago