mmorise / World
A high-quality speech analysis, manipulation and synthesis system
☆1,212Updated 3 months ago
Alternatives and similar repositories for World:
Users that are interested in World are comparing it to the libraries listed below
- A Python wrapper for the high-quality vocoder "World"☆737Updated last week
- Neural network-based singing voice synthesis library for research☆700Updated last year
- Library to build speech synthesis systems designed for easy and fast prototyping.☆395Updated 7 months ago
- Voice Conversion Tool Kit☆601Updated last year
- Unofficial Parallel WaveGAN (+ MelGAN & Multi-band MelGAN & HiFi-GAN & StyleMelGAN) with Pytorch☆1,588Updated 9 months ago
- A python wrapper for Speech Signal Processing Toolkit (SPTK).☆442Updated 6 months ago
- List of speech synthesis papers.☆1,017Updated last year
- A flexible source separation library in Python☆626Updated last month
- ☆395Updated 3 years ago
- Mellotron: a multispeaker voice synthesis model based on Tacotron 2 GST that can make a voice emote and sing without emotive or singing t…☆855Updated last year
- WaveNet-Vocoder implementation with pytorch.☆298Updated 4 years ago
- ☆151Updated last year
- Different implementations of "Weighted Prediction Error" for speech dereverberation☆499Updated 4 months ago
- Efficient neural speech synthesis☆1,149Updated 4 months ago
- A suite of speech signal processing tools☆232Updated last month
- A vocoder framework which had been widely used in research community since 1999.☆178Updated 6 years ago
- This is now the official location of the Merlin project.☆1,308Updated 4 years ago
- 💎 A list of accessible speech corpora for ASR, TTS, and other Speech Technologies☆1,301Updated 7 months ago
- CREPE: A Convolutional REpresentation for Pitch Estimation -- pre-trained model (ICASSP 2018)☆1,152Updated 5 months ago
- WaveNet vocoder☆2,340Updated last year
- GAN-based Mel-Spectrogram Inversion Network for Text-to-Speech Synthesis☆989Updated last year
- Evaluation functions for music/audio information retrieval/signal processing algorithms.☆621Updated this week
- PyTorch implementation of GAN-based text-to-speech synthesis and voice conversion (VC)☆516Updated 4 years ago
- Flexible audio loudness meter in Python with implementation of ITU-R BS.1770-4 loudness algorithm☆678Updated 6 months ago
- HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis☆2,023Updated 6 months ago
- CNN-based audio segmentation toolkit. Allows to detect speech, music, noise and speaker gender. Has been designed for large scale gender …☆775Updated 3 weeks ago
- A wrapper around speech quality metrics MOSNet, BSSEval, STOI, PESQ, SRMR, SISDR☆935Updated last year
- A python library for working with praat, textgrids, time aligned audio transcripts, and audio files. It is primarily used for extracting …☆320Updated last year
- Unsupervised Speech Decomposition Via Triple Information Bottleneck☆661Updated 3 months ago
- AutoVC: Zero-Shot Voice Style Transfer with Only Autoencoder Loss☆1,027Updated 3 months ago