facebookresearch / vocoder-benchmark
A repository for benchmarking neural vocoders by their quality and speed.
☆201 · Updated 3 weeks ago
Related projects:
- Official repository for the paper "Chunked Autoregressive GAN for Conditional Waveform Synthesis" ☆187 · Updated last year
- Avocodo: Generative Adversarial Network for Artifact-free Vocoder ☆115 · Updated 2 years ago
- Fre-GAN: Adversarial Frequency-consistent Audio Synthesis ☆101 · Updated 3 years ago
- ☆161 · Updated 2 years ago
- Evaluation and Benchmarking of Speech Super-resolution Methods ☆133 · Updated 2 years ago
- NANSY++: Unified Voice Synthesis with Neural Analysis and Synthesis ☆139 · Updated last year
- HiFi++: a Unified Framework for Neural Vocoding, Bandwidth Extension and Speech Enhancement ☆151 · Updated 2 years ago
- ICASSP 2023 Accepted ☆189 · Updated 4 months ago
- MOS score prediction by fine-tuned wav2vec2.0 model ☆135 · Updated last year
- This is the implementation for "ControlVC: Zero-Shot Voice Conversion with Time-Varying Controls on Pitch and Rhythm" ☆126 · Updated 9 months ago
- Pitch Estimating Neural Networks (PENN) ☆227 · Updated last month
- ☆160 · Updated 2 years ago
- PyTorch Implementation of Multi-Singer (ACM-MM'21) ☆138 · Updated 2 years ago
- An unofficial implementation of the paper "One-shot Voice Conversion by Separating Speaker and Content Representations with Instance Norm… ☆114 · Updated 3 years ago
- Official PyTorch implementation of Speaker Conditional WaveRNN ☆109 · Updated 2 years ago
- PyTorch implementation of "Grad-TTS: A Diffusion Probabilistic Model for Text-to-Speech" ☆177 · Updated 10 months ago
- PyTorch Implementation of Google's Parallel Tacotron 2: A Non-Autoregressive Neural TTS Model with Differentiable Duration Modeling ☆188 · Updated 2 years ago
- A toolkit for non-parallel voice conversion based on vector-quantized variational autoencoder ☆168 · Updated last month
- Official implementation of Multi-Stage Multi-Codebook (MSMC) TTS ☆159 · Updated 5 months ago
- Reference-aware automatic speech evaluation toolkit ☆95 · Updated 6 months ago
- Unofficial PyTorch implementation of BigVGAN: A Universal Neural Vocoder with Large-Scale Training ☆130 · Updated last year
- Official implementation of SpeechSplit2 ☆126 · Updated last year
- A differentiable version of SPTK ☆160 · Updated this week
- An official implementation of "UnitSpeech: Speaker-adaptive Speech Synthesis with Untranscribed Data" ☆131 · Updated last year
- ☆96 · Updated 3 years ago
- iSTFTNet: Fast and Lightweight Mel-spectrogram Vocoder Incorporating Inverse Short-time Fourier Transform ☆221 · Updated last year
- LibriTTS-P: A Corpus with Speaking Style and Speaker Identity Prompts for Text-to-Speech and Style Captioning ☆108 · Updated 3 months ago
- An implementation of "Phonetic Posteriorgrams based Many-to-Many Singing Voice Conversion via Adversarial Training" ☆123 · Updated 3 years ago
- Expressive Anechoic Recordings of Speech (EARS) ☆123 · Updated 2 months ago
- UT-Sarulab MOS prediction system using SSL models ☆163 · Updated 5 months ago