xiph / LPCNet
Efficient neural speech synthesis
β1,147Updated 3 months ago
Alternatives and similar repositories for LPCNet:
Users that are interested in LPCNet are comparing it to the libraries listed below
- Unofficial Parallel WaveGAN (+ MelGAN & Multi-band MelGAN & HiFi-GAN & StyleMelGAN) with Pytorchβ1,584Updated 8 months ago
- The Implementation of FastSpeech based on pytorch.β862Updated last year
- π A list of accessible speech corpora for ASR, TTS, and other Speech Technologiesβ1,298Updated 7 months ago
- This is a list of features, scripts, blogs and resources for better using Kaldi ( http://kaldi-asr.org/ )β535Updated 2 years ago
- Mellotron: a multispeaker voice synthesis model based on Tacotron 2 GST that can make a voice emote and sing without emotive or singing tβ¦β855Updated last year
- TensorFlowASR: Almost State-of-the-art Automatic Speech Recognition in Tensorflow 2. Supported languages that can use characters or subwβ¦β953Updated this week
- End-2-end speech synthesis with recurrent neural networksβ225Updated 10 months ago
- CSS10: A Collection of Single Speaker Speech Datasets for 10 Languagesβ468Updated 4 years ago
- MelGAN vocoder (compatible with NVIDIA/tacotron2)β641Updated 4 years ago
- A Generative Flow for Text-to-Speech via Monotonic Alignment Searchβ674Updated 2 years ago
- g2p: English Grapheme To Phoneme Conversionβ831Updated 2 years ago
- Voice activity detection (VAD) toolkit including DNN, bDNN, LSTM and ACAM based VAD. We also provide our directly recorded dataset.β847Updated 3 years ago
- A Python wrapper for the high-quality vocoder "World"β736Updated last year
- The Microsoft Scalable Noisy Speech Dataset (MS-SNSD) is a noisy speech dataset that can scale to arbitrary sizes depending on the numberβ¦β500Updated 6 months ago
- The SpeechBrain project aims to build a novel speech toolkit fully based on PyTorch. With SpeechBrain users can easily create speech procβ¦β366Updated last month
- Reference implementation of real-time autoregressive wavenet inferenceβ737Updated 3 years ago
- Deep Xi: A deep learning approach to a priori SNR estimation implemented in TensorFlow 2/Keras. For speech enhancement and robust ASR.β502Updated 2 years ago
- Open tools and data for cloudless automatic speech recognitionβ446Updated 3 years ago
- PyTorch implementation of GAN-based text-to-speech synthesis and voice conversion (VC)β516Updated 4 years ago
- GAN-based Mel-Spectrogram Inversion Network for Text-to-Speech Synthesisβ989Updated last year
- Speech Enhancement Generative Adversarial Network in TensorFlowβ829Updated last year
- Large, modern dataset for speech recognitionβ656Updated 10 months ago
- Real-time Voice Activity Detection in Noisy Eniviroments using Deep Neural Networksβ429Updated 4 years ago
- Deep Speaker: an End-to-End Neural Speaker Embedding System.β911Updated 9 months ago
- SincNet is a neural architecture for efficiently processing raw audio samples.β1,146Updated 3 years ago
- This is now the official location of the Merlin project.β1,307Updated 4 years ago
- WaveNet vocoderβ2,337Updated last year
- A tensorflow implementation of the "Style Tokens: Unsupervised Style Modeling, Control and Transfer in End-to-End Speech Synthesis"β368Updated 6 years ago
- Perceptual Quality Estimator for speech and audioβ723Updated 5 months ago
- A python wrapper for Speech Signal Processing Toolkit (SPTK).β442Updated 6 months ago