shakingWaves / LPCNet_torch
torch version of LPCNet
☆20Updated 4 years ago
Alternatives and similar repositories for LPCNet_torch:
Users that are interested in LPCNet_torch are comparing it to the libraries listed below
- [InterSpeech 2020] "Improving the Speaker Identity of Non-Parallel Many-to-Many VoiceConversion with Adversarial Speaker Recognition" by …☆39Updated last year
- TTS Text Analyzer☆32Updated last year
- HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis☆41Updated 3 years ago
- A Pytorch version of LPCNet, including dump weight☆32Updated 2 years ago
- Audio LPC (linear prediction code) using mel spectorgram, compatible for LPCNet☆61Updated 3 years ago
- Production first, nn-based on-device signal processing toolkit.☆64Updated last year
- Mutiband version of HIFIGAN☆17Updated 4 years ago
- Materials accompanying the paper "Phonological features for 0-shot multilingual speech synthesis"☆32Updated 4 years ago
- RepVgg + HiFiGAN☆33Updated 2 years ago
- Conditional Variational Auto-Encoder with Jointly Training FastSpeech2(+Conformer) and HiFi-GAN for End to End Text to Speech☆46Updated 2 years ago
- Official repo of ICASSP 2024 paper - Generative De-Quantization for Neural Speech Codec via Latent Diffusion.☆49Updated last month
- C++ implementation of End to End TTS which combines both Tacatron2 and LPCNET Vocoder.☆32Updated 5 years ago
- Chinese Prosodic Structure Prediction☆10Updated 5 years ago
- pytorch CTC implementation for ASR. Use eesen's fst decoder framework☆10Updated 4 years ago
- Spherical residual vector quantization (SRVQ)☆28Updated 5 months ago
- Reproduction of paper: Disentangling Correlated Speaker and Noise for Speech Synthesis via Data Augmentation and Adversarial Factorizatio…☆17Updated 5 years ago
- Apply Score diffusion to improve speech signals recorded under various adverse conditions and distortions, including noise, reverberation…☆43Updated 6 months ago
- ☆48Updated last year
- ☆25Updated 6 months ago
- ☆31Updated 2 years ago
- (WIP)long form speech generatoins☆30Updated 2 months ago
- Linear Prediction Coefficients estimation from mel-spectrogram implemented in Python based on Levinson-Durbin algorithm.☆68Updated 3 years ago
- This repo contains conv-tasnet for basis-melgan. If you want to get code of basis-melgan, please refer to FastVocoder.☆20Updated 3 years ago
- streaming attention networks for end-to-end automatic speech recognition☆55Updated 4 years ago
- An unofficial implementation of "UniCATS: A Unified Context-Aware Text-to-Speech Framework with Contextual VQ-Diffusion and Vocoding".☆24Updated last year
- ☆11Updated 4 years ago
- SC-CNN: Effective Speaker Conditioning Method for Zero-Shot Multi-Speaker Text-to-Speech Systems☆39Updated last year
- ☆44Updated last year
- Voice conversion model for real-time speech synthesis using PPG (Phonetic PosteriorGram) as an intermediate feature, written in Pytorch.☆27Updated 2 years ago
- ☆46Updated 2 months ago