Semi-supervised Learning for Multi-speaker Text-to-speech Synthesis Using Discrete Speech Representation
☆39Jul 16, 2020Updated 5 years ago
Alternatives and similar repositories for semi-tts
Users that are interested in semi-tts are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- An imporved version of Fastsinging singing voice synthesising system.☆21Nov 3, 2020Updated 5 years ago
- Quasi-Periodic Parallel WaveGAN Pytorch implementation☆46Oct 29, 2022Updated 3 years ago
- ☆26Apr 21, 2021Updated 4 years ago
- Official PyTorch implementation of Speaker Conditional WaveRNN☆110Jun 22, 2022Updated 3 years ago
- A Pytorch implementation for the ZeroSpeech 2019 challenge.☆112Nov 12, 2019Updated 6 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Implementation of the AlignTTS☆77Jul 6, 2023Updated 2 years ago
- ☆64Aug 14, 2023Updated 2 years ago
- Vector-Quantized Contrastive Predictive Coding for Acoustic Unit Discovery and Voice Conversion☆143Sep 1, 2020Updated 5 years ago
- PyTorch implementation for Deep Griffin-Lim Iteration paper(https://arxiv.org/abs/1903.03971)☆39Oct 12, 2019Updated 6 years ago
- A toolset for easy formant extraction and visualization from wav files and TTS models☆33Sep 2, 2022Updated 3 years ago
- Contains code for our work on speech to singing conversion (ICASSP 2020)☆50Oct 27, 2020Updated 5 years ago
- Fatcord's Alternative WaveRNN (Faster training)☆132Nov 29, 2020Updated 5 years ago
- ☆34Jul 16, 2019Updated 6 years ago
- PyTorch implementation of the paper "NanoFlow: Scalable Normalizing Flows with Sublinear Parameter Complexity." (NeurIPS 2020)☆67Dec 28, 2020Updated 5 years ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- A PyTorch implementation of "Robust Universal Neural Vocoding"☆238Nov 14, 2020Updated 5 years ago
- An unofficial implementation of the paper "One-shot Voice Conversion by Separating Speaker and Content Representations with Instance Norm…☆118May 27, 2021Updated 4 years ago
- LVCNet: Efficient Condition-Dependent Modeling Network for Waveform Generation☆80Feb 24, 2021Updated 5 years ago
- Collect Voice Conversion researches☆97Updated this week
- An unofficial implementation of https://arxiv.org/abs/2005.05106☆49Mar 10, 2021Updated 5 years ago
- GPT-style network for phonemization with durations of text☆68Mar 21, 2024Updated 2 years ago
- Code for ISMIR 2020 paper: "Multiple F0 Estimation in Vocal Ensembles using Convolutional Neural Networks"☆55Nov 20, 2024Updated last year
- ☆45Dec 16, 2019Updated 6 years ago
- Wavenet Autoencoder for Unsupervised speech representation learning (after Chorowski, Jan 2019)☆176Sep 16, 2020Updated 5 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- The official implementation of VAENAR-TTS, a VAE based non-autoregressive TTS model.☆144Jul 8, 2021Updated 4 years ago
- ICASSP 2020 ESPnet-TTS: Merlin baseline system☆36Oct 28, 2019Updated 6 years ago
- Pytorch implementation of "Efficienttts: an efficient and high-quality text-to-speech architecture"☆116Dec 22, 2021Updated 4 years ago
- ☆25Mar 12, 2022Updated 4 years ago
- ☆10Sep 19, 2022Updated 3 years ago
- An evaluation toolkit for voice conversion models.☆42Jul 11, 2021Updated 4 years ago
- Compressed version of Tacotron 2 using Tensor Train + Waveglow.☆22Dec 26, 2019Updated 6 years ago
- ☆46Apr 16, 2023Updated 3 years ago
- Implementation of "Duration Informed Attention Network for Multimodal Synthesis" paper in PyTorch.☆184Aug 12, 2020Updated 5 years ago
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Pytorch Implementation of WaveNODE☆64Sep 4, 2020Updated 5 years ago
- ☆69Mar 31, 2021Updated 5 years ago
- VCTK multi-speaker tacotron for ICASSP 2020☆266Mar 29, 2022Updated 4 years ago
- (R&D) Text to speech using phonemes as inputs and audio codec codes as outputs. Loosely based on MegaByte, VALL-E and Encodec.☆48Sep 4, 2023Updated 2 years ago
- Transfer Learning from Monolingual ASR to Transcription-free Cross-lingual Voice Conversion☆40Oct 22, 2022Updated 3 years ago
- An unofficial implementation of the paper "AutoVC: Zero-Shot Voice Style Transfer with Only Autoencoder Loss".☆34Apr 26, 2021Updated 4 years ago
- ☆16Apr 4, 2022Updated 4 years ago