☆21Feb 27, 2024Updated 2 years ago
Alternatives and similar repositories for FastSpeech2_ACL2022_reproducibility
Users that are interested in FastSpeech2_ACL2022_reproducibility are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆25Mar 12, 2022Updated 4 years ago
- TTS Text Analyzer☆31Jul 20, 2023Updated 2 years ago
- The Official Implementation of “Content-Dependent Fine-Grained Speaker Embedding for Zero-Shot Speaker Adaptation in Text-to-Speech Synth…☆88Dec 20, 2022Updated 3 years ago
- Official implementation of "Unsupervised Pre-training for Data-Efficient Text-to-Speech on Low Resource Languages", ICASSP 2023☆27Apr 27, 2023Updated 3 years ago
- ☆101Jan 19, 2026Updated 3 months ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Based on https://github.com/fatchord/WaveRNN☆24May 3, 2020Updated 6 years ago
- [ICASSP 2024] StoryTTS: A Highly Expressive Text-to-Speech Dataset with Rich Textual Expressiveness Annotations☆141Apr 27, 2024Updated 2 years ago
- Voicebox: Text-Guided Multilingual Universal Speech Generation at Scale☆28Aug 4, 2023Updated 2 years ago
- A diffusion-based cross-lingual voice conversion model, as my bachelor's thesis☆45Jul 24, 2023Updated 2 years ago
- Deep Neural Pitch Extractor for Voice Conversion and TTS Training☆149Aug 22, 2022Updated 3 years ago
- ☆13Nov 16, 2020Updated 5 years ago
- Automatic speech annotator processing speech with voice activaty detection, overlapping speech detection, speaker diarization and automat…☆33Jun 14, 2024Updated last year
- High-level API for tar-based dataset☆12Feb 3, 2024Updated 2 years ago
- Multi-Speaker Pytorch FastSpeech2: Fast and High-Quality End-to-End Text to Speech☆99Oct 14, 2022Updated 3 years ago
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- Provides training, inference and voice conversion recipes for RADTTS and RADTTS++: Flow-based TTS models with Robust Alignment Learning, …☆292Apr 6, 2023Updated 3 years ago
- [IJCAI'23] Learning to Speak from Text for Low-Resource TTS☆65May 30, 2023Updated 2 years ago
- A simple voice conversion tool☆20Mar 10, 2022Updated 4 years ago
- A tensorflow based implementation of DeepVoice3 https://arxiv.org/abs/1710.07654☆13Jun 5, 2018Updated 7 years ago
- Clean and modernized implementation of FastSpeech2/LightSpeech using IPA☆18Aug 16, 2024Updated last year
- Vocoder-Free Non-Parallel Conversion of Whispered Speech With Masked Cycle-Consistent Generative Adversarial Networks☆17Aug 18, 2023Updated 2 years ago
- Official Implement of Multi-Stage Multi-Codebook (MSMC) TTS☆169Apr 10, 2024Updated 2 years ago
- ☆64Sep 18, 2022Updated 3 years ago
- Vocoder NSF-HiFiGAN (Moved into deepaudio)☆56Dec 11, 2022Updated 3 years ago
- End-to-end encrypted email - Proton Mail • AdSpecial offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
- Blazing fast data loading with HuggingFace Dataset and Ray Data☆16Jan 12, 2024Updated 2 years ago
- ☆44Jun 25, 2025Updated 10 months ago
- Official repository of DailyTalk: Spoken Dialogue Dataset for Conversational Text-to-Speech, ICASSP 2023☆252Jun 5, 2025Updated 11 months ago
- ☆61Nov 4, 2023Updated 2 years ago
- End-To-End SpeechSynthesis system with knowledge distillation☆17Jul 16, 2022Updated 3 years ago
- PyTorch Implementation of Meta-StyleSpeech : Multi-Speaker Adaptive Text-to-Speech Generation☆197Feb 10, 2022Updated 4 years ago
- Official PyTorch implementation of "EdVAE: Mitigating Codebook Collapse with Evidential Discrete Variational Autoencoders"☆14Sep 20, 2024Updated last year
- Diffusion-based Speech Enhancement: Demonstration of Performance and Generalization☆14Dec 21, 2024Updated last year
- 60k hours of phoneme-aligned audio from audio books☆19Jul 27, 2024Updated last year
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- An unofficial PyTorch implementation of Mix-Phoneme-Bert☆40Jul 10, 2023Updated 2 years ago
- NU-Wave 2: A General Neural Audio Upsampling Model for Various Sampling Rates [WIP]☆25Jul 5, 2022Updated 3 years ago
- This is an evolving repo for the paper "Towards Controllable Speech Synthesis in the Era of Large Language Models: A Systematic Survey".☆235Apr 28, 2026Updated last week
- Demo page of our paper Efficiently Trainable Text-to-Speech System Based on Deep Convolutional Networks With Guided Attention, ICASSP 201…☆15May 30, 2021Updated 4 years ago
- Glow-TTS with Stochastic Duration Predictor and Stochastic Pitch Predictor☆19Jun 5, 2023Updated 2 years ago
- Implementation for paper "Disentangled Speech Representation Learning for One-Shot Cross-Lingual Voice Conversion Using ß-VAE"☆44Apr 10, 2023Updated 3 years ago
- Official implementation of EdiTTS: Score-based Editing for Controllable Text-to-Speech (INTERSPEECH 2022)☆122Jan 24, 2023Updated 3 years ago