Tacotron2 with BERT examples
☆10 · Jul 8, 2019 · Updated 6 years ago
Alternatives and similar repositories for Taco2withBERT
Users that are interested in Taco2withBERT are comparing it to the libraries listed below.
- Neural Lexicon Reader: Reduce Pronunciation Errors in End-to-end TTS by Leveraging External Textual Knowledge ☆21 · Jul 25, 2022 · Updated 3 years ago
- ☆19 · Feb 28, 2018 · Updated 8 years ago
- ☆31 · Nov 7, 2018 · Updated 7 years ago
- PPSpeech: Phrase based Parallel End-to-End TTS System ☆35 · Aug 31, 2020 · Updated 5 years ago
- ☆18 · Aug 9, 2018 · Updated 7 years ago
- Using the WORLD vocoder to extract features and make data for training neural networks ☆11 · Oct 9, 2017 · Updated 8 years ago
- [InterSpeech 2020] "Improving the Speaker Identity of Non-Parallel Many-to-Many Voice Conversion with Adversarial Speaker Recognition" by … ☆39 · Mar 24, 2023 · Updated 3 years ago
- Transfer Learning from Monolingual ASR to Transcription-free Cross-lingual Voice Conversion ☆40 · Oct 22, 2022 · Updated 3 years ago
- Speech (audio) subjective evaluation system ☆42 · Jul 15, 2020 · Updated 5 years ago
- Surrey CVSSP DCASE 2018 Task 2 system ☆20 · Dec 26, 2022 · Updated 3 years ago
- Code repo for "Model-Generated Pretraining Signals Improves Zero-Shot Generalization of Text-to-Text Transformers" (ACL 2023) ☆22 · Nov 1, 2023 · Updated 2 years ago
- PyTorch implementation of "EfficientTTS: An Efficient and High-Quality Text-to-Speech Architecture" ☆116 · Dec 22, 2021 · Updated 4 years ago
- ERISHA is a multilingual multispeaker expressive speech synthesis framework. It can transfer the expressivity to the speaker's voice for… ☆44 · Dec 17, 2020 · Updated 5 years ago
- DCASE2019 Challenge Task 1 baseline system ☆20 · Oct 11, 2019 · Updated 6 years ago
- My hybrid TTS network that combines VALL-E, VoiceBox, SpeechFlow, Seamless and TortoiseTTS into one ☆26 · Aug 5, 2024 · Updated last year
- Efficient Personalized Speech Enhancement through Self-Supervised Learning ☆23 · Mar 12, 2023 · Updated 3 years ago
- A DSP library written in Python for performing HRTFs ☆21 · Aug 15, 2016 · Updated 9 years ago
- Core code for my ICASSP 2018 paper ☆53 · Jul 27, 2018 · Updated 7 years ago
- TensorFlow Implementation of CDVAE-VC. ☆54 · Mar 24, 2023 · Updated 3 years ago
- ☆11 · Mar 15, 2017 · Updated 9 years ago
- ☆25 · Mar 12, 2022 · Updated 4 years ago
- ☆34 · Mar 21, 2026 · Updated last week
- Mel-Generalized Cepstrum analysis ☆19 · Jul 21, 2017 · Updated 8 years ago
- Compressed version of Tacotron 2 using Tensor Train + WaveGlow. ☆22 · Dec 26, 2019 · Updated 6 years ago
- Chinese Text Normalization and Dataset ☆91 · May 14, 2022 · Updated 3 years ago
- Homemade LightGBM and VGG-net experiment setup for DCASE2017 task 1 ☆11 · Aug 8, 2017 · Updated 8 years ago
- These are the results for VoiceGAN voice transformation. You can listen to the audio samples in folder A-AB-ABA/B-BA-BAB ☆50 · Apr 9, 2019 · Updated 6 years ago
- Multi-task speech classification of the accent and gender of an English speaker on Mozilla's Common Voice dataset ☆27 · May 30, 2025 · Updated 10 months ago
- [IJCAI'23] Learning to Speak from Text for Low-Resource TTS ☆64 · May 30, 2023 · Updated 2 years ago
- Source code for paper: INTERVENOR: Prompt the Coding Ability of Large Language Models with the Interactive Chain of Repairing ☆30 · Nov 25, 2024 · Updated last year
- Demo audio of VARA-TTS model ☆20 · Jun 11, 2021 · Updated 4 years ago
- Audio streaming transfer demo with google.api.HttpBody and gRPC gateway for speech synthesis ☆20 · Jan 28, 2020 · Updated 6 years ago
- WaveRNN-based waveform generator & demo of TensorFlow CuDNN-GRU usage. ☆24 · Aug 19, 2018 · Updated 7 years ago
- ☆12 · Nov 19, 2024 · Updated last year
- An implementation of "Investigation of enhanced Tacotron text-to-speech synthesis systems with self-attention for pitch accent language" … ☆114 · Jun 19, 2020 · Updated 5 years ago
- ☆13 · Dec 8, 2022 · Updated 3 years ago
- Code for paper A3T: Alignment-Aware Acoustic and Text Pretraining for Speech Synthesis and Editing ☆89 · Sep 6, 2024 · Updated last year
- Tacotron2 + LPCNET for complete End-to-End TTS System ☆93 · Jul 6, 2023 · Updated 2 years ago
- Online (real-time) decoder to be used with DeepSpeech2 model ☆25 · Feb 27, 2020 · Updated 6 years ago