rendchevi / daisy-tts
🌼 Daisy-TTS: Simulating Wider Spectrum of Emotions via Prosody Embedding Decomposition
★14 · Updated 6 months ago
Related projects:
- StyleTTS-ZS: Efficient High-Quality Zero-Shot Text-to-Speech Synthesis with Distilled Time-Varying Style Diffusion ★72 · Updated this week
- VALL-E 2 reproduction ★72 · Updated 2 months ago
- An espeak-compatible, permissively-licensed IPA phonemizer (G2P) based on DeepPhonemizer. Usable as a drop-in replacement for espeak's GP… ★74 · Updated 2 months ago
- Application of MB-iSTFT-VITS components to vits2_pytorch ★107 · Updated 2 months ago
- DEX-TTS: Diffusion-based EXpressive TTS with Style Modeling on Time Variability ★79 · Updated 2 months ago
- An unofficial PyTorch implementation of VALL-E ★68 · Updated this week
- ★28 · Updated last month
- A sequence-to-sequence voice conversion toolkit. ★84 · Updated 2 months ago
- ZMM-TTS: Zero-shot Multilingual and Multispeaker Speech Synthesis Conditioned on Self-supervised Discrete Speech Representations ★111 · Updated 6 months ago
- [IJCAI'23] Learning to Speak from Text for Low-Resource TTS ★63 · Updated last year
- Zero-Shot Emotion Style Transfer ★33 · Updated 5 months ago
- Unsupervised Rhythm Modeling for Voice Conversion ★78 · Updated last year
- An open source implementation of Microsoft's VALL-E X zero-shot TTS model. Demo is available in https://plachtaa.github.io ★66 · Updated 11 months ago
- ★75 · Updated 3 months ago
- This is the official repository for the HUI-Audio-Corpus-German. The corresponding paper is in the process of publication. With the repo… ★26 · Updated last year
- ★62 · Updated 4 months ago
- ★27 · Updated 10 months ago
- VoiceBox neural network implementation ★88 · Updated last month
- This is an implementation for train hifigan part of XTTSv2 model using Coqui/TTS. ★54 · Updated last month
- A toolkit to calculate speech audio quality. Not affiliated with the original authors ★26 · Updated last month
- All generative model in one for better TTS model ★64 · Updated last week
- Reference-aware automatic speech evaluation toolkit ★95 · Updated 6 months ago
- ★69 · Updated last year
- ★32 · Updated last year
- Speaker change detection using SincNet and an LSTM/Transformer ★39 · Updated 2 months ago
- Style-Controllable Zero-Shot Text to Speech Synthesizer based on VALL-E ★134 · Updated last year
- Implementation of DCComix TTS: An End-to-End Expressive TTS with Discrete Code Collaborated with Mixer ★72 · Updated last year
- UTokyo-SaruLab MOS Prediction System ★48 · Updated this week
- PyTorch code implementation of EfficientSpeech - to be presented at ICASSP2023. ★149 · Updated 6 months ago
- ★60 · Updated last year