rendchevi / daisy-tts
🌼 Daisy-TTS: Simulating Wider Spectrum of Emotions via Prosody Embedding Decomposition
☆15 · Updated 8 months ago
Related projects
Alternatives and complementary repositories for daisy-tts
- Application of MB-iSTFT-VITS components to vits2_pytorch ☆118 · Updated this week
- ZMM-TTS: Zero-shot Multilingual and Multispeaker Speech Synthesis Conditioned on Self-supervised Discrete Speech Representations ☆126 · Updated 8 months ago
- Update ASR paper everyday ☆54 · Updated this week
- A sequence-to-sequence voice conversion toolkit. ☆86 · Updated 4 months ago
- ☆77 · Updated 6 months ago
- Zero-Shot Emotion Style Transfer ☆37 · Updated 7 months ago
- The official implementation of EmoSphere-TTS ☆85 · Updated 3 months ago
- The official implementation of EmoSphere++ ☆41 · Updated 2 weeks ago
- Evaluation Protocol for Large-Scale Zero-Shot TTS Literature ☆66 · Updated last month
- This is the official repository for the HUI-Audio-Corpus-German. The corresponding paper is in the process of publication. With the repo… ☆26 · Updated last year
- VALL-E 2 reproduction ☆87 · Updated 4 months ago
- Reference-aware automatic speech evaluation toolkit ☆109 · Updated 9 months ago
- ☆62 · Updated last year
- Official implementation for Fast-HuBERT: An Efficient Training Framework for Self-Supervised Speech Representation Learning ☆81 · Updated this week
- A toolkit to calculate speech audio quality. Not affiliated with the original authors ☆39 · Updated 3 months ago
- Speaker change detection using SincNet and an LSTM/Transformer ☆44 · Updated 4 months ago
- ☆58 · Updated 2 months ago
- Official repository for the "Powerset multi-class cross entropy loss for neural speaker diarization" paper published in Interspeech 2023. ☆71 · Updated last year
- Train the next generation of TTS systems. ☆161 · Updated 2 months ago
- An unofficial PyTorch implementation of VALL-E ☆77 · Updated this week
- [IJCAI'23] Learning to Speak from Text for Low-Resource TTS ☆64 · Updated last year
- ☆36 · Updated 7 months ago
- FACodec: Speech Codec with Attribute Factorization used for NaturalSpeech 3 ☆167 · Updated 7 months ago
- Official implementation of the paper "Laughter Synthesis using Pseudo Phonetic Tokens with a Large-scale In-the-wild Laughter Corpus" acc… ☆71 · Updated last year
- Implementation of SoundStorm built upon SpeechTokenizer. ☆104 · Updated last year
- DEX-TTS: Diffusion-based EXpressive TTS with Style Modeling on Time Variability ☆93 · Updated 3 weeks ago
- ☆47 · Updated 3 weeks ago
- Unofficial implementation of miipher ☆113 · Updated 7 months ago
- This is the M-AILABS Speech Dataset ☆22 · Updated 4 months ago
- Joint CTC-S2S Phoneme-level ASR for Voice Conversion and TTS (Text-Mel Alignment) ☆112 · Updated 2 years ago