rendchevi / daisy-tts
🌼 Daisy-TTS: Simulating Wider Spectrum of Emotions via Prosody Embedding Decomposition
☆15 Updated last year
Alternatives and similar repositories for daisy-tts
Users who are interested in daisy-tts are comparing it to the libraries listed below.
- High quality text-to-speech based on StyleTTS 2. ☆63 Updated this week
- Zero-Shot Emotion Style Transfer ☆49 Updated 4 months ago
- StyleTTS 2 Optimized Training Fork ☆33 Updated 7 months ago
- An espeak-compatible, permissively-licensed IPA phonemizer (G2P) based on DeepPhonemizer. Usable as a drop-in replacement for espeak's GP… ☆103 Updated 11 months ago
- Provide Gradio custom components to make the diarization-based audio labeling process easier and faster. ☆68 Updated last week
- StyleTTS2 + Vocos as a Decoder ☆13 Updated 5 months ago
- This repository contains the code and data for the paper EmoKnob: Enhance Voice Cloning with Fine-Grained Emotion Control by Haozhe Chen,… ☆78 Updated 11 months ago
- This is a fork of the original fairseq repository (version 0.12.2) with added classes for training mHuBERT-147. ☆18 Updated 9 months ago
- An unofficial PyTorch implementation of VALL-E ☆88 Updated last month
- A Massive Multilingual Multi-speaker Speech Corpus for Scaling Indian TTS ☆49 Updated 9 months ago
- ☆50 Updated 5 months ago
- An open source implementation of Microsoft's VALL-E X zero-shot TTS model. Demo is available in https://plachtaa.github.io ☆69 Updated last year
- The Vokan Architecture (Tsukasa speech based) ☆10 Updated 7 months ago
- X-E-Speech: Joint Training Framework of Non-Autoregressive Cross-lingual Emotional Text-to-Speech and Voice Conversion ☆103 Updated last year
- Automatically cleaning, enhancing, segmenting, filtering, and formatting a dataset to fine tune or train a voice model. ☆42 Updated last week
- An implementation for training the HiFi-GAN part of the XTTSv2 model using Coqui/TTS. ☆85 Updated 10 months ago
- Open TTS models, built for streaming on the edge ☆42 Updated 6 months ago
- VoiceBox neural network implementation ☆110 Updated last year
- C++ version of pyannote audio overlapped speech detection pipeline ☆13 Updated last year
- Unsupervised Rhythm Modeling for Voice Conversion ☆84 Updated 2 years ago
- VITS-based zero-shot TTS system varying with diverse style/speaker conditioning methods. ☆36 Updated 2 years ago
- ☆43 Updated 11 months ago
- [IJCAI'23] Learning to Speak from Text for Low-Resource TTS ☆63 Updated 2 years ago
- ☆28 Updated last year
- Official Code for ParrotTTS ☆54 Updated 11 months ago
- ☆14 Updated last year
- ☆82 Updated 3 months ago
- DEX-TTS: Diffusion-based EXpressive TTS with Style Modeling on Time Variability ☆102 Updated 8 months ago
- Google's SoundStorm: Efficient Parallel Audio Generation ☆132 Updated 2 years ago
- Implementation of Emo-StarGAN ☆45 Updated last year