rendchevi / daisy-tts
🌼 Daisy-TTS: Simulating Wider Spectrum of Emotions via Prosody Embedding Decomposition
☆15 · Updated last year
Alternatives and similar repositories for daisy-tts:
Users who are interested in daisy-tts are comparing it to the libraries listed below:
- Provide Gradio custom components to make the diarization-based audio labeling process easier and faster. ☆61 · Updated this week
- Zero-Shot Emotion Style Transfer ☆41 · Updated 10 months ago
- Official repository for the "Powerset multi-class cross entropy loss for neural speaker diarization" paper published in Interspeech 2023. ☆80 · Updated last year
- An espeak-compatible, permissively-licensed IPA phonemizer (G2P) based on DeepPhonemizer. Usable as a drop-in replacement for espeak's GP… ☆93 · Updated 4 months ago
- VITS-based zero-shot TTS system varying with diverse style/speaker conditioning methods. ☆36 · Updated 2 years ago
- ☆42 · Updated 2 weeks ago
- ZMM-TTS: Zero-shot Multilingual and Multispeaker Speech Synthesis Conditioned on Self-supervised Discrete Speech Representations ☆144 · Updated 11 months ago
- The official implementation of EmoSphere++ ☆73 · Updated last month
- This is a fork of the original fairseq repository (version 0.12.2) with added classes for training mHuBERT-147. ☆15 · Updated 3 months ago
- Speaker change detection using SincNet and an LSTM/Transformer ☆47 · Updated 8 months ago
- Application of MB-iSTFT-VITS components to vits2_pytorch ☆122 · Updated 3 months ago
- This repository contains the code and data for the paper EmoKnob: Enhance Voice Cloning with Fine-Grained Emotion Control by Haozhe Chen,… ☆66 · Updated 5 months ago
- Companion repo for the paper "PixIT: Joint Training of Speaker Diarization and Speech Separation from Real-world Multi-speaker Recordings… ☆81 · Updated last month
- An open source implementation of Microsoft's VALL-E X zero-shot TTS model. Demo is available at https://plachtaa.github.io ☆68 · Updated last year
- SSR-Speech: Towards Stable, Safe and Robust Zero-shot Speech Editing and Synthesis ☆125 · Updated 2 months ago
- ☆59 · Updated last year
- ☆21 · Updated 3 weeks ago
- PyTorch implementation of the paper "MultiSpeech: Multi-Speaker Text to Speech with Transformer" ☆22 · Updated 2 years ago
- [InterSpeech'2024] FluentEditor: Text-based Speech Editing by Considering Acoustic and Prosody Consistency ☆51 · Updated 4 months ago
- [IJCAI'23] Learning to Speak from Text for Low-Resource TTS ☆63 · Updated last year
- An unofficial PyTorch implementation of VALL-E ☆87 · Updated this week
- ☆69 · Updated last year
- Train the next generation of TTS systems. ☆162 · Updated 5 months ago
- ☆63 · Updated 5 months ago
- SoloAudio: Target Sound Extraction with Language-oriented Audio Diffusion Transformer. ☆77 · Updated 2 months ago
- ☆36 · Updated 5 months ago
- Toolbox for easy and qualitative one-shot voice conversion ☆45 · Updated 3 years ago
- ☆37 · Updated 11 months ago
- A sequence-to-sequence voice conversion toolkit. ☆93 · Updated 7 months ago
- Unsupervised Rhythm Modeling for Voice Conversion ☆80 · Updated last year