rendchevi/daisy-tts

🌼 Daisy-TTS: Simulating Wider Spectrum of Emotions via Prosody Embedding Decomposition

☆14

Alternatives and similar repositories for daisy-tts

Users interested in daisy-tts are comparing it to the repositories listed below.

7Xin / DPI-TTS
☆13 · Sep 12, 2024 · Updated last year
zy-du / Disentanglement-of-Emotional-Style-and-Speaker-Identity-for-Expressive-Voice-Conversion
This is the implementation of our Interspeech 2022 paper "Disentanglement of Emotional Style and Speaker Identity for Expressive Voice Conv…
☆21 · Sep 18, 2023 · Updated 2 years ago
Tera2Space / AudioAE
Simple audio AE
☆13 · Nov 10, 2024 · Updated last year
LuluW8071 / Automatic-Speech-Recognition-with-PyTorch
Real-Time ASR with CNN-BiLSTM: End-to-End Live Streaming Using PyTorch Lightning⚡
☆11 · Jan 23, 2025 · Updated last year
iamanigeeit / present
☆14 · Aug 19, 2024 · Updated last year
NeuralVox / OpenPhonemizer
An espeak-compatible, permissively-licensed IPA phonemizer (G2P) based on DeepPhonemizer. Usable as a drop-in replacement for espeak's GP…
☆111 · Mar 15, 2026 · Updated 4 months ago
5Hyeons / StyleTTS2-Vocos
StyleTTS2 + Vocos as a Decoder
☆13 · Mar 24, 2025 · Updated last year
tonnetonne814 / PITS-44100-Ja
PITS: Variational Pitch Inference for End-to-end Pitch-controllable TTS without External Pitch Predictor, adapted to support 44100 Hz Japanese audio.
☆21 · May 2, 2023 · Updated 3 years ago
SeanNobel / speech-decoding
Reimplementation of the 2022 speech decoding paper by Meta AI
☆14 · Oct 17, 2023 · Updated 2 years ago
vtuber-plan / hifi-gan
A high-resolution implementation of the HiFi-GAN vocoder for voice conversion.
☆32 · Apr 10, 2023 · Updated 3 years ago
JJWRoeloffs / transcribe_align_textgrid
A small wrapper package around whisper-timestamped. Create force-aligned transcription TextGrids from raw audio!
☆18 · Dec 16, 2025 · Updated 7 months ago
yl4579 / AuxiliaryASR
Joint CTC-S2S Phoneme-level ASR for Voice Conversion and TTS (Text-Mel Alignment)
☆127 · Jun 16, 2022 · Updated 4 years ago
mzarvandi / SER-wav2vec
Speech Emotion Recognition using transfer learning with wav2vec on IEMOCAP.
☆17 · Aug 8, 2021 · Updated 4 years ago
hcy71o / SC-VITS
VITS-based zero-shot TTS system varying with diverse style/speaker conditioning methods.
☆36 · Sep 21, 2022 · Updated 3 years ago
anqorithm / Saudi-CERT-API
This repository has a tool and an API for Saudi CERT alerts. Its goal is to help improve the level of cybersecurity awareness in Saudi Ar…
☆13 · Nov 16, 2023 · Updated 2 years ago
syv-ai / PybberLink
☆13 · Mar 10, 2025 · Updated last year
Choddeok / DiEmo-TTS
[INTERSPEECH 2025] The official implementation of DiEmo-TTS: Disentangled Emotion Representations via Self-Supervised Distillation for…
☆17 · Jul 16, 2026 · Updated last week
xinshengwang / robpitch
A pitch detection model trained to be robust in noisy and reverberant environments.
☆27 · Jan 21, 2025 · Updated last year
Takaaki-Saeki / DiscreteSpeechMetrics
Reference-aware automatic speech evaluation toolkit
☆185 · Dec 5, 2024 · Updated last year
iiscleap / ZEST
Zero-Shot Emotion Style Transfer
☆49 · Apr 23, 2025 · Updated last year
Choddeok / Affectron
[ACL 2026 Findings] Affectron: Emotional Speech Synthesis with Affective and Contextually Aligned Nonverbal Vocalizations
☆20 · Jul 16, 2026 · Updated last week
ex3ndr / supervoice-enhance
Supervoice diffusion enhance
☆28 · Jul 15, 2024 · Updated 2 years ago
Respaired / Tsukasa-Speech
A Frontier Japanese Speech Generation net
☆65 · May 15, 2025 · Updated last year
taylorchu / 2cent-tts
☆58 · Feb 8, 2026 · Updated 5 months ago
haoweilou / ParaStyleTTS
This is the official code for ACM CIKM 2025 Paper: ParaStyleTTS: Toward Efficient and Robust Paralinguistic Style Control for Expressive …
☆59 · Dec 21, 2025 · Updated 7 months ago
Mu-Y / DiariST
☆18 · Sep 19, 2023 · Updated 2 years ago
fakerybakery / utmos
A toolkit to calculate speech audio quality. Not affiliated with the original authors.
☆74 · Aug 13, 2024 · Updated last year
zhhaibl / mmy
mmyun
☆19 · Aug 4, 2025 · Updated 11 months ago
liuhuang31 / HiFTNet-sr
HiFTNet wav/audio super-resolution 16/24 kHz to 48 kHz
☆24 · Jan 2, 2024 · Updated 2 years ago
p1an-lin-jung / WavThruVec_pytorch
An implementation of Charactr, Inc's "WavThruVec: Latent speech representation as intermediate features for neural speech synthesis"
☆29 · Sep 6, 2023 · Updated 2 years ago
MiniXC / phones
A collection of utilities for handling IPA phones.
☆27 · Sep 24, 2023 · Updated 2 years ago
rajasegar / htmx-playground
☆13 · Oct 3, 2024 · Updated last year
KunZhou9646 / Mixed_Emotions
☆123 · Oct 24, 2022 · Updated 3 years ago
SmoothKen / knn-svc
kNN-SVC: Robust Zero-Shot Singing Voice Conversion with Additive Synthesis and Concatenation Smoothness Optimization
☆16 · Nov 7, 2025 · Updated 8 months ago
Plachtaa / ASTRAL-quantization
Speaker-disentangled speech linguistic content quantizer
☆26 · Mar 19, 2025 · Updated last year
Meldiron / almost-cookie-store
Appwrite + Stripe integration showcase
☆12 · Apr 4, 2022 · Updated 4 years ago
zrr1999 / emotion-recognition
Research on multimodal emotion recognition methods
☆28 · Mar 24, 2026 · Updated 4 months ago
duerig / StyleTTS2
StyleTTS 2 Optimized Training Fork
☆32 · Feb 2, 2025 · Updated last year
mbzuai-nlp / sttatts
☆31 · Oct 29, 2024 · Updated last year