πΌ Daisy-TTS: Simulating Wider Spectrum of Emotions via Prosody Embedding Decomposition
β14Nov 15, 2025Updated 4 months ago
Alternatives and similar repositories for daisy-tts
Users that are interested in daisy-tts are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- β13Sep 12, 2024Updated last year
- Simple audio AEβ13Nov 10, 2024Updated last year
- This is the implementation our Interspeech 2022 paper " Disentanglement of Emotional Style and Speaker Identity for Expressive Voice Convβ¦β21Sep 18, 2023Updated 2 years ago
- VQCPC-GAN: Variable-length Adversarial Audio Synthesis using Vector-Quantized Contrastive Predictive Codingβ14Apr 27, 2021Updated 4 years ago
- β14Aug 19, 2024Updated last year
- DigitalOcean Gradient AI Platform β’ AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- An espeak-compatible, permissively-licensed IPA phonemizer (G2P) based on DeepPhonemizer. Usable as a drop-in replacement for espeak's GPβ¦β108Mar 15, 2026Updated last week
- StyleTTS2 + Vocos as a Decoderβ13Mar 24, 2025Updated last year
- a Neural Vocoder supporting Ring Attention, Conformer and NSF.β24Aug 1, 2025Updated 7 months ago
- Caption, translate, and optionally record in real time "what you hear" from speakers and microphone. Never miss part of the conversation β¦β23Sep 11, 2025Updated 6 months ago
- mmyunβ17Aug 4, 2025Updated 7 months ago
- a Frontier Japanese Speech Generation netβ63May 15, 2025Updated 10 months ago
- speaker-disentangled speech linguistic content quantizerβ24Mar 19, 2025Updated last year
- Reimplementation of speech decoding 2022 paper by MetaAIβ14Oct 17, 2023Updated 2 years ago
- VITS-based zero-shot TTS system varying with diverse style/speaker conditioning methods.β36Sep 21, 2022Updated 3 years ago
- DigitalOcean Gradient AI Platform β’ AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- This repository contains the source code for a mobile application built using Kotlin Multiplatform. The app enables mesh networking functβ¦β12Feb 23, 2026Updated last month
- β23Dec 23, 2025Updated 3 months ago
- β13Mar 10, 2025Updated last year
- Real-Time ASR with CNN-BiLSTM: End-to-End Live Streaming Using PyTorch Lightningβ‘β11Jan 23, 2025Updated last year
- Reference-aware automatic speech evaluation toolkitβ180Dec 5, 2024Updated last year
- A pitch detection model trained to be robust against noise and reverberation environments.β27Jan 21, 2025Updated last year
- Zero-Shot Emotion Style Transferβ49Apr 23, 2025Updated 11 months ago
- Supervoice diffusion enhanceβ28Jul 15, 2024Updated last year
- β19Sep 19, 2023Updated 2 years ago
- NordVPN Special Discount Offer β’ AdSave on top-rated NordVPN 1 or 2-year plans with secure browsing, privacy protection, and support for for all major platforms.
- A toolkit to calculate speech audio quality. Not affiliated with the original authorsβ70Aug 13, 2024Updated last year
- HiFTNet wav/audio super-resolution 16/24 kHz to 48 kHzβ24Jan 2, 2024Updated 2 years ago
- A collection of utilities for handling IPA phones.β26Sep 24, 2023Updated 2 years ago
- β122Oct 24, 2022Updated 3 years ago
- β13Oct 3, 2024Updated last year
- Sound Separation, Omni modalβ28Sep 15, 2025Updated 6 months ago
- An Android compatible espeak-ng versionβ12Aug 5, 2023Updated 2 years ago
- Android project that is using FastSAM model for segment anything with live camera feed and gallery images.β11Nov 9, 2025Updated 4 months ago
- Some examples with MediaPipeβ14Apr 1, 2024Updated last year
- Managed Kubernetes at scale on DigitalOcean β’ AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Telegram Desktop with Compose Multiplatform framework.β11Jan 8, 2026Updated 2 months ago
- Stable timestamps and confidence score for words of OpenAI's Whisper outputs down to word-level.β24Dec 20, 2022Updated 3 years ago
- Appwrite + Stripe integration showcaseβ12Apr 4, 2022Updated 3 years ago
- β31Oct 29, 2024Updated last year
- HiFi-SR is a Python-based pipeline for the detection of plant mitochondrial structural rearrangements based on the mapping of PacBio highβ¦β10Apr 15, 2025Updated 11 months ago
- A fountain script add-on for Blenderβ13Jul 6, 2020Updated 5 years ago
- An end-to-end library for training audio wake-word models and deploying them in the browser.β39Jul 25, 2025Updated 8 months ago