Zero-shot expressive voice cloning and speech generation. Generate anything from short clips to full-length audiobooks with realistic emotional delivery, pacing, and breath control. Clone any voice from a 10-second reference and perform emotions the original speaker never recorded.
☆355May 15, 2026Updated this week
Alternatives and similar repositories for scenema-audio
Users that are interested in scenema-audio are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- 2020년 21대 국회의원 총선거 지도☆11Mar 19, 2020Updated 6 years ago
- ESLTTS dataset☆16Feb 6, 2025Updated last year
- ☆21Dec 5, 2022Updated 3 years ago
- Evaluation tool used in the BigVSAN paper☆14Mar 22, 2024Updated 2 years ago
- Official implemtation of UniverSR (ICASSP 2026)☆48Apr 9, 2026Updated last month
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- SGLang is a fast serving framework for large language models and vision language models.☆21May 22, 2025Updated 11 months ago
- ☆36Oct 23, 2025Updated 6 months ago
- We introduce the LLAMA1 Test Set, a comprehensive open-domain world knowledge QA dataset for evaluating question-answering systems. We pr…☆23Mar 14, 2024Updated 2 years ago
- ☆11Nov 26, 2024Updated last year
- Voice Operation and Design Engine with Reproduction capabilities☆118May 2, 2026Updated last week
- [ICASSP 2026] Task Vector in TTS: Toward Emotionally Expressive Dialectal Speech Synthesis☆39Dec 24, 2025Updated 4 months ago
- ComfyUI node for AudioSR - Versatile Audio Super Resolution upscales audio to 48kHz using latent diffusion