A Flow Matching-based Text-to-Speech Model with Emoji-driven Style Control
☆28Feb 27, 2026Updated this week
Alternatives and similar repositories for Irodori-TTS
Users that are interested in Irodori-TTS are comparing it to the libraries listed below
Sorting:
- Llama-Mimi is a speech language model that uses a unified tokenizer (Mimi) and a single Transformer decoder (Llama) to jointly model sequ…☆28Sep 20, 2025Updated 5 months ago
- [Early Alpha] A unified framework for text-to-speech, voice conversion, automatic speech recognition, audio classification, voice activit…☆22Jan 10, 2025Updated last year
- ☆49Feb 12, 2026Updated 3 weeks ago
- ☆51Dec 24, 2025Updated 2 months ago
- An End-to-End Pipeline for Enhanced French Text-to-Speech with SSML Prosody Control☆31Jan 13, 2026Updated last month
- My hybrid TTS network that combines, VALL-E, VoiceBox, SpeechFlow, Seamless and TortoiseTTS into one☆26Aug 5, 2024Updated last year
- The implementation for "Empowering Whisper as a Joint Multi-Talker and Target-Talker Speech Recognition System".☆30Aug 2, 2025Updated 7 months ago
- 网络出处:Interactive Speech and Noise Modeling for Speech Enhancement☆28Jan 10, 2022Updated 4 years ago
- InSales e-commerce platform API bindings☆14Jul 13, 2024Updated last year
- The audio demos with respect to the paper "DBT-Net: Dual-branch federative magnitude and phase estimation with attention-in-attention tra…☆29Jul 25, 2022Updated 3 years ago
- Inference server for MioTTS, a lightweight and fast LLM-based TTS model.☆103Feb 14, 2026Updated 2 weeks ago
- ☆11Apr 1, 2025Updated 11 months ago
- Continual Resilient (CoRe) Optimizer for PyTorch☆11Jun 10, 2024Updated last year
- ☆80Aug 11, 2025Updated 6 months ago
- Bandwidth Extension of Historical Recordings using Generative Adversarial Networks☆35May 25, 2023Updated 2 years ago
- A Temporal-Spectral Generative Adversarial Network based End-to-end Packet Loss Concealment for Wideband Speech Transmission☆32Apr 27, 2022Updated 3 years ago
- List of repositories relevant to VITS.☆36Feb 26, 2023Updated 3 years ago
- Deep model with built-in self-attention alignment for acoustic echo cancellation, Pytorch implement☆39Jul 25, 2023Updated 2 years ago
- Implementation of a fast semantic chunker in C++, installable in python 3.7+ projects.☆22Sep 20, 2025Updated 5 months ago
- ☆29Dec 20, 2025Updated 2 months ago
- A news based stock scalper using LLM and quant approach☆15Jan 16, 2025Updated last year
- Russian phonetical transcription☆11Nov 19, 2025Updated 3 months ago
- Neural Homomorphic Vocoder optimized for singing voice synthesis☆18Updated this week
- ☆15Sep 16, 2024Updated last year
- A local, voice-controlled AI assistant with the personality of HAL 9000 from 2001: A Space Odyssey.