DakeQQ / F5-TTS-ONNX
Running F5-TTS with ONNX Runtime
☆35 · Updated this week
Related projects
Alternatives and complementary repositories for F5-TTS-ONNX
- SenseVoice-python: An enterprise-grade, open-source, multi-language ASR system from FunASR, running on onnxruntime ☆74 · Updated last month
- SpeechDenoiser: Real-Time Speech Denoising with ONNX. Welcome to SpeechDenoiser, a simple and effective solution for real-time speech den… ☆44 · Updated 3 months ago
- VALL-E 2 reproduction ☆87 · Updated 4 months ago
- X-E-Speech: Joint Training Framework of Non-Autoregressive Cross-lingual Emotional Text-to-Speech and Voice Conversion ☆70 · Updated 7 months ago
- Extension of ChatTTS, 3x Faster on Windows, Support Voice Cloning and Mobile Deployment ☆25 · Updated 2 weeks ago
- ☆45 · Updated 4 months ago
- Official Code for ParrotTTS ☆42 · Updated last month
- VoiceBox neural network implementation ☆96 · Updated 3 months ago
- ☆66 · Updated last year
- GPT-style network for phonemization with durations of text ☆62 · Updated 8 months ago
- The YouTube Text-To-Speech dataset is comprised of waveform audio extracted from YouTube videos alongside their English transcriptions ☆50 · Updated 3 years ago
- ☆20 · Updated 3 weeks ago
- ☆28 · Updated last year
- An espeak-compatible, permissively-licensed IPA phonemizer (G2P) based on DeepPhonemizer. Usable as a drop-in replacement for espeak's GP… ☆83 · Updated last month
- ☆81 · Updated 2 months ago
- An enterprise-grade Voice Activity Detector from ModelScope and FunASR. ☆65 · Updated last year
- Use VITS and Opencpop to develop singing voice synthesis; Different from VISinger. ☆32 · Updated last year
- [NCMMSC'2024] Emotion-Aware Prosodic Phrasing for Expressive Text-to-Speech ☆22 · Updated 3 months ago
- zero-shot realtime TTS system, fully offline, free and open source ☆14 · Updated last week
- Separately maintained Chinese TTS ☆35 · Updated 2 years ago
- This repo is an exploratory experiment to enable frozen pretrained RWKV language models to accept speech modality input. We followed the … ☆32 · Updated last week
- This repository contains the code and data for the paper EmoKnob: Enhance Voice Cloning with Fine-Grained Emotion Control by Haozhe Chen,… ☆43 · Updated last month
- a text-conditional diffusion probabilistic model capable of generating high fidelity audio. ☆125 · Updated 5 months ago
- [WIP] Unofficial Implementation of Microsoft's PromptTTS2 ☆51 · Updated last year
- a lightweight voice conversion ☆78 · Updated 2 months ago
- Putting flows on top of neural transducers for better TTS ☆63 · Updated 3 weeks ago
- SoloAudio: Target Sound Extraction with Language-oriented Audio Diffusion Transformer. ☆64 · Updated last week
- Zero-Shot Emotion Style Transfer ☆37 · Updated 7 months ago
- Chinese and English bilingual G2P ☆20 · Updated last year
- An open-source Kazakh Emotional Text-to-Speech Dataset ☆25 · Updated 7 months ago