adrianlyjak / kokoro-onnx-exportLinks
☆16Updated 4 months ago
Alternatives and similar repositories for kokoro-onnx-export
Users that are interested in kokoro-onnx-export are comparing it to the libraries listed below
Sorting:
- ONNX Inference of Pyannote Segmentation☆92Updated 8 months ago
- 🎙️ Automatically transcribe audio/video into high-quality, speaker-specific Text-To-Speech datasets ✨☆119Updated 2 weeks ago
- High quality text-to-speech based on StyleTTS 2.☆60Updated this week
- a cpp ggml port of "VITS: Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech." for use in mobile…☆42Updated last year
- ☆34Updated last week
- Open TTS models, built for streaming on the edge☆43Updated 5 months ago
- IPA Phonemizer/Dephonemizer for 139 human languages☆33Updated this week
- Colab notebooks for Next-gen Kaldi☆28Updated this week
- C++ version of pyannote audio speaker diarizaiton pipeline☆21Updated last year
- Automatically cleaning, enhancing, segmenting, filtering, and formatting a dataset to fine tune or train a voice model.☆42Updated last month
- VoiceBox neural network implementation☆109Updated last year
- A enterprise-grade Voice Activity Detector from modelscope and funasr.☆108Updated 2 years ago
- Official implementation of the TTS model Lina-Speech☆168Updated 7 months ago
- Onnx compatible styletts2 code☆13Updated 2 months ago
- Official code for "F5R-TTS: Improving Flow-Matching based Text-to-Speech with Group Relative Policy Optimization"☆115Updated 2 months ago
- A lightweight end-to-end text-to-speech model☆118Updated 6 months ago
- SpeechDenoiser: Real-Time Speech Denoising with ONNX Welcome to SpeechDenoiser, a simple and effective solution for real-time speech den…☆84Updated last year
- Provide Gradio custom components to make the diarization-based audio labeling process easier and faster.☆67Updated last month
- ☆275Updated last month
- This is an implementation for train hifigan part of XTTSv2 model using Coqui/TTS.☆84Updated 9 months ago
- Fast and High-Quality Zero-Shot Text-to-Speech with Flow Matching☆499Updated last week
- An espeak-compatible, permissively-licensed IPA phonemizer (G2P) based on DeepPhonemizer. Usable as a drop-in replacement for espeak's GP…☆102Updated 10 months ago
- Real-time Speech-Text Foundation Model Toolkit (wip)☆243Updated 5 months ago
- Running the F5-TTS by ONNX Runtime☆176Updated 2 weeks ago
- We Speech Transcript based on LLM, in 300 lines of code.☆176Updated 2 months ago
- A lightweight, efficient variation of the StyleTTS 2 text‐to‐speech model.☆37Updated 3 months ago
- An unofficial pytorch implementation of "STREAMVC: REAL-TIME LOW-LATENCY VOICE CONVERSION".☆73Updated 4 months ago
- ☆273Updated last year
- StyleTTS-ZS: Efficient High-Quality Zero-Shot Text-to-Speech Synthesis with Distilled Time-Varying Style Diffusion☆184Updated 11 months ago
- [EMNLP Main '25] LiteASR: Efficient Automatic Speech Recognition with Low-Rank Approximation☆122Updated 3 months ago