adrianlyjak / kokoro-onnx-export
☆13Updated 2 months ago
Alternatives and similar repositories for kokoro-onnx-export
Users who are interested in kokoro-onnx-export are comparing it to the libraries listed below
- High quality text-to-speech based on StyleTTS 2.☆52Updated this week
- IPA Phonemizer/Dephonemizer for 139 human languages☆30Updated this week
- VoiceBox neural network implementation☆108Updated 11 months ago
- ONNX Inference of Pyannote Segmentation☆92Updated 6 months ago
- StyleTTS 2 Optimized Training Fork☆32Updated 5 months ago
- Colab notebooks for Next-gen Kaldi☆28Updated 3 months ago
- Automatically cleaning, enhancing, segmenting, filtering, and formatting a dataset to fine tune or train a voice model.☆41Updated 2 weeks ago
- 🎙️ Automatically transcribe audio/video into high-quality, speaker-specific Text-To-Speech datasets ✨☆92Updated last month
- A lightweight, efficient variation of the StyleTTS 2 text‐to‐speech model.☆35Updated last month
- Official code for "F5R-TTS: Improving Flow-Matching based Text-to-Speech with Group Relative Policy Optimization"☆102Updated last month
- Export an ONNX graph that performs ISTFT. Designed for TTS models.☆24Updated last year
- Simple PyTorch Denoisers for Waveform Audio☆35Updated 2 months ago
- A TTS model capable of generating ultra-realistic dialogue in one pass.☆110Updated last month
- ☆28Updated 5 months ago
- Open TTS models, built for streaming on the edge☆43Updated 4 months ago
- An unofficial pytorch implementation of "STREAMVC: REAL-TIME LOW-LATENCY VOICE CONVERSION".☆69Updated 3 months ago
- (WIP) A retrain of F5-TTS on permissively-licensed data☆11Updated 3 months ago
- VALL-E 2 reproduction☆129Updated last year
- StyleTTS-ZS: Efficient High-Quality Zero-Shot Text-to-Speech Synthesis with Distilled Time-Varying Style Diffusion☆184Updated 9 months ago
- An espeak-compatible, permissively-licensed IPA phonemizer (G2P) based on DeepPhonemizer. Usable as a drop-in replacement for espeak's GP…☆100Updated 9 months ago
- ☆25Updated 2 weeks ago
- AudioSR-Upsampling (any -> 48kHz)☆41Updated last year
- Unofficial implementation of wavenext vocoder☆48Updated 10 months ago
- TechSinger: Technique Controllable Multilingual Singing Voice Synthesis via Flow Matching☆63Updated 2 months ago
- A TTS model capable of generating ultra-realistic dialogue in one pass.☆194Updated 2 months ago
- SpeechDenoiser: Real-Time Speech Denoising with ONNX Welcome to SpeechDenoiser, a simple and effective solution for real-time speech den…☆80Updated 11 months ago
- SSR-Speech: Towards Stable, Safe and Robust Zero-shot Speech Editing and Synthesis☆135Updated 6 months ago
- C++ version of pyannote audio speaker diarization pipeline☆21Updated last year
- Reimplementation of Bandit for "Remastering Divide and Remaster: A Cinematic Audio Source Separation Dataset with Multilingual Support"☆38Updated 11 months ago
- StyleTTS2 + Vocos as a Decoder☆13Updated 3 months ago