A random walk voice style cloning application for Kokoro text to speech
☆215Jun 16, 2025Updated 8 months ago
Alternatives and similar repositories for kvoicewalk
Users that are interested in kvoicewalk are comparing it to the libraries listed below
- ☆57Feb 8, 2026Updated last month
- StyleTTS 2 Optimized Training Fork☆33Feb 2, 2025Updated last year
- Since the owner of the repo took it down and it used an MIT license, I guess it's okay to upload it here for people to use.☆53Mar 11, 2025Updated 11 months ago
- IPA Phonemizer/Dephonemizer for 140 human languages☆55Feb 11, 2026Updated 3 weeks ago
- Unofficial implementation of ConvNeXt-TTS powered by lightning☆18Oct 20, 2024Updated last year
- PyTorch model for contextless phoneme prediction from speech audio☆32Oct 30, 2025Updated 4 months ago
- Grapheme-to-phoneme tool for corpus conversion, where phonemes match Phoible inventories☆19Apr 10, 2025Updated 10 months ago
- A CLI app for experimenting with Kokoro voice creation and mixing, using the available voices to interpolate new ones☆37Feb 5, 2025Updated last year
- High quality text-to-speech based on StyleTTS 2.☆73Feb 25, 2026Updated last week
- Zero-Shot Foreign Accent Conversion without a Native Reference☆36May 1, 2024Updated last year
- This is not remotely close to a finished product, and does not intend to nor does this claim to be working fine-tuning code for MaskGCT. …☆13Dec 4, 2024Updated last year
- A corpus of diacritized Hebrew texts (טקסט מנוקד)☆11May 4, 2022Updated 3 years ago
- Open TTS models, built for streaming on the edge☆45Mar 16, 2025Updated 11 months ago
- Official PyTorch implementation of (ICME2025 oral) "AutoStyle-TTS: Retrieval-Augmented Generation based Automatic Style Matching Text-to-…☆16Feb 1, 2026Updated last month
- My hybrid TTS network that combines VALL-E, VoiceBox, SpeechFlow, Seamless and TortoiseTTS into one☆26Aug 5, 2024Updated last year
- StyleTTS2 + Vocos as a Decoder☆13Mar 24, 2025Updated 11 months ago
- Official implementation of the paper "Laughter Synthesis using Pseudo Phonetic Tokens with a Large-scale In-the-wild Laughter Corpus" acc…☆77Jul 16, 2023Updated 2 years ago
- VLLM Port of the Chatterbox TTS model☆371Oct 18, 2025Updated 4 months ago
- Forced alignment decoder for Whisper.☆14Mar 13, 2024Updated last year
- Clean and modernized implementation of FastSpeech2/LightSpeech using IPA☆18Aug 16, 2024Updated last year
- Code release for "TinySpeech: Attention Condensers for Deep Speech Recognition Neural Networks on Edge Devices"☆21Jun 7, 2025Updated 9 months ago
- ☆13Dec 7, 2022Updated 3 years ago
- Automated speech dataset creator☆217Jun 12, 2025Updated 8 months ago
- Conformer block with Rotary Position Embedding, modified from lucidrains' implementation☆18Sep 13, 2024Updated last year
- Training code for kokoro tts model☆34Nov 15, 2025Updated 3 months ago
- Text-to-Speech converter for Basque and Spanish. It includes linguistic processing and built voices for these languages. Its…☆18Jan 15, 2026Updated last month
- A multilingual phoneme recognizer capable of generalizing zero-shot to unseen phoneme inventories.☆29Mar 14, 2025Updated 11 months ago
- Kanade is a single-layer disentangled speech tokenizer that extracts compact tokens suitable for both generative and discriminative model…☆82Feb 3, 2026Updated last month
- ☆21Mar 7, 2025Updated last year
- Generate a llama-quantize command to copy the quantization parameters of any GGUF☆30Jan 23, 2026Updated last month
- ☆13Sep 12, 2024Updated last year
- Official repo for DisCoder: High-Fidelity Music Vocoder using Neural Audio Codecs presented at ICASSP 2025☆38Feb 24, 2025Updated last year
- Official repository for NAST: Noise Aware Speech Tokenization for Speech Language Models (Interspeech 2024) https://arxiv.org/abs/2406.11…☆46Jul 2, 2024Updated last year
- PyTorch implementation of "Source Separation by Flow Matching (FLOSS)" by Google DeepMind☆92Nov 24, 2025Updated 3 months ago
- All generative models in one for a better TTS model☆74Sep 8, 2024Updated last year
- Linux & Powershell scripts to easily set up and run the Qwen 3.5 series locally on Windows and Linux with llama.cpp.☆45Mar 2, 2026Updated last week
- Train no-reference speech quality estimators with multiple datasets via learned, per-dataset alignments.☆18Aug 1, 2025Updated 7 months ago
- 🎙️ Automatically transcribe audio/video into high-quality, speaker-specific Text-To-Speech datasets ✨☆135Aug 10, 2025Updated 6 months ago
- Application of MB-iSTFT-VITS components to vits2_pytorch☆133Dec 29, 2025Updated 2 months ago