RobViren / kvoicewalk
A random walk voice style cloning application for Kokoro text to speech
☆210 · Jun 16, 2025 · Updated 7 months ago
Alternatives and similar repositories for kvoicewalk
Users who are interested in kvoicewalk are comparing it to the libraries listed below.
- ☆56 · Jan 17, 2026 · Updated 3 weeks ago
- StyleTTS 2 Optimized Training Fork ☆33 · Feb 2, 2025 · Updated last year
- Since the owner of the repo took it down and it used an MIT license, I guess it's okay to upload it here for people to use. ☆52 · Mar 11, 2025 · Updated 11 months ago
- IPA Phonemizer/Dephonemizer for 140 human languages ☆54 · Jan 8, 2026 · Updated last month
- Unofficial implementation of ConvNeXt-TTS powered by lightning ☆18 · Oct 20, 2024 · Updated last year
- Grapheme-to-phoneme tool for corpus conversion, where phonemes match Phoible inventories ☆19 · Apr 10, 2025 · Updated 10 months ago
- A cli app for experimenting with kokoro voice creating and mixing using the available voices to interpolate new ones ☆36 · Feb 5, 2025 · Updated last year
- High quality text-to-speech based on StyleTTS 2. ☆72 · Updated this week
- Zero-Shot Foreign Accent Conversion without a Native Reference ☆36 · May 1, 2024 · Updated last year
- This is not remotely close to a finished product, and does not intend to nor does this claim to be working fine-tuning code for MaskGCT. … ☆13 · Dec 4, 2024 · Updated last year
- A corpus of diacritized Hebrew texts (טקסט מנוקד) ☆11 · May 4, 2022 · Updated 3 years ago
- Open TTS models, built for streaming on the edge ☆45 · Mar 16, 2025 · Updated 10 months ago
- Official PyTorch implementation of (ICME2025 oral) "AutoStyle-TTS: Retrieval-Augmented Generation based Automatic Style Matching Text-to-… ☆17 · Feb 1, 2026 · Updated 2 weeks ago
- My hybrid TTS network that combines, VALL-E, VoiceBox, SpeechFlow, Seamless and TortoiseTTS into one ☆26 · Aug 5, 2024 · Updated last year
- StyleTTS2 + Vocos as a Decoder ☆13 · Mar 24, 2025 · Updated 10 months ago
- Official implementation of the paper "Laughter Synthesis using Pseudo Phonetic Tokens with a Large-scale In-the-wild Laughter Corpus" acc… ☆77 · Jul 16, 2023 · Updated 2 years ago
- VLLM Port of the Chatterbox TTS model ☆365 · Oct 18, 2025 · Updated 3 months ago
- Training code for kokoro tts model ☆33 · Nov 15, 2025 · Updated 3 months ago
- ☆13 · Dec 7, 2022 · Updated 3 years ago
- Clean and modernized implementation of FastSpeech2/LightSpeech using IPA ☆18 · Aug 16, 2024 · Updated last year
- Code release for "TinySpeech: Attention Condensers for Deep Speech Recognition Neural Networks on Edge Devices" ☆20 · Jun 7, 2025 · Updated 8 months ago
- Forced alignment decoder for Whisper. ☆14 · Mar 13, 2024 · Updated last year
- Automated speech dataset creator ☆215 · Jun 12, 2025 · Updated 8 months ago
- Text-to-Speech conversor for Basque and Spanish. It includes linguistic processing and built voices for the languages aforementioned. Its… ☆17 · Jan 15, 2026 · Updated 3 weeks ago
- Conformer block with Rotary Position Embedding, modified from lucidrains' implement ☆16 · Sep 13, 2024 · Updated last year
- A multilingual phoneme recognizer capable of generalizing zero-shot to unseen phoneme inventories. ☆27 · Mar 14, 2025 · Updated 11 months ago
- ☆13 · Sep 12, 2024 · Updated last year
- Generate a llama-quantize command to copy the quantization parameters of any GGUF ☆30 · Jan 23, 2026 · Updated 3 weeks ago
- ☆20 · Mar 7, 2025 · Updated 11 months ago
- Official repository for NAST: Noise Aware Speech Tokenization for Speech Language Models (Interspeech 2024) https://arxiv.org/abs/2406.11… ☆46 · Jul 2, 2024 · Updated last year
- PyTorch implementation of "Source Separation by Flow Matching (FLOSS)" by Google DeepMind ☆91 · Nov 24, 2025 · Updated 2 months ago
- All generative model in one for better TTS model ☆74 · Sep 8, 2024 · Updated last year
- PowerShell scripts to easily set up and run the Qwen3-Coder-Next 80B model locally on Windows using the llama.cpp. ☆32 · Updated this week
- Train no-reference speech quality estimators with multiple datasets via learned, per-dataset alignments. ☆18 · Aug 1, 2025 · Updated 6 months ago
- 🎙️ Automatically transcribe audio/video into high-quality, speaker-specific Text-To-Speech datasets ✨ ☆135 · Aug 10, 2025 · Updated 6 months ago
- Application of MB-iSTFT-VITS components to vits2_pytorch ☆132 · Dec 29, 2025 · Updated last month
- ☆297 · Jul 22, 2025 · Updated 6 months ago
- speaker-disentangled speech linguistic content quantizer ☆24 · Mar 19, 2025 · Updated 10 months ago
- ArtSpeech: Adaptive Text-to-Speech Synthesis with Articulatory Representations ☆21 · Sep 21, 2025 · Updated 4 months ago