A random walk voice style cloning application for Kokoro text to speech
☆256Apr 6, 2026Updated 2 months ago
Alternatives and similar repositories for kvoicewalk
Users that are interested in kvoicewalk are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆58Feb 8, 2026Updated 4 months ago
- Since the owner of the repo took it down and it used an MIT license, I guess it's okay to upload it here for people to use.☆55Mar 11, 2025Updated last year
- StyleTTS 2 Optimized Training Fork☆32Feb 2, 2025Updated last year
- High quality text-to-speech based on StyleTTS 2.☆77Apr 6, 2026Updated 2 months ago
- pytorch model for contexless-phoneme prediction from speech audio☆32Oct 30, 2025Updated 7 months ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- An OCaml extension for RISC-V☆16Nov 6, 2020Updated 5 years ago
- Official implementation of the paper "Laughter Synthesis using Pseudo Phonetic Tokens with a Large-scale In-the-wild Laughter Corpus" acc…☆77Jul 16, 2023Updated 2 years ago
- Unofficial implementation of ConvNeXt-TTS powered by lightning☆18Oct 20, 2024Updated last year
- [Interspeech 2025] DualCodec: A Low-Frame-Rate, Semantically-Enhanced Neural Audio Codec☆68Mar 11, 2026Updated 3 months ago
- StyleTTS2 + Vocos as a Decoder☆13Mar 24, 2025Updated last year
- My computer science coursework on maze generation and pathfinding that got 75/75 marks.☆22Sep 19, 2024Updated last year
- Training code for kokoro tts model☆45Nov 15, 2025Updated 7 months ago
- Kanade is a single-layer disentangled speech tokenizer that extracts compact tokens suitable for both generative and discriminative model…☆100May 18, 2026Updated last month
- Automated speech dataset creator☆222Jun 12, 2025Updated last year
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- ☆21Mar 7, 2025Updated last year
- Open TTS models, built for streaming on the edge☆45Mar 16, 2025Updated last year
- IPA Phonemizer/Dephonemizer for 140 human languages☆60May 6, 2026Updated last month
- ☆302Jul 22, 2025Updated 10 months ago
- speaker-disentangled speech linguistic content quantizer☆25Mar 19, 2025Updated last year
- A cli app for experimenting with kokoro voice creating and mixing using the available voices to interpolate new ones☆38Feb 5, 2025Updated last year
- All generative model in one for better TTS model☆74Sep 8, 2024Updated last year
- Surgically de-slop LLMs☆14Jun 1, 2025Updated last year
- Grapheme-to-phoneme tool for corpus conversion, where phonemes match Phoible inventories☆19Apr 10, 2025Updated last year
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- ☆28Aug 22, 2025Updated 9 months ago
- ☆13Mar 10, 2025Updated last year
- 🎙️ Automatically transcribe audio/video into high-quality, speaker-specific Text-To-Speech datasets☆138Aug 10, 2025Updated 10 months ago
- My hybrid TTS network that combines, VALL-E, VoiceBox, SpeechFlow, Seamless and TortoiseTTS into one☆26Aug 5, 2024Updated last year
- Fast Streaming TTS with Orpheus + WebRTC (with FastRTC)☆355Apr 10, 2025Updated last year
- VLLM Port of the Chatterbox TTS model☆377Oct 18, 2025Updated 8 months ago
- [ACL 2026 Main] MeanAudio: Fast and Faithful Text-to-Audio Generation with Mean Flows☆140Sep 2, 2025Updated 9 months ago
- ☆41Jul 15, 2025Updated 11 months ago
- A highly optimized engine for neutts-air model to generate minutes of audio in seconds. Over 200x realtime on modern hardware!☆119Nov 24, 2025Updated 6 months ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- A TTS Trained on Universal Audio.☆41Jun 6, 2025Updated last year
- ☆83Feb 28, 2025Updated last year
- ☆13Dec 7, 2022Updated 3 years ago
- PyTorch implementation of "Source Separation by Flow Matching (FLOSS)" by Google DeepMind☆94Nov 24, 2025Updated 6 months ago
- Text-to-Speech conversor for Basque and Spanish. It includes linguistic processing and built voices for the languages aforementioned. Its…☆18Jan 15, 2026Updated 5 months ago
- Forced alignment decoder for Whisper.☆16Mar 13, 2024Updated 2 years ago
- ☆13Apr 26, 2026Updated last month