IPA Phonemizer/Dephonemizer for 140 human languages
☆55Feb 11, 2026Updated 3 weeks ago
Alternatives and similar repositories for goruut
Users who are interested in goruut are comparing it to the libraries listed below
- IPA Phonetic dataset lexicon☆18Feb 22, 2026Updated last week
- A Golang Text to Speech System☆18Feb 16, 2026Updated 2 weeks ago
- Grapheme-to-phoneme tool for corpus conversion, where phonemes match Phoible inventories☆19Apr 10, 2025Updated 10 months ago
- PyTorch model for contextless phoneme prediction from speech audio☆32Oct 30, 2025Updated 4 months ago
- StyleTTS 2 Optimized Training Fork☆33Feb 2, 2025Updated last year
- ☆23Apr 29, 2025Updated 10 months ago
- A collection of all our phonemizers for dataset construction and inference☆27Feb 21, 2025Updated last year
- This is a text-processing frontend that converts graphemes to phonemes and then further converts those phonemes into articulatory feature…☆14Sep 23, 2024Updated last year
- ☆57Feb 8, 2026Updated 3 weeks ago
- High quality text-to-speech based on StyleTTS 2.☆73Feb 25, 2026Updated last week
- Code release for "TinySpeech: Attention Condensers for Deep Speech Recognition Neural Networks on Edge Devices"☆21Jun 7, 2025Updated 8 months ago
- Forced alignment decoder for Whisper.☆14Mar 13, 2024Updated last year
- Clean and modernized implementation of FastSpeech2/LightSpeech using IPA☆18Aug 16, 2024Updated last year
- phone inventory library☆17May 15, 2023Updated 2 years ago
- Hebrew grapheme to phoneme (G2P)☆89Feb 18, 2026Updated 2 weeks ago
- Unofficial implementation of ConvNeXt-TTS powered by lightning☆18Oct 20, 2024Updated last year
- ☆70Sep 3, 2024Updated last year
- Train no-reference speech quality estimators with multiple datasets via learned, per-dataset alignments.☆18Aug 1, 2025Updated 7 months ago
- ArtSpeech: Adaptive Text-to-Speech Synthesis with Articulatory Representations☆21Sep 21, 2025Updated 5 months ago
- Generate audio datasets for training Text-To-Speech models, through smart audio splitting with silence detection, and transcription using…☆30May 27, 2023Updated 2 years ago
- ☆19Mar 22, 2024Updated last year
- libvits-ncnn is an ncnn implementation of the VITS library that enables cross-platform GPU-accelerated speech synthesis.🎙️💻☆63May 6, 2023Updated 2 years ago
- Tool to make high quality text to speech (tts) corpus from audio + text books.☆28Jul 31, 2025Updated 7 months ago
- Speaker-aware CTC (SACTC) for multi-talker overlapped speech recognition.☆21May 26, 2025Updated 9 months ago
- Russian phonetic transcription☆11Nov 19, 2025Updated 3 months ago
- KittenTTS is an ultra-lightweight, CPU-friendly text-to-speech model with 15M params for real-time, high-quality voices. Open source, fas…☆23Updated this week
- eCMU: An Efficient Phase-aware Framework for Music Source Separation with Conformer (IEEE RIVF23)☆10Oct 30, 2024Updated last year
- My hybrid TTS network that combines VALL-E, VoiceBox, SpeechFlow, Seamless and TortoiseTTS into one☆26Aug 5, 2024Updated last year
- Embarrassingly Easy Fully Non-Autoregressive Zero-Shot TTS (E2 TTS) in MLX☆29Oct 15, 2024Updated last year
- Project for HIDING SPEAKER’S SEX IN SPEECH USING ZERO-EVIDENCE SPEAKER REPRESENTATION IN AN ANALYSIS/SYNTHESIS PIPELINE☆15Nov 30, 2022Updated 3 years ago
- ☆11Nov 7, 2024Updated last year
- This is not remotely close to a finished product, and does not intend to nor does this claim to be working fine-tuning code for MaskGCT. …☆13Dec 4, 2024Updated last year
- [INTERSPEECH 2024] Official code for VoxSim: A perceptual voice similarity dataset☆12Sep 29, 2025Updated 5 months ago
- A corpus of diacritized Hebrew texts (טקסט מנוקד)☆11May 4, 2022Updated 3 years ago
- ☆10Oct 16, 2025Updated 4 months ago
- Code for the paper: MACE: Leveraging Audio for Evaluating Audio Captioning Systems☆13Jan 16, 2025Updated last year
- All-in-one Speech Transcription☆10Jan 25, 2026Updated last month
- 🎵 muse: Music Separation☆11Feb 14, 2024Updated 2 years ago
- Just another FastSpeech 2 but cleaner code :)☆29Jun 28, 2024Updated last year