thewh1teagle / phonikudLinks
Hebrew grapheme to phoneme (G2P)
☆85Updated last month
Alternatives and similar repositories for phonikud
Users that are interested in phonikud are comparing it to the libraries listed below
Sorting:
- Mission to create a Hebrew TTS model as powerful and user-friendly as WaveNet☆38Updated last year
- The official implementation of "A Language Modeling Approach to Diacritic-Free Hebrew TTS"☆105Updated 7 months ago
- Arabic Grapheme-to-Phoneme (G2P) Conversion☆13Updated 10 months ago
- ☆57Updated 2 years ago
- Pronounce Arabic words☆19Updated 6 years ago
- The EveryVoice TTS Toolkit - Text To Speech for your language☆41Updated this week
- Universal multilingual automatic speech transcription into IPA☆74Updated 11 months ago
- Hebrew Diacritizer☆48Updated 3 months ago
- phone inventory library☆17Updated 2 years ago
- This app is intended to automatically create a corpus for ASR systems using pseudo-labeling.☆27Updated last year
- Zero-shot multimodal punctuation insertion and truecasing using Whisper☆119Updated 2 years ago
- 🎹 pyannote + 🗒 notebook = pyannotebook☆26Updated 2 years ago
- StyleTTS2 + Vocos as a Decoder☆13Updated 10 months ago
- SylNet: An Adaptable End-to-End Syllable Count Estimator for Speech☆27Updated 2 years ago
- Unicode Standard tokenization routines and orthography profile segmentation☆38Updated 11 months ago
- Syllable Segmentation and Cross-Lingual Generalization in a Visually Grounded, Self-Supervised Speech Model☆34Updated 2 years ago
- ☆11Updated 4 months ago
- scipts for working with open.bible data☆26Updated 4 years ago
- ☆14Updated 10 years ago
- A multilingual phoneme recognizer capable of generalizing zero-shot to unseen phoneme inventories.☆27Updated 10 months ago
- Python module for syllabifying English ARPABET transcriptions☆72Updated 6 years ago
- Finally, some decent sample sentences☆23Updated 2 years ago
- Using YouTube to prepare a speech recognition dataset for any language☆10Updated 4 years ago
- Labeled data for homograph disambiguation☆63Updated 2 years ago
- A corpus of diacritized Hebrew texts (טקסט מנוקד)☆11Updated 3 years ago
- Text to speech is an emerging zone of AI. This repository helps to create a dataset with audio and transcripts for personalized text to s…☆28Updated 2 years ago
- Workflow for forced alignment between languages☆23Updated 2 weeks ago
- A high-quality, varied ~30hr voice dataset suitable for training a TTS model☆63Updated 3 years ago
- A set of tools for working with accent data in Mozilla's Common Voice dataset☆14Updated 2 years ago
- Zero-Shot Foreign Accent Conversion without a Native Reference☆36Updated last year