mush42/mantoq

Arabic Grapheme-to-Phoneme (G2P) Conversion

☆16

Alternatives and similar repositories for mantoq

Users that are interested in mantoq are comparing it to the libraries listed below.

IDEA-Emdoor-Lab / UniTTS
A TTS Trained on Universal Audio.
☆41 · Jun 6, 2025 · Updated last year
mush42 / libtashkeel
Add Arabic diacritics (tashkeel/harakat) using Rust/Python/C++/WASM and NLP models
☆50 · Oct 4, 2025 · Updated 9 months ago
yousefkotp / Egyptian-Arabic-ASR-and-Diarization
The official submission from Speech Squad team for the MTC-AIC 2 competition of 2024 where an ASR model is developed tailored for the Egy…
☆18 · Mar 9, 2026 · Updated 4 months ago
mush42 / optispeech
A lightweight end-to-end text-to-speech model
☆130 · Feb 23, 2025 · Updated last year
PyThaiNLP / thai-g2p-wiktionary-corpus
Thai Grapheme to Phoneme (G2P) Wiktionary Corpus
☆13 · Jul 25, 2022 · Updated 4 years ago
mkgs210 / batch_fish_speech
Boost your efficiency with Fish Speech Batch Inference. Easily process multiple texts and achieve consistently great results. 🗨️🐟
☆27 · Aug 4, 2025 · Updated 11 months ago
elazarg / nakdimon
Hebrew Diacritizer
☆48 · Updated this week
stlohrey / dia-finetuning
A TTS model capable of generating ultra-realistic dialogue in one pass.
☆131 · Jul 25, 2025 · Updated last year
dscripka / openSpeechToIntent
A simple, but performant framework for mapping speech directly to categories and intents.
☆27 · Aug 8, 2024 · Updated last year
carlfm01 / my-speech-datasets
My public domain speech index
☆13 · Sep 19, 2019 · Updated 6 years ago
Navid-Gen-AI / arabic-tts-arena
The modal backend for Arabic TTS Arena
☆26 · Updated this week
duerig / StyleTTS2
StyleTTS 2 Optimized Training Fork
☆32 · Feb 2, 2025 · Updated last year
SparkAudio / SparkVox
☆37 · Jun 9, 2025 · Updated last year
nii-yamagishilab / speaker_sex_attribute_privacy
Project for HIDING SPEAKER’S SEX IN SPEECH USING ZERO-EVIDENCE SPEAKER REPRESENTATION IN AN ANALYSIS/SYNTHESIS PIPELINE
☆15 · Nov 30, 2022 · Updated 3 years ago
thu-spmi / CTC-TTS
Code for CTC-TTS: LLM-based dual-streaming text-to-speech with CTC alignment, Interspeech 2026.
☆20 · Jun 9, 2026 · Updated last month
mush42 / hareef
state-of-the-art models for diacritics restoration for Arabic language
☆16 · Feb 23, 2025 · Updated last year
lugan113 / SynTTS-Commands-Official
SynTTS-Commands is a large-scale, multilingual (English & Chinese) synthetic speech command dataset designed for low-power Keyword Spotti…
☆17 · Feb 5, 2026 · Updated 5 months ago
NickZaitsev / ru-normalizr
ru-normalizr: the best Russian text normalizer without an LLM. Converts numbers, dates, times, abbreviations, Roman numerals, symbols, and Latin script into …
☆18 · Updated this week
EraX-AI / viPiper
An Enhanced Version of Piper especially for Vietnamese :)
☆27 · Apr 24, 2025 · Updated last year
primepake / F5-TTS-meanflow-multilingual
Meanflow and multilingual for F5-TTS model
☆16 · Aug 23, 2025 · Updated 11 months ago
changelinglab / PhoneticXeus
A universal phone recognizer that can transcribe speech in 70+ languages into IPA
☆26 · Jun 9, 2026 · Updated last month
WangHelin1997 / Aty-TTS
Aty-TTS: Improving fairness for spoken language understanding in atypical speech with Text-to-Speech
☆11 · May 14, 2025 · Updated last year
r-dh / dutch-vl-tts
Free Dutch voice dataset
☆13 · Jan 28, 2021 · Updated 5 years ago
motazsaad / ara-pronunciation-tool
A python tool that converts Arabic diacritised text to a sequence of phonemes and creates a pronunciation dictionary. This code is based …
☆15 · Sep 5, 2017 · Updated 8 years ago
maida-ai / maida
Reliability Layer for your AI Agents
☆24 · Updated this week
maxmelichov / BlueTTS
Fastest Open Source TTS Model
☆77 · Updated this week
Zhongxu-Wang / ArtSpeech
ArtSpeech: Adaptive Text-to-Speech Synthesis with Articulatory Representations
☆22 · Sep 21, 2025 · Updated 10 months ago
ASLP-lab / Smart-Glass-Challenge
☆17 · Jun 16, 2026 · Updated last month
Tikai7 / DiTTO-TTS
DiTTo-TTS: Diffusion Transformers for Scalable Text-to-Speech without Domain-Specific Factors
☆39 · Feb 11, 2025 · Updated last year
bookbot-hive / k2-indonesian-asr
Indonesian speech/phoneme recognizer powered by Kaldi 2.0 (lhotse, icefall, sherpa).
☆16 · Jun 30, 2023 · Updated 3 years ago
liuhuang31 / HiFTNet-sr
HiFTNet wav/audio super-resolution 16/24 kHz to 48 kHz
☆24 · Jan 2, 2024 · Updated 2 years ago
xi-j / Style-Talker
An official implementation of Style-Talker for Spoken Dialogue Generation
☆23 · Jan 12, 2025 · Updated last year
AIRI-Institute / AI4TALK
☆13 · Dec 7, 2022 · Updated 3 years ago
marytts / pavoque-data
PAVOQUE Corpus of Expressive Speech
☆12 · Aug 2, 2016 · Updated 9 years ago
LAION-AI / scaled-echo-tts
Scaled diffusion transformer for text-to-speech synthesis (DiT + T5Gemma2 conditioning, TorchTitan & Megatron backends, tested up to 1024…
☆24 · Mar 29, 2026 · Updated 3 months ago
jakariaemon / WSI
Whisper Speaker Identification (WSI), a cutting-edge model for multilingual speaker identification.
☆26 · Jun 29, 2026 · Updated 3 weeks ago
jxnl / resume-nextjs
☆10 · Jan 16, 2024 · Updated 2 years ago
freds0 / CML-TTS-Dataset
CML-TTS: A Multilingual Dataset for Speech Synthesis
☆36 · Jul 31, 2024 · Updated last year
alexisdmacintyre / SpeechBreathingToolbox
Tools for the automatic detection of speech-related inhalation events and characterisation of the speech respiratory cycle.
☆11 · Feb 17, 2024 · Updated 2 years ago