mush42 / mantoq
Arabic Grapheme-to-Phoneme (G2P) Conversion
☆13 · Mar 15, 2025 · Updated 10 months ago
Alternatives and similar repositories for mantoq
Users who are interested in mantoq are comparing it to the libraries listed below.
- A simple, but performant framework for mapping speech directly to categories and intents. ☆25 · Aug 8, 2024 · Updated last year
- My public domain speech index ☆13 · Sep 19, 2019 · Updated 6 years ago
- Thai Grapheme to Phoneme (G2P) Wiktionary Corpus ☆13 · Jul 25, 2022 · Updated 3 years ago
- SpeechPlus: Small LLM-Based Text-to-Speech Library 🚀 ☆20 · May 20, 2025 · Updated 8 months ago
- Code associated with the paper: CTC-DRO: Robust Optimization for Reducing Language Disparities in Speech Recognition. ☆15 · May 16, 2025 · Updated 8 months ago
- Add Arabic diacritics (tashkeel/harakat) using Rust/Python/C++/WASM and NLP models ☆45 · Oct 4, 2025 · Updated 4 months ago
- Free Dutch voice dataset ☆12 · Jan 28, 2021 · Updated 5 years ago
- ☆32 · Oct 23, 2025 · Updated 3 months ago
- ☆34 · Jun 9, 2025 · Updated 8 months ago
- A neural vocoder supporting Ring Attention, Conformer and NSF. ☆24 · Aug 1, 2025 · Updated 6 months ago
- The official submission from Speech Squad team for the MTC-AIC 2 competition of 2024 where an ASR model is developed tailored for the Egy… ☆16 · Mar 18, 2025 · Updated 10 months ago
- Whisper Speaker Identification (WSI), a cutting-edge model for multilingual speaker identification. ☆26 · Mar 17, 2025 · Updated 10 months ago
- poorman's ar-dit tts ☆45 · Dec 31, 2025 · Updated last month
- Hebrew Diacritizer ☆48 · Oct 29, 2025 · Updated 3 months ago
- ArtSpeech: Adaptive Text-to-Speech Synthesis with Articulatory Representations ☆21 · Sep 21, 2025 · Updated 4 months ago
- An official implementation of Style-Talker for Spoken Dialogue Generation ☆23 · Jan 12, 2025 · Updated last year
- DiFlow-TTS delivers low-latency zero-shot TTS via discrete flow matching and factorized speech tokens. A compact, open framework for fast… ☆51 · Updated this week
- Generate audio datasets for training Text-To-Speech models, through smart audio splitting with silence detection, and transcription using… ☆30 · May 27, 2023 · Updated 2 years ago
- SoTA open-source TTS ☆135 · Jun 7, 2025 · Updated 8 months ago
- An Enhanced Version of Piper especially for Vietnamese :) ☆26 · Apr 24, 2025 · Updated 9 months ago
- [ICASSP 2026] Task Vector in TTS: Toward Emotionally Expressive Dialectal Speech Synthesis ☆36 · Dec 24, 2025 · Updated last month
- Faster Whisper ASR transcription with CTranslate2 ☆24 · Oct 25, 2024 · Updated last year
- Fine-tune the LLM part of the Spark-TTS model ☆120 · Mar 25, 2025 · Updated 10 months ago
- ☆52 · Jul 16, 2025 · Updated 6 months ago
- A collection of all our phonemizers for dataset construction and inference ☆27 · Feb 21, 2025 · Updated 11 months ago
- ProsodyLM: Uncovering the Emerging Prosody Processing Capabilities in Speech Language Models ☆34 · Nov 18, 2025 · Updated 2 months ago
- IPA tokeniser ☆19 · Jul 28, 2025 · Updated 6 months ago
- HiFTNet wav/audio super-resolution 16/24 kHz to 48 kHz ☆24 · Jan 2, 2024 · Updated 2 years ago
- High quality text-to-speech based on StyleTTS 2. ☆71 · Dec 16, 2025 · Updated last month
- A TTS Trained on Universal Audio. ☆41 · Jun 6, 2025 · Updated 8 months ago
- Accelerate Whisper tasks, such as transcription, by parallelizing them with multiprocessing ☆25 · Oct 29, 2022 · Updated 3 years ago
- Incremental Disentanglement for Environment-Aware Zero-Shot Text-to-Speech Synthesis ☆27 · Mar 21, 2025 · Updated 10 months ago
- StyleTTS 2 Optimized Training Fork ☆33 · Feb 2, 2025 · Updated last year
- Rescoring methods for end-to-end Automatic Speech Recognition ☆26 · Sep 23, 2020 · Updated 5 years ago
- This repository implements a novel zero-shot TTS framework, named Flamed-TTS, focusing on the efficient generation and dynamic pacing in … ☆57 · Aug 9, 2025 · Updated 6 months ago
- Baselines for IS25 Source Tracing Special Session ☆33 · Jan 3, 2025 · Updated last year
- DiTTo-TTS: Diffusion Transformers for Scalable Text-to-Speech without Domain-Specific Factors ☆35 · Feb 11, 2025 · Updated last year
- Code for the blog "Neural audio codecs: how to get audio into LLMs" ☆151 · Oct 20, 2025 · Updated 3 months ago
- This repository contains code for applying Data2Vec to pretrain Keyword Transformer model as described in "Improving Label-Deficient Keyw… ☆30 · Mar 6, 2025 · Updated 11 months ago