petronny/g2p

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/petronny/g2p)

petronny / g2p

Pre-trained grapheme-to-phoneme (G2P) models

☆26

Alternatives and similar repositories for g2p

Users that are interested in g2p are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

ldong1111 / GraphemeBERT
View on GitHub
This is the source code of the paper "Neural grapheme-to-phoneme conversion with pretrained grapheme models
☆48Mar 25, 2022Updated 4 years ago
hardmstar / BhaDistanceBasedZCRofGaussian
View on GitHub
calculate bhattacharyya distance based on zero cross rate feature between different Gaussian model for speech emotion recognition. corpus…
☆11Oct 17, 2018Updated 7 years ago
dtreskunov / tiny-kaldi
View on GitHub
Kaldi API for Android, Python and Node. Forked from vosk-api with minimal modifications.
☆16Nov 14, 2020Updated 5 years ago
aflr-archive / viseme-to-video
View on GitHub
Creates video from TTS output and viseme images.
☆16Jun 18, 2022Updated 4 years ago
Labmem-Zhouyx / CDFSE_FastSpeech2
View on GitHub
The Official Implementation of “Content-Dependent Fine-Grained Speaker Embedding for Zero-Shot Speaker Adaptation in Text-to-Speech Synth…
☆86Dec 20, 2022Updated 3 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
thuhcsi / SpanPSP
View on GitHub
☆76Apr 26, 2022Updated 4 years ago
papercup-open-source / subscale-wavernn
View on GitHub
Implementation of the subscale framework from the WaveRNN paper, building on top of Fatchord's WaveRNN repo
☆19Oct 8, 2020Updated 5 years ago
ddlBoJack / Awesome-Speech-Pretraining
View on GitHub
Paper, Code and Statistics for Self-Supervised Learning and Pre-Training on Speech.
☆212Jan 18, 2024Updated 2 years ago
speechio / BigCiDian
View on GitHub
Pronunciation lexicon covering both English and Chinese languages for Automatic Speech Recognition.
☆263Oct 11, 2019Updated 6 years ago
sequitur-g2p / sequitur-g2p
View on GitHub
This is a github repository of the abandonware Sequitur G2P by Bisani & Ney
☆174Dec 16, 2025Updated 7 months ago
thuhcsi / Contextual-Biasing-Dataset
View on GitHub
open-source Mandarian biased word dataset
☆14Sep 21, 2023Updated 2 years ago
peak1995 / tacotron-chinese
View on GitHub
☆15Apr 17, 2019Updated 7 years ago
sagar-spkt / SV2MTTS
View on GitHub
Voice Cloning using SV with GE2E and Tacotron
☆12Mar 25, 2023Updated 3 years ago
dcaulley / av_diarization
View on GitHub
AudioVisual Diarization - Supervised and Unsupervised
☆15Nov 22, 2022Updated 3 years ago
Deploy open-source AI quickly and easily - Special Bonus Offer • Ad
Runpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
pengzhendong / g2p-mix
View on GitHub
Grapheme-to-Phoneme for Mixed Chinese (Mandarin or Cantonese) and English.
☆115Dec 2, 2025Updated 7 months ago
mozillazg / pypinyin-g2pW
View on GitHub
基于 g2pW 提升 pypinyin 的准确性
☆104Jun 24, 2023Updated 3 years ago
thuhcsi / SpeechCraft
View on GitHub
The official repository of SpeechCraft dataset, a large-scale expressive bilingual speech dataset with natural language descriptions.
☆198Feb 28, 2026Updated 4 months ago
Many0therFunctions / MaskGCT-Text-To-Semantic-Finetune
View on GitHub
This is not remotely close to a finished product, and does not intend to nor does this claim to be working fine-tuning code for MaskGCT. …
☆13Dec 4, 2024Updated last year
tango4j / llm_speaker_tagging
View on GitHub
SLT 2024 Challenge: Post-ASR-Speaker-Tagging
☆16Jun 16, 2024Updated 2 years ago
lallubharteja / KWS-Scripts
View on GitHub
Keyword Search Recipe for Subword ASR
☆30Jul 12, 2019Updated 7 years ago
thuhcsi / tacotron
View on GitHub
PyTorch implementation of Tacotron and Tacotron2
☆34Jul 19, 2022Updated 4 years ago
tts-tutorial / icassp2022
View on GitHub
☆64May 23, 2022Updated 4 years ago
quinte22 / bumblebee
View on GitHub
bumble bee transformer
☆14Apr 19, 2021Updated 5 years ago
End-to-end encrypted cloud storage - Proton Drive • Ad
Special offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
thuhcsi / VAENAR-TTS
View on GitHub
The official implementation of VAENAR-TTS, a VAE based non-autoregressive TTS model.
☆144Jul 8, 2021Updated 5 years ago
sos1sos2Sixteen / aishell-3-baseline-fc
View on GitHub
The code for aishell-3 baseline acoustic model
☆70Nov 30, 2020Updated 5 years ago
worldarena / WorldArena
View on GitHub
the official repository of the WorldArena benchmark
☆14Mar 23, 2026Updated 4 months ago
HeimingX / TAG
View on GitHub
Official code for Attention-driven GUI Grounding, AAAI2025
☆16Dec 17, 2024Updated last year
bbepoch / CuteChineseTTS
View on GitHub
An open source Chinese TTS system, with opened data, opened full pipeline code and opened model
☆36Dec 29, 2018Updated 7 years ago
Riroaki / Chinese-Rhythm-Predictor
View on GitHub
基于随机森林和条件随机场的中文韵律预测模型
☆28Jul 25, 2024Updated 2 years ago
voberoi / voice-search-with-whisper-duckdb-and-metaphone
View on GitHub
This repository is a voice search demo using OpenAI Whisper, DuckDB, and the Metaphone algorithm. The associate blog post is here: https:…
☆13May 15, 2024Updated 2 years ago
npujcong / Chinese_PSP
View on GitHub
Chinese Prosodic Structure Prediction
☆10May 18, 2019Updated 7 years ago
makerjackie / tts-frontend-dataset
View on GitHub
TTS FrontEnd DataSet: Polyphone / Prosody / TextNormalization
☆104Feb 5, 2024Updated 2 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
sarang0909 / faq_chatbot
View on GitHub
COVID-19 FAQ chatbot in python along with user interfce
☆10Feb 2, 2024Updated 2 years ago
azraelkuan / repgan
View on GitHub
RepVgg + HiFiGAN
☆36Aug 10, 2022Updated 3 years ago
R1ckShi / SeACo-Paraformer
View on GitHub
[ICASSP2023] Source code, model links and open test sets for paper SeACo-Paraformer.
☆44Mar 15, 2024Updated 2 years ago
lovemefan / Paraformer-webserver
View on GitHub
paraformer web server build with sanic
☆28May 3, 2023Updated 3 years ago
Speech-Lab-IITM / English_ASR_Challenge
View on GitHub
English ASR Challenge organized by Speech Lab, IIT Madras
☆10Feb 3, 2021Updated 5 years ago
ivandotv / nextjs-koa-api
View on GitHub
Koa.js framework setup to run within Next.js API routes.
☆11May 23, 2026Updated 2 months ago
HKAB / whisper-finetune-vietnamese
View on GitHub
Whisper finetuned on VinBigdata-VLSP2020-100h + KenLM
☆38Oct 6, 2023Updated 2 years ago