kakaobrain/g2pm

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/kakaobrain/g2pm)

kakaobrain / g2pm

A Neural Grapheme-to-Phoneme Conversion Package for Mandarin Chinese Based on a New Open Benchmark Dataset

☆367

Alternatives and similar repositories for g2pm

Users that are interested in g2pm are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

Kyubyong / g2pC
View on GitHub
g2pC: A Context-aware Grapheme-to-Phoneme Conversion module for Chinese
☆246Jul 10, 2019Updated 7 years ago
GitYCC / g2pW
View on GitHub
Chinese Mandarin Grapheme-to-Phoneme Converter. 中文轉注音或拼音 (INTERSPEECH 2022)
☆404Jun 21, 2025Updated last year
speechio / chinese_text_normalization
View on GitHub
Chinese text normalization for speech processing
☆733Mar 18, 2023Updated 3 years ago
makerjackie / MTTS
View on GitHub
A Demo of Mandarin/Chinese TTS frontend
☆284Apr 18, 2022Updated 4 years ago
Kyubyong / g2p
View on GitHub
g2p: English Grapheme To Phoneme Conversion
☆927Jan 5, 2023Updated 3 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
thuhcsi / Crystal
View on GitHub
Crystal - C++ implementation of a unified framework for multilingual TTS synthesis engine with SSML specification as interface.
☆230Aug 17, 2020Updated 5 years ago
open-speech / cn-text-normalizer
View on GitHub
A python module that convert chinese written string to read string. 一个python包：将中文书面字符串转换为口语字符串。
☆124Oct 8, 2019Updated 6 years ago
MontrealCorpusTools / Montreal-Forced-Aligner
View on GitHub
Command line utility for forced alignment using Kaldi
☆1,847Jul 11, 2026Updated last week
zceng / LVCNet
View on GitHub
LVCNet: Efficient Condition-Dependent Modeling Network for Waveform Generation
☆80Feb 24, 2021Updated 5 years ago
xcmyz / FastVocoder
View on GitHub
Include Basis-MelGAN, MelGAN, HifiGAN and Multiband-HifiGAN, maybe NHV in the future.
☆157Jul 2, 2021Updated 5 years ago
thuhcsi / VAENAR-TTS
View on GitHub
The official implementation of VAENAR-TTS, a VAE based non-autoregressive TTS model.
☆144Jul 8, 2021Updated 5 years ago
ivanvovk / durian-pytorch
View on GitHub
Implementation of "Duration Informed Attention Network for Multimodal Synthesis" paper in PyTorch.
☆184Aug 12, 2020Updated 5 years ago
thuhcsi / FlatTN
View on GitHub
Chinese Text Normalization and Dataset
☆91May 14, 2022Updated 4 years ago
Helsinki-NLP / prosody
View on GitHub
Helsinki Prosody Corpus and A System for Predicting Prosodic Prominence from Text
☆249Oct 30, 2019Updated 6 years ago
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
xcmyz / FastSpeech
View on GitHub
The Implementation of FastSpeech based on pytorch.
☆885Jul 6, 2023Updated 3 years ago
wenet-e2e / wetts
View on GitHub
Production First and Production Ready End-to-End Text-to-Speech Toolkit
☆416Nov 20, 2025Updated 8 months ago
mozillazg / pypinyin-g2pW
View on GitHub
基于 g2pW 提升 pypinyin 的准确性
☆104Jun 24, 2023Updated 3 years ago
speechio / BigCiDian
View on GitHub
Pronunciation lexicon covering both English and Chinese languages for Automatic Speech Recognition.
☆263Oct 11, 2019Updated 6 years ago
yerfor / SyntaSpeech
View on GitHub
SyntaSpeech: Syntax-aware Generative Adversarial Text-to-Speech; IJCAI 2022; Official code
☆201Sep 4, 2022Updated 3 years ago
jaywalnut310 / glow-tts
View on GitHub
A Generative Flow for Text-to-Speech via Monotonic Alignment Search
☆712Jul 12, 2022Updated 4 years ago
kan-bayashi / ParallelWaveGAN
View on GitHub
Unofficial Parallel WaveGAN (+ MelGAN & Multi-band MelGAN & HiFi-GAN & StyleMelGAN) with Pytorch
☆1,646Apr 22, 2024Updated 2 years ago
liusongxiang / efficient_tts
View on GitHub
Pytorch implementation of "Efficienttts: an efficient and high-quality text-to-speech architecture"
☆116Dec 22, 2021Updated 4 years ago
lingjzhu / charsiu
View on GitHub
Charsiu: A neural phonetic aligner.
☆346Sep 19, 2022Updated 3 years ago
Proton VPN Special Offer - Get 70% off • Ad
Special partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
makerjackie / tts-frontend-dataset
View on GitHub
TTS FrontEnd DataSet: Polyphone / Prosody / TextNormalization
☆104Feb 5, 2024Updated 2 years ago
tts-tutorial / survey
View on GitHub
A Survey on Neural Speech Synthesis https://arxiv.org/pdf/2106.15561.pdf
☆371Nov 5, 2021Updated 4 years ago
Daisyqk / Automatic-Prosody-Annotation
View on GitHub
☆112Mar 9, 2026Updated 4 months ago
lingjzhu / CharsiuG2P
View on GitHub
Multilingual G2P in 100 languages
☆389May 26, 2023Updated 3 years ago
yanggeng1995 / Multi-band-WaveRNN
View on GitHub
☆45Dec 16, 2019Updated 6 years ago
descriptinc / cargan
View on GitHub
Official repository for the paper "Chunked Autoregressive GAN for Conditional Waveform Synthesis"
☆193Dec 8, 2022Updated 3 years ago
Yablon / auorange
View on GitHub
Audio LPC (linear prediction code) using mel spectorgram, compatible for LPCNet
☆62Jun 8, 2021Updated 5 years ago
Zeqiang-Lai / Prosody_Prediction
View on GitHub
Predict prosody labels for Chinese sentences.
☆42Jul 7, 2022Updated 4 years ago
bshall / Tacotron
View on GitHub
A PyTorch implementation of Location-Relative Attention Mechanisms For Robust Long-Form Speech Synthesis
☆115Dec 2, 2020Updated 5 years ago
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
thuhcsi / SpanPSP
View on GitHub
☆76Apr 26, 2022Updated 4 years ago
janvainer / speedyspeech
View on GitHub
☆262Dec 8, 2022Updated 3 years ago
hhguo / MSMC-TTS
View on GitHub
Official Implement of Multi-Stage Multi-Codebook (MSMC) TTS
☆168Apr 10, 2024Updated 2 years ago
wenet-e2e / speech-synthesis-paper
View on GitHub
List of speech synthesis papers.
☆1,074Jul 24, 2023Updated 2 years ago
liusongxiang / Large-Audio-Models
View on GitHub
Keep track of big models in audio domain, including speech, singing, music etc.
☆515Jul 3, 2026Updated 2 weeks ago
mutiann / neural-lexicon-reader
View on GitHub
Neural Lexicon Reader: Reduce Pronunciation Errors in End-to-end TTS by Leveraging External Textual Knowledge
☆21Jul 25, 2022Updated 3 years ago
open-speech / speech-aligner
View on GitHub
speech-aligner，是一个从“人声语音”及其“语言文本”，产生音素级别时间对齐标注的工具。speech-aligner, is a tool that generate phoneme-level alignment between human speech an…
☆410Apr 8, 2020Updated 6 years ago