Kyubyong/g2p

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/Kyubyong/g2p)

Kyubyong / g2p

g2p: English Grapheme To Phoneme Conversion

☆927

Alternatives and similar repositories for g2p

Users that are interested in g2p are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

bootphon / phonemizer
View on GitHub
Simple text to phones converter for multiple languages
☆1,557Sep 26, 2024Updated last year
kakaobrain / g2pm
View on GitHub
A Neural Grapheme-to-Phoneme Conversion Package for Mandarin Chinese Based on a New Open Benchmark Dataset
☆367Dec 24, 2021Updated 4 years ago
cmusphinx / g2p-seq2seq
View on GitHub
G2P with Tensorflow
☆680Jul 29, 2024Updated last year
MontrealCorpusTools / Montreal-Forced-Aligner
View on GitHub
Command line utility for forced alignment using Kaldi
☆1,847Jul 11, 2026Updated last week
jaywalnut310 / glow-tts
View on GitHub
A Generative Flow for Text-to-Speech via Monotonic Alignment Search
☆712Jul 12, 2022Updated 4 years ago
Deploy open-source AI quickly and easily - Special Bonus Offer • Ad
Runpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
kan-bayashi / ParallelWaveGAN
View on GitHub
Unofficial Parallel WaveGAN (+ MelGAN & Multi-band MelGAN & HiFi-GAN & StyleMelGAN) with Pytorch
☆1,645Apr 22, 2024Updated 2 years ago
Kyubyong / g2pC
View on GitHub
g2pC: A Context-aware Grapheme-to-Phoneme Conversion module for Chinese
☆246Jul 10, 2019Updated 7 years ago
jik876 / hifi-gan
View on GitHub
HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis
☆2,357Jul 27, 2024Updated last year
xcmyz / FastSpeech
View on GitHub
The Implementation of FastSpeech based on pytorch.
☆885Jul 6, 2023Updated 3 years ago
speechio / chinese_text_normalization
View on GitHub
Chinese text normalization for speech processing
☆733Mar 18, 2023Updated 3 years ago
AdolfVonKleist / Phonetisaurus
View on GitHub
Phonetisaurus G2P
☆516Jun 1, 2024Updated 2 years ago
xiph / LPCNet
View on GitHub
Efficient neural speech synthesis
☆1,218Sep 21, 2024Updated last year
axelspringer / DeepPhonemizer
View on GitHub
Grapheme to phoneme conversion with deep learning.
☆432Dec 8, 2023Updated 2 years ago
ming024 / FastSpeech2
View on GitHub
An implementation of Microsoft's "FastSpeech 2: Fast and High-Quality End-to-End Text to Speech"
☆2,185Oct 27, 2023Updated 2 years ago
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
janvainer / speedyspeech
View on GitHub
☆262Dec 8, 2022Updated 3 years ago
wenet-e2e / speech-synthesis-paper
View on GitHub
List of speech synthesis papers.
☆1,074Jul 24, 2023Updated 2 years ago
JeremyCCHsu / Python-Wrapper-for-World-Vocoder
View on GitHub
A Python wrapper for the high-quality vocoder "World"
☆789Jan 21, 2025Updated last year
lingjzhu / charsiu
View on GitHub
Charsiu: A neural phonetic aligner.
☆346Sep 19, 2022Updated 3 years ago
espnet / espnet
View on GitHub
End-to-End Speech Processing Toolkit
☆9,891Updated this week
NVIDIA / mellotron
View on GitHub
Mellotron: a multispeaker voice synthesis model based on Tacotron 2 GST that can make a voice emote and sing without emotive or singing t…
☆870Jul 22, 2023Updated 2 years ago
Kyubyong / css10
View on GitHub
CSS10: A Collection of Single Speaker Speech Datasets for 10 Languages
☆490Mar 6, 2020Updated 6 years ago
speechio / BigCiDian
View on GitHub
Pronunciation lexicon covering both English and Chinese languages for Automatic Speech Recognition.
☆263Oct 11, 2019Updated 6 years ago
SpeechColab / GigaSpeech
View on GitHub
Large, modern dataset for speech recognition
☆731Feb 26, 2024Updated 2 years ago
Deploy open-source AI quickly and easily - Special Bonus Offer • Ad
Runpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
soobinseo / Transformer-TTS
View on GitHub
A Pytorch Implementation of "Neural Speech Synthesis with Transformer Network"
☆690Nov 8, 2023Updated 2 years ago
sequitur-g2p / sequitur-g2p
View on GitHub
This is a github repository of the abandonware Sequitur G2P by Bisani & Ney
☆174Dec 16, 2025Updated 7 months ago
huawei-noah / Speech-Backbones
View on GitHub
This is the main repository of open-sourced speech technology by Huawei Noah's Ark Lab.
☆604Sep 18, 2023Updated 2 years ago
facebookresearch / speech-resynthesis
View on GitHub
An official reimplementation of the method described in the INTERSPEECH 2021 paper - Speech Resynthesis from Discrete Disentangled Self-S…
☆416Aug 29, 2023Updated 2 years ago
k2-fsa / fast_rnnt
View on GitHub
A torch implementation of a recursion which turns out to be useful for RNN-T.
☆149Aug 25, 2023Updated 2 years ago
open-speech / speech-aligner
View on GitHub
speech-aligner，是一个从“人声语音”及其“语言文本”，产生音素级别时间对齐标注的工具。speech-aligner, is a tool that generate phoneme-level alignment between human speech an…
☆410Apr 8, 2020Updated 6 years ago
bshall / ZeroSpeech
View on GitHub
VQ-VAE for Acoustic Unit Discovery and Voice Conversion
☆339Jul 6, 2023Updated 3 years ago
lhotse-speech / lhotse
View on GitHub
Tools for handling multimodal data in machine learning projects.
☆1,143Jun 22, 2026Updated 3 weeks ago
liusongxiang / ppg-vc
View on GitHub
PPG-Based Voice Conversion
☆348Jul 22, 2022Updated 3 years ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
lumaku / ctc-segmentation
View on GitHub
Segment an audio file and obtain utterance alignments. (Python package)
☆348May 15, 2024Updated 2 years ago
KevinMIN95 / StyleSpeech
View on GitHub
Official implementation of Meta-StyleSpeech and StyleSpeech
☆253Feb 9, 2022Updated 4 years ago
s3prl / s3prl
View on GitHub
Self-Supervised Speech Pre-training and Representation Learning Toolkit
☆2,557Mar 12, 2026Updated 4 months ago
microsoft / NeuralSpeech
View on GitHub
☆1,461Feb 11, 2024Updated 2 years ago
aliutkus / speechmetrics
View on GitHub
A wrapper around speech quality metrics MOSNet, BSSEval, STOI, PESQ, SRMR, SISDR
☆1,050Jul 5, 2023Updated 3 years ago
auspicious3000 / SpeechSplit
View on GitHub
Unsupervised Speech Decomposition Via Triple Information Bottleneck
☆697Oct 23, 2024Updated last year
yangdongchao / AcademiCodec
View on GitHub
AcademiCodec: An Open Source Audio Codec Model for Academic Research
☆674Dec 27, 2023Updated 2 years ago