kakaobrain / g2pmView external linksLinks
A Neural Grapheme-to-Phoneme Conversion Package for Mandarin Chinese Based on a New Open Benchmark Dataset
☆361Dec 24, 2021Updated 4 years ago
Alternatives and similar repositories for g2pm
Users that are interested in g2pm are comparing it to the libraries listed below
Sorting:
- g2pC: A Context-aware Grapheme-to-Phoneme Conversion module for Chinese☆243Jul 10, 2019Updated 6 years ago
- Chinese text normalization for speech processing☆720Mar 18, 2023Updated 2 years ago
- Chinese Mandarin Grapheme-to-Phoneme Converter. 中文轉注音或拼音 (INTERSPEECH 2022)☆377Jun 21, 2025Updated 7 months ago
- A Demo of Mandarin/Chinese TTS frontend☆285Apr 18, 2022Updated 3 years ago
- g2p: English Grapheme To Phoneme Conversion☆911Jan 5, 2023Updated 3 years ago
- Crystal - C++ implementation of a unified framework for multilingual TTS synthesis engine with SSML specification as interface.☆229Aug 17, 2020Updated 5 years ago
- A python module that convert chinese written string to read string. 一个python包:将中文书面字符串转换为口语字符串。☆122Oct 8, 2019Updated 6 years ago
- LVCNet: Efficient Condition-Dependent Modeling Network for Waveform Generation☆80Feb 24, 2021Updated 4 years ago
- Command line utility for forced alignment using Kaldi☆1,746Feb 2, 2026Updated last week
- Include Basis-MelGAN, MelGAN, HifiGAN and Multiband-HifiGAN, maybe NHV in the future.☆157Jul 2, 2021Updated 4 years ago
- Implementation of "Duration Informed Attention Network for Multimodal Synthesis" paper in PyTorch.☆184Aug 12, 2020Updated 5 years ago
- Charsiu: A neural phonetic aligner.☆329Sep 19, 2022Updated 3 years ago
- Pronunciation lexicon covering both English and Chinese languages for Automatic Speech Recognition.☆262Oct 11, 2019Updated 6 years ago
- A Generative Flow for Text-to-Speech via Monotonic Alignment Search☆701Jul 12, 2022Updated 3 years ago
- Chinese Text Normalization and Dataset☆90May 14, 2022Updated 3 years ago
- Production First and Production Ready End-to-End Text-to-Speech Toolkit☆415Nov 20, 2025Updated 2 months ago
- Helsinki Prosody Corpus and A System for Predicting Prosodic Prominence from Text☆246Oct 30, 2019Updated 6 years ago
- SyntaSpeech: Syntax-aware Generative Adversarial Text-to-Speech; IJCAI 2022; Official code☆203Sep 4, 2022Updated 3 years ago
- The official implementation of VAENAR-TTS, a VAE based non-autoregressive TTS model.☆144Jul 8, 2021Updated 4 years ago
- Pytorch implementation of "Efficienttts: an efficient and high-quality text-to-speech architecture"☆116Dec 22, 2021Updated 4 years ago
- The Implementation of FastSpeech based on pytorch.☆880Jul 6, 2023Updated 2 years ago
- TTS FrontEnd DataSet: Polyphone / Prosody / TextNormalization☆103Feb 5, 2024Updated 2 years ago
- Unofficial Parallel WaveGAN (+ MelGAN & Multi-band MelGAN & HiFi-GAN & StyleMelGAN) with Pytorch☆1,637Apr 22, 2024Updated last year
- Multilingual G2P in 100 languages☆374May 26, 2023Updated 2 years ago
- A Survey on Neural Speech Synthesis https://arxiv.org/pdf/2106.15561.pdf☆371Nov 5, 2021Updated 4 years ago
- ☆45Dec 16, 2019Updated 6 years ago
- 基于 g2pW 提升 pypinyin 的准确性☆103Jun 24, 2023Updated 2 years ago
- A PyTorch implementation of Location-Relative Attention Mechanisms For Robust Long-Form Speech Synthesis☆114Dec 2, 2020Updated 5 years ago
- Keep track of big models in audio domain, including speech, singing, music etc.☆506Sep 26, 2024Updated last year
- ☆111Apr 6, 2022Updated 3 years ago
- List of speech synthesis papers.☆1,063Jul 24, 2023Updated 2 years ago
- VocGAN: A High-Fidelity Real-time Vocoder with a Hierarchically-nested Adversarial Network☆321Jul 25, 2024Updated last year
- ☆77Apr 26, 2022Updated 3 years ago
- Official Implement of Multi-Stage Multi-Codebook (MSMC) TTS☆167Apr 10, 2024Updated last year
- ☆262Dec 8, 2022Updated 3 years ago
- speech-aligner,是一个从“人声语音”及其“语言文本”,产生音素级别时间对齐标注的工具。speech-aligner, is a tool that generate phoneme-level alignment between human speech an…☆410Apr 8, 2020Updated 5 years ago
- Adaptive Vocoder for Custom Voice☆61Sep 22, 2022Updated 3 years ago
- Official implementation of BVAE-TTS☆173Sep 26, 2022Updated 3 years ago
- iSTFTNet : Fast and Lightweight Mel-spectrogram Vocoder Incorporating Inverse Short-time Fourier Transform☆271Jul 15, 2025Updated 6 months ago