g2p: English Grapheme To Phoneme Conversion
☆922 · Jan 5, 2023 · Updated 3 years ago
Alternatives and similar repositories for g2p
Users that are interested in g2p are comparing it to the libraries listed below.
- Simple text to phones converter for multiple languages · ☆1,552 · Sep 26, 2024 · Updated last year
- A Neural Grapheme-to-Phoneme Conversion Package for Mandarin Chinese Based on a New Open Benchmark Dataset · ☆365 · Dec 24, 2021 · Updated 4 years ago
- G2P with Tensorflow · ☆680 · Jul 29, 2024 · Updated last year
- Command line utility for forced alignment using Kaldi · ☆1,829 · Mar 31, 2026 · Updated 2 months ago
- A Generative Flow for Text-to-Speech via Monotonic Alignment Search · ☆713 · Jul 12, 2022 · Updated 3 years ago
- Unofficial Parallel WaveGAN (+ MelGAN & Multi-band MelGAN & HiFi-GAN & StyleMelGAN) with Pytorch · ☆1,642 · Apr 22, 2024 · Updated 2 years ago
- g2pC: A Context-aware Grapheme-to-Phoneme Conversion module for Chinese · ☆245 · Jul 10, 2019 · Updated 6 years ago
- HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis · ☆2,351 · Jul 27, 2024 · Updated last year
- The Implementation of FastSpeech based on pytorch. · ☆880 · Jul 6, 2023 · Updated 2 years ago
- Chinese text normalization for speech processing · ☆731 · Mar 18, 2023 · Updated 3 years ago
- Phonetisaurus G2P · ☆516 · Jun 1, 2024 · Updated 2 years ago
- Efficient neural speech synthesis · ☆1,212 · Sep 21, 2024 · Updated last year
- Grapheme to phoneme conversion with deep learning. · ☆426 · Dec 8, 2023 · Updated 2 years ago
- An implementation of Microsoft's "FastSpeech 2: Fast and High-Quality End-to-End Text to Speech" · ☆2,178 · Oct 27, 2023 · Updated 2 years ago
- ☆262 · Dec 8, 2022 · Updated 3 years ago
- List of speech synthesis papers. · ☆1,072 · Jul 24, 2023 · Updated 2 years ago
- A Python wrapper for the high-quality vocoder "World" · ☆788 · Jan 21, 2025 · Updated last year
- Charsiu: A neural phonetic aligner. · ☆344 · Sep 19, 2022 · Updated 3 years ago
- Mellotron: a multispeaker voice synthesis model based on Tacotron 2 GST that can make a voice emote and sing without emotive or singing t… · ☆871 · Jul 22, 2023 · Updated 2 years ago
- End-to-End Speech Processing Toolkit · ☆9,855 · Updated this week
- CSS10: A Collection of Single Speaker Speech Datasets for 10 Languages · ☆487 · Mar 6, 2020 · Updated 6 years ago
- Pronunciation lexicon covering both English and Chinese languages for Automatic Speech Recognition. · ☆264 · Oct 11, 2019 · Updated 6 years ago
- Large, modern dataset for speech recognition · ☆726 · Feb 26, 2024 · Updated 2 years ago
- A Pytorch Implementation of "Neural Speech Synthesis with Transformer Network" · ☆691 · Nov 8, 2023 · Updated 2 years ago
- This is a github repository of the abandonware Sequitur G2P by Bisani & Ney · ☆174 · Dec 16, 2025 · Updated 5 months ago
- An official reimplementation of the method described in the INTERSPEECH 2021 paper - Speech Resynthesis from Discrete Disentangled Self-S… · ☆414 · Aug 29, 2023 · Updated 2 years ago
- This is the main repository of open-sourced speech technology by Huawei Noah's Ark Lab. · ☆604 · Sep 18, 2023 · Updated 2 years ago
- speech-aligner: a tool that generates phoneme-level time-alignment annotations from human speech and its corresponding text. · ☆410 · Apr 8, 2020 · Updated 6 years ago
- A torch implementation of a recursion which turns out to be useful for RNN-T. · ☆149 · Aug 25, 2023 · Updated 2 years ago
- PPG-Based Voice Conversion · ☆351 · Jul 22, 2022 · Updated 3 years ago
- Tools for handling multimodal data in machine learning projects. · ☆1,131 · May 28, 2026 · Updated last week
- VQ-VAE for Acoustic Unit Discovery and Voice Conversion · ☆339 · Jul 6, 2023 · Updated 2 years ago
- Official implementation of Meta-StyleSpeech and StyleSpeech · ☆252 · Feb 9, 2022 · Updated 4 years ago
- Segment an audio file and obtain utterance alignments. (Python package) · ☆347 · May 15, 2024 · Updated 2 years ago
- Self-Supervised Speech Pre-training and Representation Learning Toolkit · ☆2,554 · Mar 12, 2026 · Updated 2 months ago
- A tensorflow implementation of the "Style Tokens: Unsupervised Style Modeling, Control and Transfer in End-to-End Speech Synthesis" · ☆367 · Dec 6, 2018 · Updated 7 years ago
- Unsupervised Speech Decomposition Via Triple Information Bottleneck · ☆700 · Oct 23, 2024 · Updated last year
- A wrapper around speech quality metrics MOSNet, BSSEval, STOI, PESQ, SRMR, SISDR · ☆1,051 · Jul 5, 2023 · Updated 2 years ago
- ☆1,460 · Feb 11, 2024 · Updated 2 years ago