Grapheme-to-phoneme (G2P) conversion is the process of generating pronunciation for words based on their written form. It has a highly essential role for natural language processing, text-to-speech synthesis and automatic speech recognition systems. This project was adapted from https://github.com/hajix/G2P.
☆19Jun 14, 2021Updated 4 years ago
Alternatives and similar repositories for neural-g2p-portuguese
Users that are interested in neural-g2p-portuguese are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Digital Speech Processing in PyTorch.☆15Aug 12, 2022Updated 3 years ago
- ☆14Jun 16, 2023Updated 2 years ago
- ☆17Aug 27, 2025Updated 8 months ago
- Simple tool for speech dataset augmentation for modeling various prosodies.☆14Jan 14, 2021Updated 5 years ago
- ☆16Dec 23, 2021Updated 4 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Code for the winning solution in the SE&R 2022 Challenge - SER track.☆16Mar 28, 2023Updated 3 years ago
- [EMNLP 2025 Findings] Official code for EZ-VC: Easy Zero-shot Any-to-Any Voice Conversion☆37Sep 9, 2025Updated 8 months ago
- ☆20Jul 22, 2022Updated 3 years ago
- Demo for 2022 ICASSP☆64Jun 14, 2022Updated 3 years ago
- TTS for Singlish using Tacotron2, the IMDA corpus, and Pachyderm.☆11Jan 11, 2020Updated 6 years ago
- Repository for sharing the data in the Tamasheq language, one of the target languages for the low-resource speech translation track at IW…☆18Nov 30, 2022Updated 3 years ago
- A simple tool to easily use Montreal Forced Aligner. Also provide alignment(TextGrid) retrieved from ESD.☆45May 25, 2023Updated 2 years ago
- ☆25Mar 12, 2022Updated 4 years ago
- 基于FreeVC的歌声转换☆21Dec 16, 2022Updated 3 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Emotion Recognition from Brazilian Portuguese Informal Spontaneous Speech☆22Mar 21, 2022Updated 4 years ago
- Baseline kaldi script for UA-SPEECH corpus☆32Oct 16, 2024Updated last year
- ☆26Apr 21, 2021Updated 5 years ago
- A Tensorflow Implementation of the FastSpeech 2: Fast and High-Quality End-to-End Text to Speech☆11Aug 12, 2020Updated 5 years ago
- This repository provides data and code for "Vox Populi, Vox DIY: Benchmark Dataset for Crowdsourced Audio Transcription" paper.☆16Jul 22, 2021Updated 4 years ago
- [SpeechCom Journal] Learning and controlling the source-filter representation of speech with a variational autoencoder☆46Apr 18, 2023Updated 3 years ago
- A pakage for crawling audio from Youtube☆42Aug 8, 2023Updated 2 years ago
- Codebase for 'A Real-Time Lyrics Alignment System Using Chroma And Phonetic Features For Classical Vocal Performance', ICASSP 2024☆14Oct 4, 2024Updated last year
- Evaluation tool used in the BigVSAN paper☆14Mar 22, 2024Updated 2 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Unofficial Implementation of Zero-Shot Text-to-Speech for Text-Based Insertion in Audio Narration☆34Sep 24, 2021Updated 4 years ago
- This repo contains the baseline model recipes and pre-trained model for GramVanni hindi ASR challenge☆15Mar 26, 2022Updated 4 years ago
- High-Fidelity Neural Phonetic Posteriorgrams☆122Feb 23, 2025Updated last year
- Python wrappers for Kaldi Levenshtein's distance and alignment code.☆68Apr 9, 2026Updated last month
- ☆69Jan 7, 2021Updated 5 years ago
- GPT-style network for phonemization with durations of text☆69Mar 21, 2024Updated 2 years ago
- Speaker embedding for VI-SVC and VI-SVS, alse for VITS; Use this to replace the ID to implement voice clone.☆30Sep 16, 2022Updated 3 years ago
- LLaST: Improved End-to-end Speech Translation System Leveraged by Large Language Models☆26Aug 11, 2024Updated last year
- Spherical residual vector quantization (SRVQ)☆31Aug 25, 2024Updated last year
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Korean phoneme dictionary generator for training Montreal Forced Aligner (MFA)☆13Feb 27, 2021Updated 5 years ago
- ☆11May 7, 2022Updated 4 years ago
- NISQA - Non-Intrusive Speech Quality and TTS Naturalness Assessment☆16Apr 13, 2022Updated 4 years ago
- ☆81Aug 8, 2025Updated 9 months ago
- A Benchmark Corpus for Low-Resource Cantonese Punctuation Restoration from Speech Transcripts☆16Dec 3, 2024Updated last year
- magicspeech competition recipe☆18Jun 29, 2020Updated 5 years ago
- NU-Wave 2: A General Neural Audio Upsampling Model for Various Sampling Rates [WIP]☆25Jul 5, 2022Updated 3 years ago