fabianoluzbr/neural-g2p-portuguese

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/fabianoluzbr/neural-g2p-portuguese)

fabianoluzbr / neural-g2p-portuguese

Grapheme-to-phoneme (G2P) conversion is the process of generating pronunciation for words based on their written form. It has a highly essential role for natural language processing, text-to-speech synthesis and automatic speech recognition systems. This project was adapted from https://github.com/hajix/G2P.

☆19

Alternatives and similar repositories for neural-g2p-portuguese

Users that are interested in neural-g2p-portuguese are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

zjlww / dsp
View on GitHub
Digital Speech Processing in PyTorch.
☆15Aug 12, 2022Updated 3 years ago
shinhyeokoh / rwen
View on GitHub
☆14Jun 16, 2023Updated 3 years ago
nc-ai / speech
View on GitHub
☆17Aug 27, 2025Updated 11 months ago
Jackson-Kang / Prosody-augmentation-for-Text-to-speech
View on GitHub
Simple tool for speech dataset augmentation for modeling various prosodies.
☆14Jan 14, 2021Updated 5 years ago
ttslr / MonTTS
View on GitHub
☆16Dec 23, 2021Updated 4 years ago
GPUs on demand by Runpod - Special Offer Available • Ad
Run AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
NTRLab / MediaSpeech
View on GitHub
☆22Jul 22, 2022Updated 4 years ago
jlian2 / Robust-Voice-Style-Transfer
View on GitHub
Demo for 2022 ICASSP
☆64Jun 14, 2022Updated 4 years ago
babua / TTSDatasetRecorder
View on GitHub
A simple app for recording speech datasets.
☆26Jun 27, 2022Updated 4 years ago
sil-ai / tts-singlish
View on GitHub
TTS for Singlish using Tacotron2, the IMDA corpus, and Pachyderm.
☆11Jan 11, 2020Updated 6 years ago
thu-spmi / CTC-TTS
View on GitHub
Code for CTC-TTS: LLM-based dual-streaming text-to-speech with CTC alignment, Interspeech 2026.
☆20Jun 9, 2026Updated last month
alefiury / SE-R-2022-SER-Track
View on GitHub
Code for the winning solution in the SE&R 2022 Challenge - SER track.
☆16Mar 28, 2023Updated 3 years ago
zhaoyi2 / CVTE_chain_model_finetune
View on GitHub
finetune the chain model based on cvte open source model without traing any GMM for frame alignment
☆12Aug 6, 2020Updated 5 years ago
mzboito / IWSLT2022_Tamasheq_data
View on GitHub
Repository for sharing the data in the Tamasheq language, one of the target languages for the low-resource speech translation track at IW…
☆18Nov 30, 2022Updated 3 years ago
Jackson-Kang / MFARunner
View on GitHub
A simple tool to easily use Montreal Forced Aligner. Also provide alignment(TextGrid) retrieved from ESD.
☆45May 25, 2023Updated 3 years ago
Simple, predictable pricing with DigitalOcean hosting • Ad
Always know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
neosapience / editts
View on GitHub
Official implementation of EdiTTS: Score-based Editing for Controllable Text-to-Speech (INTERSPEECH 2022)
☆122Jan 24, 2023Updated 3 years ago
NeuroWave-ai / CUCVAE-TTS
View on GitHub
☆25Mar 12, 2022Updated 4 years ago
ljuvela / GELP
View on GitHub
☆27Apr 21, 2021Updated 5 years ago
innnky / FreeSVC
View on GitHub
基于FreeVC的歌声转换
☆21Dec 16, 2022Updated 3 years ago
dathudeptrai / FastSpeech2
View on GitHub
A Tensorflow Implementation of the FastSpeech 2: Fast and High-Quality End-to-End Text to Speech
☆11Aug 12, 2020Updated 5 years ago
samsad35 / source-filter-vae
View on GitHub
[SpeechCom Journal] Learning and controlling the source-filter representation of speech with a variational autoencoder
☆46Apr 18, 2023Updated 3 years ago
pzelasko / kaldialign
View on GitHub
Python wrappers for Kaldi Levenshtein's distance and alignment code.
☆70Jun 15, 2026Updated last month
pilot7747 / VoxDIY
View on GitHub
This repository provides data and code for "Vox Populi, Vox DIY: Benchmark Dataset for Crowdsourced Audio Transcription" paper.
☆16Jul 22, 2021Updated 5 years ago
interactiveaudiolab / ppgs
View on GitHub
High-Fidelity Neural Phonetic Posteriorgrams
☆124Feb 23, 2025Updated last year
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
rmarcacini / ser-coraa-pt-br
View on GitHub
Emotion Recognition from Brazilian Portuguese Informal Spontaneous Speech
☆22Mar 21, 2022Updated 4 years ago
laurenceyoon / real-time-lyrics-alignment
View on GitHub
Codebase for 'A Real-Time Lyrics Alignment System Using Chroma And Phonetic Features For Classical Vocal Performance', ICASSP 2024
☆14Oct 4, 2024Updated last year
sony / bigvsan_eval
View on GitHub
Evaluation tool used in the BigVSAN paper
☆14Mar 22, 2024Updated 2 years ago
rishikksh20 / Zero-Shot-TTS
View on GitHub
Unofficial Implementation of Zero-Shot Text-to-Speech for Text-Based Insertion in Audio Narration
☆34Sep 24, 2021Updated 4 years ago
gpu-poor / gramvaani_hindi_asr
View on GitHub
This repo contains the baseline model recipes and pre-trained model for GramVanni hindi ASR challenge
☆16Mar 26, 2022Updated 4 years ago
zldzmfoq12 / VCtube
View on GitHub
A pakage for crawling audio from Youtube
☆42Aug 8, 2023Updated 2 years ago
Kyubyong / kss
View on GitHub
☆70Jan 7, 2021Updated 5 years ago
Team-BRAPA / BRAPA
View on GitHub
A Brazilian Portuguese Phonetic Alphabet made for Synthesizers
☆10Jul 6, 2026Updated 3 weeks ago
ex3ndr / supervoice-gpt
View on GitHub
GPT-style network for phonemization with durations of text
☆68Mar 21, 2024Updated 2 years ago
Serverless GPU API endpoints on Runpod - Get Bonus Credits • Ad
Skip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
openaudiolab / LLaST
View on GitHub
LLaST: Improved End-to-end Speech Translation System Leveraged by Large Language Models
☆26Aug 11, 2024Updated last year
PlayVoice / VI-Speaker
View on GitHub
Speaker embedding for VI-SVC and VI-SVS, alse for VITS; Use this to replace the ID to implement voice clone.
☆30Sep 16, 2022Updated 3 years ago
Jackson-Kang / Korean-phoneme-dictionary-generator
View on GitHub
Korean phoneme dictionary generator for training Montreal Forced Aligner (MFA)
☆13Feb 27, 2021Updated 5 years ago
yluo42 / SRVQ
View on GitHub
Spherical residual vector quantization (SRVQ)
☆31Aug 25, 2024Updated last year
cyhuang-tw / robust-vc
View on GitHub
☆11May 7, 2022Updated 4 years ago
ffxiong / uaspeech
View on GitHub
Baseline kaldi script for UA-SPEECH corpus
☆32Oct 16, 2024Updated last year
cpii-cai / PunCantonese
View on GitHub
A Benchmark Corpus for Low-Resource Cantonese Punctuation Restoration from Speech Transcripts
☆15Dec 3, 2024Updated last year