fabianoluzbr / neural-g2p-portuguese
Grapheme-to-phoneme (G2P) conversion is the process of generating pronunciation for words based on their written form. It has a highly essential role for natural language processing, text-to-speech synthesis and automatic speech recognition systems. This project was adapted from https://github.com/hajix/G2P.
☆19Updated 3 years ago
Alternatives and similar repositories for neural-g2p-portuguese:
Users that are interested in neural-g2p-portuguese are comparing it to the libraries listed below
- ☆15Updated 3 years ago
- DUSTED: Spoken-Term Discovery using Discrete Speech Units☆15Updated 4 months ago
- ☆23Updated 8 months ago
- Aty-TTS: Improving fairness for spoken language understanding in atypical speech with Text-to-Speech☆10Updated last year
- Code for paper titled "Perception of prosodic variation for speech synthesis using an unsupervised discrete representation of F0" submitt…☆16Updated 4 years ago
- Speech (audio) subjective evaluation system☆37Updated 4 years ago
- ☆36Updated 4 months ago
- Implementation of the paper "BERTphone: Phonetically-aware Encoder Representations for Utterance-level Speaker and Language Recognition"☆17Updated 4 years ago
- Please visit: https://thuhcsi.github.io/icassp2021-emotion-tts/☆34Updated last year
- ☆16Updated 2 years ago
- Code for the paper: How Much Context Does My Attention-Based ASR System Need?☆11Updated 3 weeks ago
- ☆25Updated 6 months ago
- Unsupervised Voice Activity Detection by Modeling Source and System Information using Zero Frequency Filtering☆18Updated last year
- Glow-TTS with Stochastic Duration Predictor and Stochastic Pitch Predictor☆18Updated last year
- This repo contains conv-tasnet for basis-melgan. If you want to get code of basis-melgan, please refer to FastVocoder.☆20Updated 3 years ago
- PyTorch Implementation of Daft-Exprt: Robust Prosody Transfer Across Speakers for Expressive Speech Synthesis☆56Updated 3 years ago
- Code for paper "Using Phonetic Posteriorgram Based Frame Pairing for Segmental Accent Conversion"☆35Updated 5 years ago
- [IJCAI'23] Learning to Speak from Text for Low-Resource TTS☆63Updated last year
- An unofficial PyTorch implementation of Mix-Phoneme-Bert☆39Updated last year
- ☆31Updated last year
- Official implementation of the APSIPA 2022 paper: Exploring Speaker Age Estimation on Different Self-Supervised Learning Models☆14Updated 2 years ago
- Implementation of multi-level Contrastive Predictive Coding (CPC) methods☆19Updated 2 years ago
- This repository contains all the code necessary for running the multilingual distilwhisper from Ferraz et al. 2024 IEEE ICASSP paper.☆20Updated 11 months ago
- ☆40Updated 3 years ago
- Pytorch implementation of "f0-consistent many-to-many non-parallel voice conversion via conditional autoencoder"☆28Updated 4 years ago
- ☆15Updated 4 months ago
- Official implementation of paper: Frame-Wise Breath Detection with Self-Training: An Exploration of Enhancing Breath Naturalness in Text-…☆24Updated 4 months ago
- NISQA - Non-Intrusive Speech Quality and TTS Naturalness Assessment☆16Updated 2 years ago
- ☆30Updated 2 years ago
- A toolset for easy formant extraction and visualization from wav files and TTS models☆30Updated 2 years ago