fabianoluzbr / neural-g2p-portuguese
Grapheme-to-phoneme (G2P) conversion is the process of generating pronunciation for words based on their written form. It has a highly essential role for natural language processing, text-to-speech synthesis and automatic speech recognition systems. This project was adapted from https://github.com/hajix/G2P.
☆19Updated 3 years ago
Alternatives and similar repositories for neural-g2p-portuguese:
Users that are interested in neural-g2p-portuguese are comparing it to the libraries listed below
- ☆15Updated 3 years ago
- Aty-TTS: Improving fairness for spoken language understanding in atypical speech with Text-to-Speech☆10Updated last year
- ☆40Updated 3 years ago
- DUSTED: Spoken-Term Discovery using Discrete Speech Units☆17Updated 6 months ago
- ☆23Updated 10 months ago
- Pytorch implementation of "f0-consistent many-to-many non-parallel voice conversion via conditional autoencoder"☆29Updated 4 years ago
- An unofficial PyTorch implementation of Mix-Phoneme-Bert☆39Updated last year
- [IJCAI'23] Learning to Speak from Text for Low-Resource TTS☆63Updated last year
- ☆12Updated 2 months ago
- Implementation of the paper "BERTphone: Phonetically-aware Encoder Representations for Utterance-level Speaker and Language Recognition"☆17Updated 4 years ago
- ☆38Updated 7 months ago
- Prosodic Speech Segmentation with Transformers☆25Updated last year
- Source code and demo for INTERPSEECH 2023 paper: DuTa-VC: A Duration-aware Typical-to-atypical Voice Conversion Approach with Diffusion P…☆36Updated last year
- This repository contains laughter-related synthesis systems.☆13Updated 4 years ago
- Python wrappers for Kaldi Levenshtein's distance and alignment code.☆64Updated last year
- ☆31Updated last year
- ☆24Updated 3 years ago
- multilingual speech aligner☆74Updated last year
- Autovocoder: Fast Waveform Generation from a Learned Speech Representation using Differentiable Digital Signal Processing☆70Updated 2 years ago
- This is the source code of the paper "Neural grapheme-to-phoneme conversion with pretrained grapheme models☆46Updated 3 years ago
- Code for paper "Using Phonetic Posteriorgram Based Frame Pairing for Segmental Accent Conversion"☆36Updated 5 years ago
- Baseline kaldi script for UA-SPEECH corpus☆30Updated 6 months ago
- Code for the paper: How Much Context Does My Attention-Based ASR System Need?☆11Updated last month
- This is the implementation our Interspeech 2022 paper " Disentanglement of Emotional Style and Speaker Identity for Expressive Voice Conv…☆18Updated last year
- Just another FastSpeech 2 but cleaner code :)☆26Updated 9 months ago
- This repository contains all the code necessary for running the multilingual distilwhisper from Ferraz et al. 2024 IEEE ICASSP paper.☆21Updated last year
- ☆30Updated 5 months ago
- Sequence alignement methods with helpers for PyTorch.☆24Updated 2 years ago
- ☆30Updated 2 years ago
- SpeechGLUE is a speech version of the GLUE benchmark, driven by text-to-speech.☆13Updated last year