Edresson / TTS-Portuguese-Corpus
Open Source Text-To-Speech Portuguese Dataset
☆157Updated 9 months ago
Related projects ⓘ
Alternatives and complementary repositories for TTS-Portuguese-Corpus
- 🗣️🇧🇷 Bases de áudio transcrito em Português Brasileiro☆49Updated last year
- ☆43Updated last year
- Deep learning for Text to Speech☆26Updated 3 years ago
- Wav2vec resources and models for Brazilian Portuguese☆32Updated 2 years ago
- This is a legacy repo. Dev occurs now on GitHub.☆11Updated 3 years ago
- ☕🇧🇷 Scripts para o Kaldi em Português Brasileiro☆48Updated 2 years ago
- An easy way to fine-tune Wav2Vec 2.0 for low-resource languages.☆81Updated last year
- Brasil TTS é um conjunto de sintetizadores de voz, em português do Brasil, que lê telas para portadores de deficiência visual. Transforma…☆54Updated 4 years ago
- Repository to document results of an Tacotron 2 adaptation for brazilian portuguese.☆17Updated 2 years ago
- Towards an end-to-end speech recognizer for Portuguese using deep neural networks☆23Updated 7 years ago
- Grapheme-to-phoneme (G2P) conversion is the process of generating pronunciation for words based on their written form. It has a highly es…☆18Updated 3 years ago
- ☆11Updated 3 years ago
- Pytorch code of "A new automatic speech recognizer for Brazilian Portuguese based on deep neural networks and transfer learning" submitte…☆21Updated 5 years ago
- A better, faster, stronger version of the unbounded interleaved-state recurrent neural network (UIS-RNN)☆57Updated 4 years ago
- Python toolkit for speech processing☆67Updated this week
- Official repository of STYLER: Style Factor Modeling with Rapidity and Robustness via Speech Decomposition for Expressive and Controllabl…☆158Updated 2 years ago
- ☆77Updated 5 months ago
- This is the M-AILABS Speech Dataset☆16Updated 4 months ago
- Compendium for the paper "Transparent pronunciation scoring using articulatorily weighted phoneme edit distance" by Karhila, Smolander, Y…☆25Updated 5 years ago
- Spot the conversation: speaker diarisation in the wild☆123Updated 2 years ago
- Incorporating KenLM language model with HuggingFace implementation of Wav2Vec2CTC Model using beam search decoding☆71Updated 3 years ago
- Uma base de dados para estudo de regionalismos brasileiros através da voz.☆7Updated last year
- A set of audio augmentation techniques to perform noise insertion in datasets used for Automatic Speech Recognition.☆29Updated 3 years ago
- 🐸TTS recipes for different datasets☆84Updated 2 years ago
- Zero-shot Audio Classification using Whisper☆74Updated last year
- INF0429☆10Updated 4 months ago
- A PyTorch implementation of "Robust Universal Neural Vocoding"☆237Updated 3 years ago
- Tacotron 2 - PyTorch implementation with faster-than-realtime inference modified to enable cross lingual voice cloning.☆358Updated last year
- This is the Python library for an unsupervised, fast method for robust voice activity detection (rVAD), as in the paper rVAD: An Unsuperv…☆125Updated 2 weeks ago
- Code for the winning solution in the SE&R 2022 Challenge - SER track.☆13Updated last year