Edresson / TTS-Portuguese-Corpus
Open Source Text-To-Speech Portuguese Dataset
☆178Updated last year
Alternatives and similar repositories for TTS-Portuguese-Corpus
Users who are interested in TTS-Portuguese-Corpus are comparing it to the repositories listed below
- 🗣️🇧🇷 Transcribed audio datasets in Brazilian Portuguese☆71Updated 6 months ago
- ☆57Updated 2 years ago
- Wav2vec resources and models for Brazilian Portuguese☆36Updated 3 years ago
- Deep learning for Text to Speech☆27Updated 4 years ago
- Emotion Recognition from Brazilian Portuguese Informal Spontaneous Speech☆21Updated 3 years ago
- ☕🇧🇷 Kaldi scripts for Brazilian Portuguese☆59Updated 3 years ago
- This is a legacy repo. Dev occurs now on GitHub.☆11Updated 4 years ago
- ☆204Updated 3 years ago
- Towards an end-to-end speech recognizer for Portuguese using deep neural networks☆22Updated 8 years ago
- HuggingSound: A toolkit for speech-related tasks based on Hugging Face's tools☆468Updated 2 years ago
- An easy way to fine-tune Wav2Vec 2.0 for low-resource languages.☆80Updated 2 years ago
- Tacotron 2 - PyTorch implementation with faster-than-realtime inference modified to enable cross lingual voice cloning.☆360Updated 2 years ago
- Brasil TTS is a set of Brazilian Portuguese speech synthesizers that read screens for visually impaired users. It transforms…☆65Updated 5 years ago
- PyTorch Implementation of FastSpeech 2 : Fast and High-Quality End-to-End Text to Speech☆232Updated 3 years ago
- VCTK multi-speaker tacotron for ICASSP 2020☆266Updated 3 years ago
- VoiceSplit: Targeted Voice Separation by Speaker-Conditioned Spectrogram☆265Updated last year
- phoneme tokenizer and grapheme-to-phoneme model for 8k languages☆173Updated 2 years ago
- Grapheme to phoneme conversion with deep learning.☆419Updated 2 years ago
- ☆263Updated 3 years ago
- Python forced alignment☆94Updated last year
- Simplified diarization pipeline using some pretrained models - audio file to diarized segments in a few lines of code☆154Updated last year
- A data annotation pipeline to generate high-quality, large-scale speech datasets with machine pre-labeling and fully manual auditing.☆106Updated 2 years ago
- SC-GlowTTS: an Efficient Zero-Shot Multi-Speaker Text-To-Speech Model☆107Updated 4 years ago
- ☆80Updated 5 months ago
- Python toolkit for speech processing☆72Updated last week
- Word alignments generated by the Montreal Forced Aligner for the Librispeech dataset☆176Updated 6 years ago
- A better, faster, stronger version of the unbounded interleaved-state recurrent neural network (UIS-RNN)☆62Updated 5 years ago
- A tokenizer, text cleaner, and phonemizer for many human languages.☆332Updated last year
- This repository is an implementation of Transfer Learning from Speaker Verification to Multispeaker Text-To-Speech Synthesis (SV2TTS) wit…☆170Updated 5 years ago
- Provides training, inference and voice conversion recipes for RADTTS and RADTTS++: Flow-based TTS models with Robust Alignment Learning, …☆292Updated 2 years ago