nicolalandro / train_coqui_tts_ita
My guide to create an italian TTS with Coqui
☆12Updated 2 years ago
Related projects: ⓘ
- [IJCAI'23] Learning to Speak from Text for Low-Resource TTS☆64Updated last year
- pytorch implementation for MultiSpeech: Multi-Speaker Text to Speech with Transformer paper☆17Updated 2 years ago
- PyTorch Implementation of Google's Natural TTS Synthesis by Conditioning WaveNet on Mel Spectrogram Predictions. This implementation supp…☆45Updated last year
- Interface for using TTS and vocoder models in the form of a text editor☆19Updated last year
- ☆17Updated last year
- Linguistic processing for Common Voice☆50Updated 8 months ago
- asr2k☆48Updated 3 months ago
- A deep-learning powered accessibility application which turns pdfs into audio files. Featuring ocr improvement and tts with inflection!☆23Updated last year
- Kaldi style neural network training in pytorch for use in place of nnet3 in Kaldi.☆26Updated last month
- Goodness of Pronunciation using Kaldi on Epa-DB database☆33Updated 8 months ago
- Putting flows on top of neural transducers for better TTS☆63Updated last month
- ☆56Updated last year
- A simple voice conversion tool☆15Updated 2 years ago
- Zero-shot multimodal punctuation insertion and truecasing using Whisper☆95Updated last year
- Grapheme to phoneme model for PyTorch☆38Updated 2 years ago
- ☆38Updated 2 years ago
- Python module to clean and transliterate (i.e. normalize) German text including abbreviations, numbers, timestamps etc. It can be used to…☆29Updated 3 years ago
- Byte-based multilingual transformer TTS for low-resource/few-shot language adaptation.☆89Updated 2 years ago
- Speaker change detection using SincNet and an LSTM/Transformer☆39Updated 2 months ago
- Zero-Shot Foreign Accent Conversion without a Native Reference☆27Updated 4 months ago
- This is the official repository for the HUI-Audio-Corpus-German. The corresponding paper is in the process of publication. With the repo…☆26Updated last year
- A data annotation pipeline to generate high-quality, large-scale speech datasets with machine pre-labeling and fully manual auditing.☆100Updated last year
- Dataset of ICASSP 2021 MULTILINGUAL PHONETIC DATASET FOR LOW RESOURCE SPEECH RECOGNITION☆34Updated last year
- ☆19Updated 5 years ago
- 56 language, 1 model Multilingual ASR☆23Updated 3 years ago
- ☆25Updated 2 years ago
- Autovocoder: Fast Waveform Generation from a Learned Speech Representation using Differentiable Digital Signal Processing☆67Updated last year
- ☆75Updated 3 months ago
- ☆13Updated 2 years ago
- Incorporating KenLM language model with HuggingFace implementation of Wav2Vec2CTC Model using beam search decoding☆71Updated 2 years ago