nicolalandro / train_coqui_tts_ita
My guide to creating an Italian TTS with Coqui
☆14Updated 3 years ago
Alternatives and similar repositories for train_coqui_tts_ita:
Users that are interested in train_coqui_tts_ita are comparing it to the libraries listed below
- Interface for using TTS and vocoder models in the form of a text editor☆20Updated 2 years ago
- Kaldi-style neural network training in PyTorch for use in place of nnet3 in Kaldi.☆26Updated 8 months ago
- PyTorch implementation of the paper "MultiSpeech: Multi-Speaker Text to Speech with Transformer"☆22Updated 2 years ago
- ☆17Updated last year
- Prosodic Speech Segmentation with Transformers☆25Updated last year
- A multilingual phoneme recognizer capable of generalizing zero-shot to unseen phoneme inventories.☆20Updated last week
- Multilingual Grapheme to Phoneme☆49Updated 9 years ago
- asr2k☆49Updated 9 months ago
- [IJCAI'23] Learning to Speak from Text for Low-Resource TTS☆63Updated last year
- A deep-learning-powered accessibility application that turns PDFs into audio files. Features OCR improvement and TTS with inflection!☆23Updated last month
- A simple voice conversion tool☆17Updated 3 years ago
- Speaker change detection using SincNet and an LSTM/Transformer☆48Updated 8 months ago
- A repository for dictionaries to be used with the Prosodylab-Aligner☆17Updated 10 years ago
- Coqui STT Model Manager - install, manage and try out Coqui STT models from the Model Zoo☆25Updated 2 years ago
- ☆17Updated 3 years ago
- A collection of utilities for handling IPA phones.☆25Updated last year
- Linguistic processing for Common Voice☆55Updated last year
- Grapheme-to-phoneme (G2P) conversion is the process of generating pronunciation for words based on their written form. It has a highly es…☆19Updated 3 years ago
- Simple PyTorch Denoisers for Waveform Audio☆34Updated last month
- SpeechGLUE is a speech version of the GLUE benchmark, driven by text-to-speech.☆13Updated last year
- ☆45Updated 2 years ago
- Generate audio datasets for training Text-To-Speech models, through smart audio splitting with silence detection, and transcription using…☆28Updated last year
- ☆42Updated 2 years ago
- ☆56Updated 2 years ago
- PyTorch Implementation of Google's Natural TTS Synthesis by Conditioning WaveNet on Mel Spectrogram Predictions. This implementation supp…☆48Updated last year
- Torch implementation of Whisper-guided, DDPM-based voice conversion☆49Updated 2 years ago
- ☆8Updated last year
- Scripts to align a given wave to its transcription using models trained with Kaldi☆32Updated 5 years ago
- Incorporating a KenLM language model into the HuggingFace implementation of the Wav2Vec2CTC model using beam search decoding☆75Updated 3 years ago
- Zero-shot multimodal punctuation insertion and truecasing using Whisper☆110Updated 2 years ago