mobassir94 / comprehensive-bangla-tts
Aims to build a multilingual TTS pipeline, with the main focus on releasing the first high-performing neural voice cloning systems for Bangla based on COQUI🐸TTS (Text-to-Speech). It supports several SOTA models for Bangla as well as a multilingual (Arabic + Bengali) code-mixed TTS pipeline.
☆38 · Updated last year
Alternatives and similar repositories for comprehensive-bangla-tts:
Users who are interested in comprehensive-bangla-tts are comparing it to the repositories listed below.
- Transformer-based Bangla Speech Recognition ☆52 · Updated last year
- ☆43 · Updated 2 years ago
- Bangla Unicode Normalization ☆18 · Updated 9 months ago
- KATube is a tool to automate the process of creating datasets for training Text-To-Speech (TTS) and Speech-To-Text (STT) models. From a l… ☆22 · Updated 7 months ago
- Text to Speech for Indic languages ☆50 · Updated 2 years ago
- Final training script from the HuggingFace Whisper fine-tuning event, for getting the best results from a fine-tuned model. ☆12 · Updated 2 years ago
- ☆44 · Updated 2 years ago
- Simple PyTorch Denoisers for Waveform Audio ☆34 · Updated last week
- Bangla text to speech: a multilingual (Bangla, English) speech synthesis library that runs in almost real time on a GPU ☆88 · Updated 4 months ago
- Generative voice cloning model using TTS synthesis with state-of-the-art Zero-Shot Multi-Speaker functionality. A web API built with the… ☆47 · Updated 2 years ago
- An espeak-compatible, permissively-licensed IPA phonemizer (G2P) based on DeepPhonemizer. Usable as a drop-in replacement for espeak's GP… ☆93 · Updated 4 months ago
- Convert English text from written expressions into spoken forms ☆24 · Updated 2 years ago
- PyTorch implementation of the paper "MultiSpeech: Multi-Speaker Text to Speech with Transformer" ☆22 · Updated 2 years ago
- Whisper finetuned on VinBigdata-VLSP2020-100h + KenLM ☆38 · Updated last year
- TTS Client for Coqui TTS server ☆13 · Updated 2 years ago
- This Python module is an easy-to-use port of the text normalization used in the paper "Not low-resource anymore: Aligner ensembling, batc… ☆35 · Updated 9 months ago
- Speech recognition using the Kaldi framework ☆12 · Updated 5 years ago
- PyTorch Implementation of Google's Natural TTS Synthesis by Conditioning WaveNet on Mel Spectrogram Predictions. This implementation supp… ☆48 · Updated last year
- Generate audio datasets for training Text-To-Speech models, through smart audio splitting with silence detection, and transcription using… ☆28 · Updated last year
- A TTS model that makes a speaker speak new languages ☆76 · Updated 8 months ago
- Zero-shot Audio Classification using Whisper ☆80 · Updated 2 years ago
- Swarah: Indian-English speech dataset collected across the country ☆28 · Updated last year
- This app is intended to automatically create a corpus for ASR systems using pseudo-labeling. ☆27 · Updated last year
- Simple text to phonemes converter for multiple languages ☆20 · Updated 2 years ago
- Finetune VITS and MMS using HuggingFace's tools ☆134 · Updated 11 months ago
- Implementation of Google's USM speech model in PyTorch ☆30 · Updated last month
- Repository for fine-tuning Transformers 🤗 based seq2seq speech models in JAX/Flax. ☆35 · Updated 2 years ago
- Create training data for training a voice cloner for bark text to speech. ☆43 · Updated last year
- Speaker change detection using SincNet and an LSTM/Transformer ☆47 · Updated 8 months ago
- Automatic Context-Sensitive Spelling Correction for Bangla Text Using BERT and Levenshtein Distance ☆20 · Updated 3 months ago