Aims to build a comprehensive multilingual TTS pipeline, with a main focus on releasing, for the first time, high-performing neural voice cloning systems for Bangla based on Coqui 🐸TTS (Text-to-Speech). Supports several SOTA models for Bangla as well as a multilingual (Arabic + Bengali) code-mixed TTS pipeline.
★44 · Aug 24, 2023 · Updated 2 years ago
Alternatives and similar repositories for comprehensive-bangla-tts
Users who are interested in comprehensive-bangla-tts are comparing it to the libraries listed below.
- Train and fine-tune text-to-speech models for Bengali and many other languages! (★18 · Apr 2, 2025 · Updated 11 months ago)
- Bangla TTS Inference pipeline using Vit TTS (★13 · Mar 24, 2024 · Updated 2 years ago)
- SpeechPlus: Small LLM-Based Text-to-Speech Library (★20 · May 20, 2025 · Updated 10 months ago)
- (★13 · Sep 21, 2022 · Updated 3 years ago)
- PyTorch implementation of the paper "MultiSpeech: Multi-Speaker Text to Speech with Transformer" (★21 · Jun 23, 2022 · Updated 3 years ago)
- [Computer Speech & Language] A transformer-based spelling error correction framework for Bangla and resource-scarce Indic languages (★14 · Aug 9, 2024 · Updated last year)
- Generate audio datasets for training Text-To-Speech models, through smart audio splitting with silence detection, and transcription using… (★30 · May 27, 2023 · Updated 2 years ago)
- This is a side project where my friend and I try to generate synthetic data in Bangla from deepseek-r1, so that it can be used for model di… (★11 · Jun 28, 2025 · Updated 9 months ago)
- Cloudflare worker as a web proxy (★10 · May 4, 2022 · Updated 3 years ago)
- End-to-end Text-to-Speech with Generative Adversarial Networks (★20 · Feb 6, 2021 · Updated 5 years ago)
- Colab notebooks for Next-gen Kaldi (★31 · Oct 12, 2025 · Updated 5 months ago)
- My guide to creating an Italian TTS with Coqui (★14 · Feb 2, 2022 · Updated 4 years ago)
- (★25 · Mar 12, 2022 · Updated 4 years ago)
- CML-TTS: A Multilingual Dataset for Speech Synthesis (★34 · Jul 31, 2024 · Updated last year)
- Neural text-to-speech system that uses eSpeak as a text/phoneme front-end (★16 · Oct 20, 2021 · Updated 4 years ago)
- Text to speech (★10 · Mar 19, 2024 · Updated 2 years ago)
- TTS for Singlish using Tacotron2, the IMDA corpus, and Pachyderm. (★11 · Jan 11, 2020 · Updated 6 years ago)
- Automatic Context-Sensitive Spelling Correction for Bangla Text Using BERT and Levenshtein Distance (★21 · Nov 18, 2024 · Updated last year)
- A list of podcast URLs scraped from the Apple podcast database in late 2021, including a script for downloading those podcasts. (★43 · Mar 9, 2022 · Updated 4 years ago)
- BERT-based Persian spell-checker (★18 · Mar 9, 2024 · Updated 2 years ago)
- Bengali transformer using transformers (★22 · Apr 29, 2025 · Updated 11 months ago)
- Simplified recipes for preparing commonly used speech datasets, and a PyTorch-compatible Python data loader that can perform standard fea… (★15 · Jun 12, 2023 · Updated 2 years ago)
- Implementation of Transfer Learning from Speaker Verification to Multi-speaker Text-To-Speech Synthesis (SV2TTS) for the Persian language. (★13 · Oct 2, 2025 · Updated 5 months ago)
- Bangla Machine Translator based on seq2seq Architecture (★44 · Mar 30, 2022 · Updated 4 years ago)
- (★18 · Apr 28, 2021 · Updated 4 years ago)
- Onset-and-Offset-Aware Sound Event Detection (★21 · Feb 10, 2025 · Updated last year)
- A Grapheme-to-Phoneme model using LSTM, implemented in PyTorch (★13 · Jul 6, 2022 · Updated 3 years ago)
- Writing Bangla in LaTeX with the Overleaf Online Editor (★28 · Aug 12, 2024 · Updated last year)
- Tihu dictionary for the Persian language (★12 · Sep 8, 2019 · Updated 6 years ago)
- llmon-py is a multimodal webui for Llama 3-8B. (★16 · Jul 1, 2024 · Updated last year)
- A real-time implementation of DDSP from Google Magenta. (★16 · Nov 8, 2021 · Updated 4 years ago)
- This is the experimental description of MnTTS2. (★12 · Apr 11, 2024 · Updated last year)
- Fine-tuned Llama 3 models for context-based question answering in the Bengali language. (★18 · Oct 14, 2024 · Updated last year)
- Conformer-based Metric GAN for speech enhancement (★27 · May 3, 2024 · Updated last year)
- This repository contains code used for our Multi Sentence Inference NAACL'22 paper. (★12 · Mar 6, 2023 · Updated 3 years ago)
- How to run Detectron2 on Windows using WSL2 and RTX30xx cards. (★14 · Mar 17, 2021 · Updated 5 years ago)
- 【Join our constellation of stargazers! ⭐️】 An interactive AI-powered story generator that creates dynamic narratives through collaborative … (★13 · Jul 9, 2025 · Updated 8 months ago)
- Clean and modernized implementation of FastSpeech2/LightSpeech using IPA (★18 · Aug 16, 2024 · Updated last year)
- (★14 · Aug 1, 2025 · Updated 8 months ago)