Aiming to achieve ultimate Multilingual TTS pipeline with main focus on releasing COQUIπΈTTS(Text-to-Speech) based high performing neural voice cloning systems for Bangla for the first time, supporting different SOTA models for Bangla and also Multilingual (Arabic+Bengali) code mixed TTS pipeline.
β44Aug 24, 2023Updated 2 years ago
Alternatives and similar repositories for comprehensive-bangla-tts
Users that are interested in comprehensive-bangla-tts are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Train and finutune text-to-speech models for Bengali and many other languages!β18Apr 2, 2025Updated last year
- Bangla Unicode Normalizationβ23May 26, 2024Updated 2 years ago
- Bangla TTS Inference pipeline using Vit TTSβ13Mar 24, 2024Updated 2 years ago
- SpeechPlus: Small LLM-Based Text-to-Speech Library πβ21May 20, 2025Updated last year
- β13Sep 21, 2022Updated 3 years ago
- Managed hosting for WordPress and PHP on Cloudways β’ AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- pytorch implementation for MultiSpeech: Multi-Speaker Text to Speech with Transformer paperβ21Jun 23, 2022Updated 3 years ago
- Synthetic data generation for bangla OCRβ18Dec 1, 2022Updated 3 years ago
- CSE476-Machine-Learning-Labβ17Jul 1, 2023Updated 2 years ago
- Generate audio datasets for training Text-To-Speech models, through smart audio splitting with silence detection, and transcription usingβ¦β30May 27, 2023Updated 3 years ago
- End-to-end Text-to-Speech with Generative Adversarial Networksβ20Feb 6, 2021Updated 5 years ago
- β25Mar 12, 2022Updated 4 years ago
- Colab notebooks for Next-gen Kaldiβ31Oct 12, 2025Updated 7 months ago
- Neural text to speech system that uses eSpeak as a text/phoneme front-endβ16Oct 20, 2021Updated 4 years ago
- text to speechβ10Mar 19, 2024Updated 2 years ago
- Virtual machines for every use case on DigitalOcean β’ AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- CML-TTS: A Multilingual Dataset for Speech Synthesisβ36Jul 31, 2024Updated last year
- TTS for Singlish using Tacotron2, the IMDA corpus, and Pachyderm.β11Jan 11, 2020Updated 6 years ago
- Automatic Context Sensitive Spelling Correction for Bangla Text Using Bert and Levenstein Distanceβ21Nov 18, 2024Updated last year
- A list of podcast URLs scraped from the Apple podcast database in late 2021, including a script for downloading those podcasts.β44Mar 9, 2022Updated 4 years ago
- Bengali transformer using transformersβ22Apr 29, 2025Updated last year
- Simplified recipes for preparing commonly used speech datasets, and a PyTorch-compatible Python data loader that can perform standard feaβ¦β16Jun 2, 2026Updated last week
- Implementation of Transfer Learning from Speaker Verification to Multi-speaker Text-To-Speech Synthesis (SV2TTS) in Persian language.β13Oct 2, 2025Updated 8 months ago
- Bangla Machine Translator based on seq2seq Architectureβ45Mar 30, 2022Updated 4 years ago
- β18Apr 28, 2021Updated 5 years ago
- Managed Database hosting by DigitalOcean β’ AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Onset-and-Offset-Aware Sound Event Detectionβ21Feb 10, 2025Updated last year
- Transformer based Bangla Speech Recognition | Encoder Decoder Architectureβ61Apr 14, 2023Updated 3 years ago
- Tihu dictionary for Persian languageβ12Sep 8, 2019Updated 6 years ago
- llmon-py is a multimodal webui for Llama 3-8B.β15Jul 1, 2024Updated last year
- A Grapheme to Phoneme model using LSTM implemented in pytorchβ14Jul 6, 2022Updated 3 years ago
- A real time implementation of the ddsp from google magenta.β16Nov 8, 2021Updated 4 years ago
- Conformer-based Metric GAN for speech enhancementβ27May 3, 2024Updated 2 years ago
- This repository contains code used for our Multi Sentence Inference NAACL'22 paper.β12Mar 6, 2023Updated 3 years ago
- γJoin our constellation of stargazers!βοΈγAn interactive AI-powered story generator that creates dynamic narratives through collaborative β¦β13Updated this week
- Deploy on Railway without the complexity - Free Credits Offer β’ AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Clean and modernized implementation of FastSpeech2/LightSpeech using IPAβ18Aug 16, 2024Updated last year
- Bangla text to speech, Multilingual (Bangla, English) real-time speech synthesis libraryβ92Oct 17, 2024Updated last year
- β14Aug 1, 2025Updated 10 months ago
- A framework for Arabic spelling correction using different seq2seq model architectures such as transformers and RNNsβ25Jul 21, 2024Updated last year
- Vocoder-Free Non-Parallel Conversion of Whispered Speech With Masked Cycle-Consistent Generative Adversarial Networksβ17Aug 18, 2023Updated 2 years ago
- This is a command-line utility tool to fetch GeoLocation information for a given IP Address.β18Mar 23, 2020Updated 6 years ago
- StyleTTS2 + Vocos as a Decoderβ13Mar 24, 2025Updated last year