gokulkarthik/text2speech

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/gokulkarthik/text2speech)

gokulkarthik / text2speech

Towards Building Text-To-Speech Systems for the Next Billion Users - Microsoft Research Intern Work - Accepted at ICASSP 2023

☆57

Alternatives and similar repositories for text2speech

Users that are interested in text2speech are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

parvatijay2901 / Hindi-ASR-and-TTS
View on GitHub
EC499: Major Project
☆11Jun 25, 2023Updated 3 years ago
epk2112 / fairseq_meta_mms_Google_Colab_implementation
View on GitHub
The Code shows How to Transcribe Audio to text using the fairseq_meta_mms (Google Colab Version)👇
☆19May 25, 2023Updated 3 years ago
AI4Bharat / indic-asr-api-backend
View on GitHub
Indic-Conformer models for ASR
☆19Jul 19, 2024Updated 2 years ago
AI4Bharat / Indic-TTS
View on GitHub
Text-to-Speech for languages of India
☆378Nov 8, 2024Updated last year
HKAB / whisper-finetune-vietnamese
View on GitHub
Whisper finetuned on VinBigdata-VLSP2020-100h + KenLM
☆38Oct 6, 2023Updated 2 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
EmergingUnicorns / DeepPaint
View on GitHub
☆11Oct 22, 2023Updated 2 years ago
AM-ROBOTS / SaveRestrictedContentBot
View on GitHub
SaveRestrictedContentBot @AM_ROBOTS
☆11Oct 29, 2022Updated 3 years ago
Open-Speech-EkStep / vakyansh-tts
View on GitHub
Text to Speech for Indic languages
☆53Mar 23, 2022Updated 4 years ago
OdiaGenAI / Indic_LLM_Resource_Catalog
View on GitHub
A Catalog lists instruction sets, models available for Indic language
☆10Mar 14, 2024Updated 2 years ago
rajatbansal01 / OIC-2020
View on GitHub
AI and IoT based Smart Parking
☆10Apr 15, 2022Updated 4 years ago
cedrickchee / tch-js
View on GitHub
A JavaScript and TypeScript port of PyTorch C++ library (libtorch) - Node.js N-API bindings for libtorch.
☆17Jan 15, 2023Updated 3 years ago
aifenaike / Intent-Recognition-Using-BERT
View on GitHub
Transformer-based Model to recognize any of 7 unique intents from the Snips personal voice assistant.
☆13Mar 11, 2022Updated 4 years ago
ex3ndr / supervoice-hybrid
View on GitHub
My hybrid TTS network that combines, VALL-E, VoiceBox, SpeechFlow, Seamless and TortoiseTTS into one
☆26Aug 5, 2024Updated last year
koushiksrivats / face_attribute_attack
View on GitHub
Official implementation of the paper "Evading Forensic Classifiers with Attribute-Conditioned Adversarial Faces" (CVPR 23)
☆46Jan 24, 2024Updated 2 years ago
Open source password manager - Proton Pass • Ad
Securely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
RancidBacon / audiogif
View on GitHub
Audio GIFs (.a.gif) -- "Sounds like a Bad Idea."
☆12Jun 7, 2019Updated 7 years ago
lcrojano / Giphy_Explorer
View on GitHub
Search any gif from Giphy API
☆12Oct 28, 2024Updated last year
tsunrise / everybody-compose
View on GitHub
Everybody Compose: Deep Beats To Music
☆12Apr 12, 2023Updated 3 years ago
Valentyn1997 / kg-alignment-lessons-learned
View on GitHub
Implementation of reproducibility paper "Knowledge Graph Entity Alignment with Graph Convolutional Networks: Lessons Learned"
☆16Sep 28, 2020Updated 5 years ago
declare-lab / speech-adapters
View on GitHub
Codes and datasets for our ICASSP2023 paper, Evaluating parameter-efficient transfer learning approaches on SURE benchmark for speech und…
☆43Mar 12, 2023Updated 3 years ago
rameshvarun / codec2-emscripten
View on GitHub
The Codec 2 speech codec, compiled to WASM using Emscripten.
☆14Apr 27, 2023Updated 3 years ago
mzeeshankaramat / SafeAgents
View on GitHub
☆20Jun 4, 2026Updated last month
michaelcpuckett / express-worker
View on GitHub
Express.js ported to a Service Worker context
☆18Mar 6, 2025Updated last year
i8sumPi / bob-the-walking-AI
View on GitHub
A simple, yet effective, walking AI.
☆12Aug 30, 2020Updated 5 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
bekirbakar / replay-attack-detection
View on GitHub
Deep learning-based audio spoofing attack detection experiments for speaker verification.
☆14Apr 20, 2023Updated 3 years ago
jjelosua / ML_audio_classification
View on GitHub
Audio classification using Machine Learning
☆13Dec 17, 2015Updated 10 years ago
notsyncing / ace
View on GitHub
Build Cordova apps with true native UI
☆13May 8, 2021Updated 5 years ago
Panopath / cordova-app-updater
View on GitHub
An easy-to-use, efficient, powerful tool to remote update your cordova app.
☆11Aug 13, 2018Updated 7 years ago
EnkrateiaLucca / openai_whisper
View on GitHub
Python script for my article and Youtube video on building a streamlit app to use whisper for speech-to-text transcription
☆15Mar 17, 2023Updated 3 years ago
souvikg544 / TTS_Data_Maker
View on GitHub
Text to speech is an emerging zone of AI. This repository helps to create a dataset with audio and transcripts for personalized text to s…
☆28Mar 14, 2023Updated 3 years ago
TANQIanQ / Enhance-NeRF
View on GitHub
☆22Jul 11, 2023Updated 3 years ago
Somoy73 / Frontend-UI-Element-Detection-and-Classification
View on GitHub
Detection and Classification of UI Elements of Web pages and Apps from Wireframe Sketches
☆10Oct 9, 2023Updated 2 years ago
ioppermann / ezMPEG
View on GitHub
ezMPEG is an easy-to-use and easy-to-understand MPEG1 video encoder API
☆11Mar 26, 2017Updated 9 years ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
sarulab-speech / whisper-asr-finetune
View on GitHub
☆32Dec 4, 2022Updated 3 years ago
XierHacker / Model_Fusion_Based_Prosody_Prediction
View on GitHub
Model Fusion Based Prosody Prediction
☆17Mar 18, 2018Updated 8 years ago
yataoz / face_reenact_GDPW
View on GitHub
Code repository for the BMVC 2022 paper: Geometry Driven Progressive Warping for One Shot Face Animation
☆12Jan 6, 2023Updated 3 years ago
AI4Bharat / IndicWav2Vec
View on GitHub
Pretraining, fine-tuning and evaluation scripts for Indic-Wav2Vec2
☆117Aug 28, 2025Updated 11 months ago
BH-So / unsupervised-paraphrase-generation
View on GitHub
"Unsupervised Paraphrase Generation using Pre-trained Language Model."
☆22Aug 28, 2020Updated 5 years ago
Fasal-Tech / cordova-app-update-plugin
View on GitHub
In app update support for cordova
☆11Oct 28, 2025Updated 9 months ago
Lhx94As / E2E-language-diarization
View on GitHub
Source code of paper <End-to-End Language Diarization for Bilingual Code-switching Speech>
☆19Jan 23, 2022Updated 4 years ago