Towards Building Text-To-Speech Systems for the Next Billion Users - Microsoft Research Intern Work - Accepted at ICASSP 2023
☆57 · May 7, 2023 · Updated 2 years ago
Alternatives and similar repositories for text2speech
Users who are interested in text2speech are comparing it to the libraries listed below.
- Python Hindi Concatenative Based TTS using Phoneme Database ☆25 · Feb 2, 2022 · Updated 4 years ago
- Indic-Conformer models for ASR ☆21 · Jul 19, 2024 · Updated last year
- A JavaScript and TypeScript port of PyTorch C++ library (libtorch) - Node.js N-API bindings for libtorch. ☆16 · Jan 15, 2023 · Updated 3 years ago
- A streaming whisper server for on-prem transcription ☆23 · Aug 15, 2024 · Updated last year
- ☆24 · Sep 1, 2023 · Updated 2 years ago
- My hybrid TTS network that combines VALL-E, VoiceBox, SpeechFlow, Seamless and TortoiseTTS into one ☆26 · Aug 5, 2024 · Updated last year
- Text to Speech for Indic languages ☆52 · Mar 23, 2022 · Updated 3 years ago
- GitHub Action to upload datasets to Kaggle ☆24 · Jan 8, 2026 · Updated last month
- ☆37 · Sep 21, 2025 · Updated 5 months ago
- Fast kernel library for Diffusion inference with multiple compute backends. ☆84 · Jan 24, 2026 · Updated last month
- Real-time 'Code Red' rocket alerts in Israel ☆14 · Updated this week
- AI and IoT based Smart Parking ☆10 · Apr 15, 2022 · Updated 3 years ago
- ☆10 · Sep 10, 2023 · Updated 2 years ago
- Creates a CMM script that can be directly executed on Kaggle from an easy merge script ☆14 · Jan 12, 2026 · Updated last month
- ☆32 · Dec 4, 2022 · Updated 3 years ago
- Acoustic-prosodic entrainment measurement in spoken dialogue and approximation of the evolution of a speaker’s a/p features. ☆12 · Feb 26, 2024 · Updated 2 years ago
- Code for "Out-of-Distribution Detection using Synthetic Data Generation" ☆21 · Feb 6, 2025 · Updated last year
- Codes and datasets for our ICASSP2023 paper, Evaluating parameter-efficient transfer learning approaches on SURE benchmark for speech und… ☆42 · Mar 12, 2023 · Updated 2 years ago
- Phase Vocoder and Wavelet Transform Implementation for Pitch Shifting a sound signal ☆11 · Jul 27, 2020 · Updated 5 years ago
- ☆10 · Jan 10, 2024 · Updated 2 years ago
- Android application that allows the user to locate his position using Wi-Fi. Once the localization is done the user can track his move … ☆11 · Oct 21, 2016 · Updated 9 years ago
- Deep learning-based audio spoofing attack detection experiments for speaker verification. ☆14 · Apr 20, 2023 · Updated 2 years ago
- This repository contains a Python notebook for generating a new set of images from existing images using Generative Adversarial Networks. The… ☆11 · Sep 6, 2019 · Updated 6 years ago
- ☆12 · Jan 11, 2023 · Updated 3 years ago
- Koel Labs innovates open-source speech research, inclusive speech technologies, and real-time pronunciation feedback for language learner… ☆18 · Feb 25, 2026 · Updated last week
- Using large language models to maintain AI_CHANGELOG.md ☆14 · Jul 15, 2024 · Updated last year
- Ionic 3 authentication template/showcase ☆10 · Jun 20, 2017 · Updated 8 years ago
- A voice spoofing detection system, based on paper presented at ICSPIS 2021 ☆10 · Feb 11, 2022 · Updated 4 years ago
- Whisper finetuned on VinBigdata-VLSP2020-100h + KenLM ☆37 · Oct 6, 2023 · Updated 2 years ago
- Byte-based multilingual transformer TTS for low-resource/few-shot language adaptation. ☆87 · Jul 25, 2022 · Updated 3 years ago
- Official implementation of EdiTTS: Score-based Editing for Controllable Text-to-Speech (INTERSPEECH 2022) ☆121 · Jan 24, 2023 · Updated 3 years ago
- ComfyUI ShadowR Wrapper ☆15 · Feb 21, 2025 · Updated last year
- Angular 15 Auth Boilerplate - Sign Up with Verification, Login and Forgot Password ☆13 · Apr 28, 2023 · Updated 2 years ago
- DiffSinger: Singing Voice Synthesis via Shallow Diffusion Mechanism (SVS & TTS); AAAI 2022; Official code ☆10 · Mar 8, 2022 · Updated 3 years ago
- ☆11 · May 9, 2023 · Updated 2 years ago
- Simple CLI frontend for flashcards-core ☆12 · Jul 30, 2021 · Updated 4 years ago
- Tool to apply Legal Matter Specification Standard (LMSS) to documents ☆12 · Aug 15, 2024 · Updated last year
- Cricket analytics for humans 🏏 ☆12 · Sep 4, 2022 · Updated 3 years ago
- ☆10 · Sep 19, 2023 · Updated 2 years ago