Towards Building Text-To-Speech Systems for the Next Billion Users - Microsoft Research Intern Work - Accepted at ICASSP 2023
☆57 · May 7, 2023 · Updated 3 years ago
Alternatives and similar repositories for text2speech
Users that are interested in text2speech are comparing it to the libraries listed below.
- The Code shows How to Transcribe Audio to text using the fairseq_meta_mms (Google Colab Version) ☆19 · May 25, 2023 · Updated 3 years ago
- Indic-Conformer models for ASR ☆19 · Jul 19, 2024 · Updated last year
- A streaming whisper server for on-prem transcription ☆23 · Aug 15, 2024 · Updated last year
- Text-to-Speech for languages of India ☆374 · Nov 8, 2024 · Updated last year
- Whisper finetuned on VinBigdata-VLSP2020-100h + KenLM ☆38 · Oct 6, 2023 · Updated 2 years ago
- This repository shows how to implement a basic model for multimodal entailment. ☆10 · Aug 17, 2021 · Updated 4 years ago
- A Catalog lists instruction sets, models available for Indic language ☆10 · Mar 14, 2024 · Updated 2 years ago
- This repository contains the HiNER dataset released with our paper at LREC 2022 ☆16 · Jun 6, 2023 · Updated 3 years ago
- Text to Speech for Indic languages ☆53 · Mar 23, 2022 · Updated 4 years ago
- Cricket analytics for humans ☆12 · Sep 4, 2022 · Updated 3 years ago
- ☆45 · Dec 15, 2022 · Updated 3 years ago
- My hybrid TTS network that combines, VALL-E, VoiceBox, SpeechFlow, Seamless and TortoiseTTS into one ☆26 · Aug 5, 2024 · Updated last year
- Search any gif from Giphy API ☆12 · Oct 28, 2024 · Updated last year
- Official implementation of the paper "Evading Forensic Classifiers with Attribute-Conditioned Adversarial Faces" (CVPR 23) ☆46 · Jan 24, 2024 · Updated 2 years ago
- Adds timm pretrained backbone to pytorch's FasterRcnn model ☆12 · Jan 25, 2024 · Updated 2 years ago
- Codes and datasets for our ICASSP2023 paper, Evaluating parameter-efficient transfer learning approaches on SURE benchmark for speech und… ☆43 · Mar 12, 2023 · Updated 3 years ago
- TaiYiXLCheckpointLoader: An unoffical node support Taiyi-Diffusion-XL(Taiyi-XL) Chinese-English bilingual language model ☆11 · Sep 1, 2024 · Updated last year
- Fast kernel library for Diffusion inference with multiple compute backends. ☆103 · Jun 9, 2026 · Updated last week
- ☆10 · Sep 19, 2023 · Updated 2 years ago
- ☆24 · May 5, 2022 · Updated 4 years ago
- Model Fusion Based Prosody Prediction ☆17 · Mar 18, 2018 · Updated 8 years ago
- Pretraining, fine-tuning and evaluation scripts for Indic-Wav2Vec2 ☆114 · Aug 28, 2025 · Updated 9 months ago
- Code repository for the BMVC 2022 paper: Geometry Driven Progressive Warping for One Shot Face Animation ☆12 · Jan 6, 2023 · Updated 3 years ago
- Generate a UUID on all Django requests for traceability ☆14 · Jul 31, 2018 · Updated 7 years ago
- Authors official PyTorch implementation of the "HyperReenact: One-Shot Reenactment via Jointly Learning to Refine and Retarget Faces" [IC… ☆85 · Sep 28, 2023 · Updated 2 years ago
- AlphaFold2 and RoseTTAFold predictions of the SARS-CoV-2 B.1.1.529 variant Spike protein with HADDOCK antibody interactions ☆12 · Feb 10, 2023 · Updated 3 years ago
- This is the LAION repository for creating open super-resolution models with the help of LAION-5B subsets. ☆13 · May 9, 2022 · Updated 4 years ago
- ☆11 · Jul 31, 2022 · Updated 3 years ago
- DiffSinger: Singing Voice Synthesis via Shallow Diffusion Mechanism (SVS & TTS); AAAI 2022; Official code ☆10 · Mar 8, 2022 · Updated 4 years ago
- Text to speech is an emerging zone of AI. This repository helps to create a dataset with audio and transcripts for personalized text to s… ☆28 · Mar 14, 2023 · Updated 3 years ago
- Angular 15 Auth Boilerplate - Sign Up with Verification, Login and Forgot Password ☆13 · Apr 28, 2023 · Updated 3 years ago
- Machine Translation from English to Odia language. ☆10 · Aug 9, 2021 · Updated 4 years ago
- Source code of paper <End-to-End Language Diarization for Bilingual Code-switching Speech> ☆19 · Jan 23, 2022 · Updated 4 years ago
- Indic TTS for Indian Languages: This is a project on developing text-to-speech (TTS) synthesis systems for Indian languages, improving qu… ☆57 · Feb 5, 2026 · Updated 4 months ago
- Real-time 'Code Red' rocket alerts in Israel ☆16 · Updated this week
- Torch-based tool for quantizing high-dimensional vectors using additive codebooks ☆54 · May 25, 2022 · Updated 4 years ago
- A ComfyUI image generation integration for oobabooga's Text Generation WebUI ☆15 · Aug 12, 2025 · Updated 10 months ago
- Personalization with deep learning in 100 lines of code ☆15 · Mar 31, 2023 · Updated 3 years ago
- ComfyUI ShadowR Wrapper ☆16 · Feb 21, 2025 · Updated last year