PyTorch implementation of the paper MultiSpeech: Multi-Speaker Text to Speech with Transformer
☆21 · Jun 23, 2022 · Updated 3 years ago
Alternatives and similar repositories for MultiSpeech
Users who are interested in MultiSpeech are comparing it to the libraries listed below.
- SpeechPlus: Small LLM-Based Text-to-Speech Library ☆20 · May 20, 2025 · Updated 10 months ago
- Code for the winning solution in the SE&R 2022 Challenge - SER track. ☆16 · Mar 28, 2023 · Updated 3 years ago
- Implementation of Multi speaker TTS ☆51 · Jan 2, 2021 · Updated 5 years ago
- Researchers who published code, models (in some cases), and demo apps (in a few cases) along with their SOTA papers ☆12 · Oct 19, 2023 · Updated 2 years ago
- Vocoder-Free Non-Parallel Conversion of Whispered Speech With Masked Cycle-Consistent Generative Adversarial Networks ☆17 · Aug 18, 2023 · Updated 2 years ago
- S3PRL-VC: A Voice Conversion Toolkit based on S3PRL ☆101 · Mar 15, 2026 · Updated last month
- Supervoice diffusion enhance ☆28 · Jul 15, 2024 · Updated last year
- This repository implements a novel zero-shot TTS framework, named Flamed-TTS, focusing on the efficient generation and dynamic pacing in … ☆57 · Aug 9, 2025 · Updated 8 months ago
- ☆54 · Jul 16, 2025 · Updated 9 months ago
- ☆82 · Jan 22, 2025 · Updated last year
- ☆25 · Mar 12, 2022 · Updated 4 years ago
- My hybrid TTS network that combines VALL-E, VoiceBox, SpeechFlow, Seamless and TortoiseTTS into one ☆26 · Aug 5, 2024 · Updated last year
- X-E-Speech: Joint Training Framework of Non-Autoregressive Cross-lingual Emotional Text-to-Speech and Voice Conversion ☆112 · Apr 1, 2024 · Updated 2 years ago
- PyTorch implementation of Listen, Attend and Spell (LAS) speech recognition paper ☆12 · Mar 4, 2022 · Updated 4 years ago
- ☆13 · Apr 14, 2024 · Updated 2 years ago
- ☆37 · May 8, 2021 · Updated 4 years ago
- Vox-Profile Benchmark ☆75 · Feb 16, 2026 · Updated 2 months ago
- Implementation of Emo-StarGAN ☆46 · Dec 19, 2023 · Updated 2 years ago
- Neural text to speech system that uses eSpeak as a text/phoneme front-end ☆16 · Oct 20, 2021 · Updated 4 years ago
- ☆26 · Sep 22, 2022 · Updated 3 years ago
- Automatically transcribe audio/video into high-quality, speaker-specific Text-To-Speech datasets ✨ ☆17 · May 20, 2025 · Updated 10 months ago
- text to speech ☆10 · Mar 19, 2024 · Updated 2 years ago
- Code for the paper "MULTI-BAND MASKING FOR WAVEFORM-BASED SINGING VOICE SEPARATION", which was accepted at EUSIPCO 2022 ☆15 · Jun 18, 2022 · Updated 3 years ago
- TTS for Singlish using Tacotron2, the IMDA corpus, and Pachyderm. ☆11 · Jan 11, 2020 · Updated 6 years ago
- Aiming to achieve ultimate Multilingual TTS pipeline with main focus on releasing COQUI 🐸TTS (Text-to-Speech) based high performing neural… ☆44 · Aug 24, 2023 · Updated 2 years ago
- This is the implementation of the paper "VAW-GAN for Singing Voice Conversion with Non-parallel Training Data". ☆17 · Aug 12, 2020 · Updated 5 years ago
- A streamable speech recognition model with transformer encoders and RNN-T loss ☆11 · Mar 1, 2021 · Updated 5 years ago
- [ICASSP 2020] Speech Emotion Recognition with Dual-Sequence LSTM Architecture ☆12 · Jan 17, 2025 · Updated last year
- ☆22 · Apr 4, 2023 · Updated 3 years ago
- ☆38 · Nov 18, 2025 · Updated 5 months ago
- Voice Framework ☆18 · Jan 21, 2026 · Updated 2 months ago
- ☆68 · Aug 16, 2023 · Updated 2 years ago
- This is the repository for the work "BridgeVoC: Revitalizing Neural Vocoder from a Restoration Perspective". ☆64 · Nov 5, 2025 · Updated 5 months ago
- Please visit: https://thuhcsi.github.io/icassp2021-emotion-tts/ ☆34 · Mar 17, 2023 · Updated 3 years ago
- This repository provides a multi-mode and multi-speaker expressive speech synthesis framework, including multi-attentive Tacotron, DurIAN… ☆74 · Sep 21, 2022 · Updated 3 years ago
- PyTorch Implementation of GenerSpeech (NeurIPS'22): a text-to-speech model towards zero-shot style transfer of OOD custom voice. ☆330 · Feb 9, 2024 · Updated 2 years ago
- Implementation of Transfer Learning from Speaker Verification to Multi-speaker Text-To-Speech Synthesis (SV2TTS) in Persian language. ☆13 · Oct 2, 2025 · Updated 6 months ago
- ☆18 · Aug 23, 2024 · Updated last year
- E2E TTS using Conditional Flow Matching (Experimental*) ☆71 · Nov 10, 2023 · Updated 2 years ago