msalhab96 / MultiSpeech
PyTorch implementation of the paper "MultiSpeech: Multi-Speaker Text to Speech with Transformer"
☆21 · Jun 23, 2022 · Updated 3 years ago
Alternatives and similar repositories for MultiSpeech
Users that are interested in MultiSpeech are comparing it to the libraries listed below
- SpeechPlus: Small LLM-Based Text-to-Speech Library ☆20 · May 20, 2025 · Updated 8 months ago
- Vocoder-Free Non-Parallel Conversion of Whispered Speech With Masked Cycle-Consistent Generative Adversarial Networks ☆17 · Aug 18, 2023 · Updated 2 years ago
- Code for the winning solution in the SE&R 2022 Challenge - SER track. ☆16 · Mar 28, 2023 · Updated 2 years ago
- Code for the paper "MULTI-BAND MASKING FOR WAVEFORM-BASED SINGING VOICE SEPARATION" that was accepted on EUSIPCO2022 ☆15 · Jun 18, 2022 · Updated 3 years ago
- Implementation of Multi speaker TTS ☆51 · Jan 2, 2021 · Updated 5 years ago
- Aiming to achieve ultimate Multilingual TTS pipeline with main focus on releasing COQUI 🐸TTS (Text-to-Speech) based high performing neural… ☆42 · Aug 24, 2023 · Updated 2 years ago
- S3PRL-VC: A Voice Conversion Toolkit based on S3PRL ☆101 · Jun 26, 2024 · Updated last year
- ☆52 · Jul 16, 2025 · Updated 6 months ago
- My hybrid TTS network that combines, VALL-E, VoiceBox, SpeechFlow, Seamless and TortoiseTTS into one ☆26 · Aug 5, 2024 · Updated last year
- Supervoice diffusion enhance ☆28 · Jul 15, 2024 · Updated last year
- Vox-Profile Benchmark ☆67 · Sep 12, 2025 · Updated 5 months ago
- MSP-Podcast Challenge Baseline Code for Interspeech 2025 ☆28 · Dec 4, 2024 · Updated last year
- KABooks is a tool to automate the process of creating datasets for training Text-To-Speech (TTS) and Speech-To-Text (STT) models. Using a… ☆12 · Mar 24, 2023 · Updated 2 years ago
- text to speech ☆10 · Mar 19, 2024 · Updated last year
- ☆32 · Nov 18, 2025 · Updated 2 months ago
- Researchers who published code, models (in some cases), and demo apps (in few cases) along with their SOTA paper ☆12 · Oct 19, 2023 · Updated 2 years ago
- Onset-and-Offset-Aware Sound Event Detection ☆20 · Feb 10, 2025 · Updated last year
- A Model (maybe an app) that translates the audio of a video from one language to another language, cloning the voice of original video wi… ☆15 · May 19, 2025 · Updated 8 months ago
- A Grapheme to Phoneme model using LSTM implemented in pytorch ☆13 · Jul 6, 2022 · Updated 3 years ago
- ☆13 · Apr 14, 2024 · Updated last year
- Neural text to speech system that uses eSpeak as a text/phoneme front-end ☆16 · Oct 20, 2021 · Updated 4 years ago
- CLASP: Contrastive Language-Speech Pretraining for Multilingual Multimodal Information Retrieval ☆13 · Jun 27, 2025 · Updated 7 months ago
- ☆25 · Mar 12, 2022 · Updated 3 years ago
- TTS for Singlish using Tacotron2, the IMDA corpus, and Pachyderm. ☆11 · Jan 11, 2020 · Updated 6 years ago
- PyTorch implementation of Listen, Attend and Spell (LAS) speech recognition paper ☆12 · Mar 4, 2022 · Updated 3 years ago
- ☆13 · Jan 5, 2025 · Updated last year
- ☆14 · Aug 1, 2025 · Updated 6 months ago
- Cross-Speaker Encoding Network for Multi-talker Speech Recognition ☆11 · Mar 14, 2025 · Updated 11 months ago
- ☆11 · Mar 22, 2023 · Updated 2 years ago
- Implementation of Transfer Learning from Speaker Verification to Multi-speaker Text-To-Speech Synthesis (SV2TTS) in Persian language. ☆13 · Oct 2, 2025 · Updated 4 months ago
- DysfluentWFST ☆17 · Nov 13, 2025 · Updated 3 months ago
- FINALLY: Fast and universal speech enhancement model delivering studio-quality audio for a wide range of recordings. ☆25 · Dec 11, 2025 · Updated 2 months ago
- SANE-TTS: Stable And Natural End-to-End Multilingual Text-to-Speech ☆11 · Jun 30, 2023 · Updated 2 years ago
- ☆14 · Jun 16, 2023 · Updated 2 years ago
- ☆82 · Jan 22, 2025 · Updated last year
- PyTorch Implementation of Google's Natural TTS Synthesis by Conditioning WaveNet on Mel Spectrogram Predictions. This implementation supp… ☆48 · Jul 31, 2023 · Updated 2 years ago
- Audio Research in US. US-based professors who work on audio (music, speech, acoustics). For students who would like to apply for RA, PhD,… ☆27 · Nov 13, 2025 · Updated 3 months ago
- This repository implement a novel zero-shot TTS framework, named Flamed-TTS, focusing on the efficient generation and dynamic pacing in … ☆57 · Aug 9, 2025 · Updated 6 months ago
- This is the repository for the work "BridgeVoC: Revitalizing Neural Vocoder from a Restoration Perspective". ☆63 · Nov 5, 2025 · Updated 3 months ago