auspicious3000/SpeechSplit-Demo

Unsupervised Speech Decomposition via Triple Information Bottleneck

☆14

Alternatives and similar repositories for SpeechSplit-Demo

Users that are interested in SpeechSplit-Demo are comparing it to the libraries listed below.

pc123455 / chorus-detection
☆15 · Jul 30, 2017 · Updated 8 years ago
vBaiCai / vc_tacotron
Voice Conversion using Tacotron.
☆11 · Dec 29, 2022 · Updated 3 years ago
ANLGBOY / WaveNODE
Pytorch Implementation of WaveNODE
☆64 · Sep 4, 2020 · Updated 5 years ago
revsic / torch-retriever-vc
PyTorch implementation of Retriever: Learning Content-Style Representation
☆12 · Jan 27, 2023 · Updated 3 years ago
rishikksh20 / PPSpeech
PPSpeech: Phrase based Parallel End-to-End TTS System
☆35 · Aug 31, 2020 · Updated 5 years ago
Yoshifumi-Nakano / visual-text-to-speech
visual-text to speech
☆14 · Apr 3, 2022 · Updated 4 years ago
numediart / LaughterSynthesis
This repository contains laughter-related synthesis systems.
☆13 · Nov 7, 2020 · Updated 5 years ago
acetylSv / GST-tacotron
Reproducing Style Tokens: Unsupervised Style Modeling, Control and Transfer in End-to-End Speech Synthesis (https://arxiv.org/pdf/1803.09…
☆61 · Jul 23, 2018 · Updated 8 years ago
ajinkyakulkarni14 / ERISHA
ERISHA is a multilingual multispeaker expressive speech synthesis framework. It can transfer the expressivity to the speaker's voice for…
☆44 · Dec 17, 2020 · Updated 5 years ago
karchkha / MelSpec_GPT_VQVAE
Audio Generation model working with GPT-2 and VQVAE compressed representation of MelSpectrograms
☆18 · Oct 8, 2023 · Updated 2 years ago
nc-ai / speech
☆17 · Aug 27, 2025 · Updated 10 months ago
keonlee9420 / Deep-Learning-TTS-Template
This is a template for the Non-autoregressive Deep Learning-Based TTS model (in PyTorch).
☆14 · Jun 15, 2021 · Updated 5 years ago
geneing / WaveRNN-Pytorch
Fatcord's Alternative WaveRNN (Faster training)
☆132 · Nov 29, 2020 · Updated 5 years ago
bshall / Tacotron
A PyTorch implementation of Location-Relative Attention Mechanisms For Robust Long-Form Speech Synthesis
☆115 · Dec 2, 2020 · Updated 5 years ago
li1jkdaw / LPCNet_parallel
Simulation of parallel synthesis with LPCNet vocoder
☆14 · May 5, 2020 · Updated 6 years ago
emotiontts / emotiontts_open_db
An open-source conversational speech synthesis platform that can express a robot's emotions and personality
☆108 · Feb 5, 2025 · Updated last year
xinshengwang / ICASSP2021_paper_list-VC
ICASSP 2021 accepted papers on voice conversion (VC)
☆18 · Apr 11, 2021 · Updated 5 years ago
jgarciapueyo / MelNet-SpeechGeneration
Implementation of MelNet in PyTorch to generate high-fidelity audio samples
☆25 · Sep 16, 2020 · Updated 5 years ago
yanggeng1995 / Multi-band-WaveRNN
☆45 · Dec 16, 2019 · Updated 6 years ago
resemble-ai / MelNet
WIP: Open Source Implementation of "MelNet: A Generative Model for Audio in the Frequency Domain"
☆257 · Aug 9, 2019 · Updated 6 years ago
bastibe / MAPS-Scripts
A fundamental frequency estimation algorithm using features from the magnitude and phase spectrogram.
☆25 · Mar 29, 2021 · Updated 5 years ago
ViEm-ccy / GEDLoss_pytorch
a pytorch implementation of Google GEDLoss
☆32 · Dec 9, 2020 · Updated 5 years ago
LCF2764 / autoKWS2021_1st_solution
Auto-KWS 2021 Challenge 1st place solution.
☆11 · Jul 20, 2021 · Updated 5 years ago
auspicious3000 / SpeechSplit
Unsupervised Speech Decomposition Via Triple Information Bottleneck
☆697 · Oct 23, 2024 · Updated last year
lucidrains / tranception-pytorch
Implementation of Tranception, an attention network, paired with retrieval, that is SOTA for protein fitness prediction
☆32 · Jun 19, 2022 · Updated 4 years ago
pgys / NoIze
A selective noise filter architecture driven by a CNN and Wiener filter
☆17 · Nov 21, 2019 · Updated 6 years ago
jongwook / crepe
☆12 · Jun 5, 2018 · Updated 8 years ago
sarulab-speech / lightweight_spkr_anon
Lightweight speaker anonymization [IEEE SLT2021]
☆27 · Jun 6, 2022 · Updated 4 years ago
xk-wang / MusicYOLO
MusicYOLO framework uses the object detection model, YOLOx, to locate notes in the spectrogram.
☆11 · Jan 29, 2022 · Updated 4 years ago
keonlee9420 / Robust_Fine_Grained_Prosody_Control
PyTorch Implementation of Robust and fine-grained prosody control of end-to-end speech synthesis
☆41 · Feb 20, 2022 · Updated 4 years ago
alishdipani / Neural-Style-Transfer-Audio
This is a PyTorch implementation of the Neural Style Transfer algorithm, modified for audio.
☆85 · Mar 16, 2022 · Updated 4 years ago
ericwudayi / SkipVQVC
An implementation of SkipVQVC with various settings.
☆75 · Jun 22, 2020 · Updated 6 years ago
nii-yamagishilab / vctk-silence-labels
☆25 · Oct 4, 2022 · Updated 3 years ago
VKW2021 / kaldi-baseline
kaldi cnn-tdnnf baseline
☆13 · Aug 31, 2021 · Updated 4 years ago
midas-research / speechmix
☆12 · Oct 2, 2020 · Updated 5 years ago
yunyikristy / ttsGAN-ICLR2019
☆25 · Apr 24, 2019 · Updated 7 years ago
KunZhou9646 / Speaker-independent-emotional-voice-conversion-based-on-conditional-VAW-GAN-and-CWT
This is the implementation of our Interspeech 2020 paper "Converting anyone's emotion: towards speaker-independent emotional voice conver…
☆90 · Nov 13, 2020 · Updated 5 years ago
xushengyuan / VocalnetOpenDataset
An open-source Chinese singing voice synthesis dataset.
☆24 · Jul 13, 2019 · Updated 7 years ago
keonlee9420 / STYLER
Official repository of STYLER: Style Factor Modeling with Rapidity and Robustness via Speech Decomposition for Expressive and Controllabl…
☆159 · Jun 5, 2025 · Updated last year