kamepong / ACVAE-VCLinks

☆10

Alternatives and similar repositories for ACVAE-VC

Users that are interested in ACVAE-VC are comparing it to the libraries listed below

Sorting:

WangHelin1997 / DuTa-VC
Source code and demo for INTERPSEECH 2023 paper: DuTa-VC: A Duration-aware Typical-to-atypical Voice Conversion Approach with Diffusion P…
☆37Updated last year
ryuzho / DiffVC
Diffusion Model for Voice Conversion
☆17Updated 2 years ago
philgzl / brever
Speech enhancement in noisy and reverberant environments using deep neural networks
☆21Updated 3 weeks ago
yuwchen / BASPRO
☆11Updated 2 years ago
zy-du / Disentanglement-of-Emotional-Style-and-Speaker-Identity-for-Expressive-Voice-Conversion
This is the implementation our Interspeech 2022 paper " Disentanglement of Emotional Style and Speaker Identity for Expressive Voice Conv…
☆20Updated last year
audiodemo / voice-conversion
Vocoder-Free Non-Parallel Conversion of Whispered Speech With Masked Cycle-Consistent Generative Adversarial Networks
☆17Updated last year
ogunlao / glowtts_stdp
Glow-TTS with Stochastic Duration Predictor and Stochastic Pitch Predictor
☆18Updated 2 years ago
pengzhendong / streaming-vocos
Streaming Vocos
☆28Updated last month
sarulab-speech / spatial_voice_conversion
Spatial Voice Conversion: Voice Conversion Preserving Spatial Information and Non-target Signals
☆17Updated 11 months ago
hrnoh / f0-autovc
Pytorch implementation of "f0-consistent many-to-many non-parallel voice conversion via conditional autoencoder"
☆29Updated 4 years ago
lexkoro / cfm-vc
☆11Updated 4 months ago
meaningTeam / tidy-tunes
Tidy Tunes is an easy-to-use pipeline for mining high-quality audio data for speech generation models. To do so, it chains multiple open …
☆21Updated last week
hcy71o / SC-CNN
SC-CNN: Effective Speaker Conditioning Method for Zero-Shot Multi-Speaker Text-to-Speech Systems
☆39Updated last year
gteu / realtime-ppg-vc
Voice conversion model for real-time speech synthesis using PPG (Phonetic PosteriorGram) as an intermediate feature, written in Pytorch.
☆28Updated 3 years ago
shivammehta25 / BetterFastSpeech2
Just another FastSpeech 2 but cleaner code :)
☆26Updated last year
chaufanglin / Normal2Whisper
Implementation of "Improving Whispered Speech Recognition Performance using Pseudo-whispered based Data Augmentation"
☆11Updated 8 months ago
ex3ndr / supervoice-hybrid
My hybrid TTS network that combines, VALL-E, VoiceBox, SpeechFlow, Seamless and TortoiseTTS into one
☆26Updated 11 months ago
Arshdeep-Singh-Boparai / E-PANNs
☆13Updated last week
asuni / PitchSqueezer
A robust pitch tracker using synchro-squeezed fft and frequency domain autocorrelation
☆34Updated last year
lucadellalib / discrete-wavlm-codec
A neural speech codec based on discrete WavLM representations
☆24Updated 10 months ago
OlaWod / PitchVC
PitchVC: Pitch Conditioned Any-to-Many Voice Conversion
☆33Updated last year
ga642381 / RobustVC
**ICASSP 2022** 《Toward Degradation-Robust Voice Conversion》Using speech enhancement and end-to-end denoising training to improve degrada…
☆24Updated 2 years ago
cyhuang-tw / robust-vc
☆11Updated 3 years ago
AI-S2-Lab / GPT-Talker
[ACMMM'2024] Generative Expressive Conversational Speech Synthesis
☆36Updated 8 months ago
Mddct / usm-tokenizer
semantic tokenizer for speech and music
☆21Updated last week
Tikai7 / DiTTO-TTS
DiTTo-TTS: Diffusion Transformers for Scalable Text-to-Speech without Domain-Specific Factors
☆28Updated 5 months ago
yangdongchao / ALMTokenizer2
The open source code of ALMTokenizer2: Towards Low bit-rate and Semantic-rich Audio Tokenizer with Flow-based Scalar Diffusion Transforme…
☆26Updated last month
francislata / unicats
An unofficial implementation of "UniCATS: A Unified Context-Aware Text-to-Speech Framework with Contextual VQ-Diffusion and Vocoding".
☆26Updated last year
p0p4k / Matcha-TTS-2
E2E TTS using Conditional Flow Matching (Experimental*)
☆70Updated last year
IS2AI / KazEmoTTS
An open-source Kazakh Emotional Text-to-Speech Dataset
☆30Updated last year