Top34051 / stargan-zsvcView external linksLinks
Unofficial PyTorch Implementation of StarGAN-ZSVC
☆14Aug 5, 2021Updated 4 years ago
Alternatives and similar repositories for stargan-zsvc
Users that are interested in stargan-zsvc are comparing it to the libraries listed below
Sorting:
- Phonemes and durations labeling based on whisper small☆11Jul 7, 2024Updated last year
- A repository comprising of code for generation of noisy speech data from clean data using deep learning methods☆16Jul 12, 2021Updated 4 years ago
- SC-CNN: Effective Speaker Conditioning Method for Zero-Shot Multi-Speaker Text-to-Speech Systems☆39Nov 1, 2023Updated 2 years ago
- Official Repository of UltraVoice☆58Oct 28, 2025Updated 3 months ago
- Non official project based on original /r/Deepfakes thread. Many thanks to him!☆15Feb 19, 2020Updated 5 years ago
- Audio Generation model working with GPT-2 and VQVAE compressed representation of MelSpectrograms☆18Oct 8, 2023Updated 2 years ago
- A Python3 program for converting Japanese words and numbers into phonemes.☆18Apr 24, 2018Updated 7 years ago
- Converts Mandarin Chinese pinyin notation to IPA (international phonetic alphabet) notation☆18Nov 28, 2023Updated 2 years ago
- EMPHASIS: An Emotional Phoneme-based Acoustic Model for Speech Synthesis System☆15Mar 31, 2019Updated 6 years ago
- [InterSpeech 2020] "Improving the Speaker Identity of Non-Parallel Many-to-Many VoiceConversion with Adversarial Speaker Recognition" by …☆39Mar 24, 2023Updated 2 years ago
- ☆19Feb 2, 2023Updated 3 years ago
- ☆21Jun 1, 2021Updated 4 years ago
- dinglingling, your program over!☆18Mar 27, 2020Updated 5 years ago
- Toolbox for easy and qualitative one-shot voice conversion☆46Dec 5, 2021Updated 4 years ago
- Voice conversion with just linear regression.☆33Sep 25, 2025Updated 4 months ago
- This is the implementation for "ControlVC: Zero-Shot Voice Conversion with Time-Varying Controls on Pitch and Rhythm"☆134Nov 29, 2023Updated 2 years ago
- Official Code for Assem-VC @ICASSP2022☆269May 16, 2022Updated 3 years ago
- Hacking tools for Megadrive Streets of Rage game series☆11Jan 28, 2026Updated 2 weeks ago
- Objective metrics used in several text-to-speech (TTS) papers.☆52Jun 17, 2025Updated 8 months ago
- Implementation of "Audio Retrieval with Natural Language Queries", INTERSPEECH 2021, PyTorch☆26Aug 18, 2023Updated 2 years ago
- Based on https://github.com/fatchord/WaveRNN☆24May 3, 2020Updated 5 years ago
- LLaST: Improved End-to-end Speech Translation System Leveraged by Large Language Models☆25Aug 11, 2024Updated last year
- Speech synthesis using LPC☆23Jun 5, 2021Updated 4 years ago
- Voicebox: Text-Guided Multilingual Universal Speech Generation at Scale☆28Aug 4, 2023Updated 2 years ago
- This is an unofficial implementation of universal melgan according to https://arxiv.org/abs/2011.09631☆23Aug 15, 2022Updated 3 years ago
- Prosodic Speech Segmentation with Transformers☆26Feb 25, 2024Updated last year
- ☆25Mar 12, 2022Updated 3 years ago
- Training code and trained checkpoints for ASGAN.☆62Dec 27, 2023Updated 2 years ago
- ☆23Dec 10, 2024Updated last year
- ☆25Apr 24, 2019Updated 6 years ago
- TTS Text Analyzer☆32Jul 20, 2023Updated 2 years ago
- Voice conversion model for real-time speech synthesis using PPG (Phonetic PosteriorGram) as an intermediate feature, written in Pytorch.☆28Mar 3, 2022Updated 3 years ago
- demo page https://MingjieChen.github.io/dygan-vc☆67Apr 13, 2022Updated 3 years ago
- ☆112Jun 11, 2021Updated 4 years ago
- Autovocoder: Fast Waveform Generation from a Learned Speech Representation using Differentiable Digital Signal Processing☆71Dec 2, 2022Updated 3 years ago
- Transcribing Speech with Multinomial Diffusion, training code and models.☆80Sep 27, 2023Updated 2 years ago
- a guide to grapheme-to-phoneme conversion and phoneme list for ace singing voice synthesis engine☆41Jan 17, 2025Updated last year
- ☆26Sep 22, 2022Updated 3 years ago
- My vocoder experiments☆31Jul 26, 2025Updated 6 months ago