Unofficial PyTorch Implementation of StarGAN-ZSVC
☆14Aug 5, 2021Updated 4 years ago
Alternatives and similar repositories for stargan-zsvc
Users that are interested in stargan-zsvc are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Voice conversion with just linear regression.☆37Sep 25, 2025Updated 6 months ago
- Non official project based on original /r/Deepfakes thread. Many thanks to him!☆15Feb 19, 2020Updated 6 years ago
- Source code for "Unsupervised Lexicon Discovery from Acoustic Input ", Lee et al, 2015 TACL☆10Aug 11, 2016Updated 9 years ago
- Training code and trained checkpoints for ASGAN.☆62Dec 27, 2023Updated 2 years ago
- Project repository for the work done in Triplet Entropy Loss: Improving The Generalization of Short Speech Language Identification Syst…☆13Feb 17, 2021Updated 5 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- Transcribing Speech with Multinomial Diffusion, training code and models.☆80Sep 27, 2023Updated 2 years ago
- Phonemes and durations labeling based on whisper small☆11Jul 7, 2024Updated last year
- Representations of language in a model of visually grounded speech signal.☆23Apr 19, 2018Updated 8 years ago
- Official Repository of UltraVoice☆61Oct 28, 2025Updated 5 months ago
- Official Code for Assem-VC @ICASSP2022☆269May 16, 2022Updated 3 years ago
- A repository comprising of code for generation of noisy speech data from clean data using deep learning methods☆16Jul 12, 2021Updated 4 years ago
- This is the implementation for "ControlVC: Zero-Shot Voice Conversion with Time-Varying Controls on Pitch and Rhythm"☆133Nov 29, 2023Updated 2 years ago
- SC-CNN: Effective Speaker Conditioning Method for Zero-Shot Multi-Speaker Text-to-Speech Systems☆39Nov 1, 2023Updated 2 years ago
- Audio Generation model working with GPT-2 and VQVAE compressed representation of MelSpectrograms☆18Oct 8, 2023Updated 2 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- MarkMelGen is a Markov Melody Generation program that takes configuration, lyric, and example music files and creates a tune for the sup…☆14Jan 29, 2026Updated 2 months ago
- ☆19Feb 2, 2023Updated 3 years ago
- [InterSpeech 2020] "Improving the Speaker Identity of Non-Parallel Many-to-Many VoiceConversion with Adversarial Speaker Recognition" by …☆39Mar 24, 2023Updated 3 years ago
- Voicebox: Text-Guided Multilingual Universal Speech Generation at Scale☆28Aug 4, 2023Updated 2 years ago
- ☆22Mar 22, 2017Updated 9 years ago
- A Python3 program for converting Japanese words and numbers into phonemes.☆18Apr 24, 2018Updated 7 years ago
- Toolbox for easy and qualitative one-shot voice conversion☆46Dec 5, 2021Updated 4 years ago
- Natural Language Processing 817☆24Mar 12, 2026Updated last month
- Siamese neural networks for representation learning using Theano.☆20Oct 14, 2015Updated 10 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Converts Mandarin Chinese pinyin notation to IPA (international phonetic alphabet) notation☆18Nov 28, 2023Updated 2 years ago
- T5Voice is a lightweight PyTorch implementation of T5-based text-to-speech synthesis, supporting both streaming and non-streaming speech …☆28Nov 7, 2025Updated 5 months ago
- Voice conversion using deep adversarial learning☆17Oct 29, 2021Updated 4 years ago
- ☆21Jun 1, 2021Updated 4 years ago
- EMPHASIS: An Emotional Phoneme-based Acoustic Model for Speech Synthesis System☆15Mar 31, 2019Updated 7 years ago
- LLaST: Improved End-to-end Speech Translation System Leveraged by Large Language Models☆27Aug 11, 2024Updated last year
- ☆21Apr 6, 2025Updated last year
- Based on https://github.com/fatchord/WaveRNN☆24May 3, 2020Updated 5 years ago
- Vector-Quantized Contrastive Predictive Coding for Acoustic Unit Discovery and Voice Conversion☆143Sep 1, 2020Updated 5 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Minimal module for computing audio spectrograms☆15Feb 28, 2019Updated 7 years ago
- Prosodic Speech Segmentation with Transformers☆26Feb 25, 2024Updated 2 years ago
- Official implementation for the paper: A Unified One-Shot Prosody and Speaker Conversion System with Self-Supervised Discrete Speech Unit…☆83Jan 7, 2023Updated 3 years ago
- TriAAN-VC: Triple Adaptive Attention Normalization for Any-to-Any Voice Conversion☆147Jan 15, 2024Updated 2 years ago
- dinglingling, your program over!☆18Mar 27, 2020Updated 6 years ago
- ☆23Nov 25, 2025Updated 4 months ago
- Objective metrics used in several text-to-speech (TTS) papers.☆53Jun 17, 2025Updated 10 months ago