Top34051/stargan-zsvc

Unofficial PyTorch Implementation of StarGAN-ZSVC

☆14

Alternatives and similar repositories for stargan-zsvc

Users who are interested in stargan-zsvc are comparing it to the repositories listed below.

kvrooman / faceswap_
Unofficial project based on the original /r/Deepfakes thread. Many thanks to him!
☆15 · Feb 19, 2020 · Updated 6 years ago
jacquelineCelia / lexicon_discovery
Source code for "Unsupervised Lexicon Discovery from Acoustic Input", Lee et al., 2015 TACL
☆10 · Aug 11, 2016 · Updated 9 years ago
ruanvdmerwe / triplet-entropy-loss
Project repository for the work done in Triplet Entropy Loss: Improving The Generalization of Short Speech Language Identification Syst…
☆13 · Feb 17, 2021 · Updated 5 years ago
RF5 / simple-asgan
Training code and trained checkpoints for ASGAN.
☆62 · Dec 27, 2023 · Updated 2 years ago
RF5 / transfusion-asr
Transcribing Speech with Multinomial Diffusion, training code and models.
☆80 · Sep 27, 2023 · Updated 2 years ago
Infinity-INF / fast-phasr
Phoneme and duration labeling based on Whisper small
☆11 · Jul 7, 2024 · Updated 2 years ago
torbsto / kafka-salsa
☆10 · Dec 3, 2020 · Updated 5 years ago
kamperh / linearvc
Voice conversion with just linear regression.
☆37 · Sep 25, 2025 · Updated 10 months ago
gchrupala / visually-grounded-speech
Representations of language in a model of visually grounded speech signal.
☆23 · Apr 19, 2018 · Updated 8 years ago
maum-ai / assem-vc
Official Code for Assem-VC @ICASSP2022
☆269 · May 16, 2022 · Updated 4 years ago
shashankshirol / GeneratingNoisySpeechData
A repository comprising code for generating noisy speech data from clean data using deep learning methods
☆16 · Jul 12, 2021 · Updated 5 years ago
MelissaChen15 / control-vc
This is the implementation for "ControlVC: Zero-Shot Voice Conversion with Time-Varying Controls on Pitch and Rhythm"
☆132 · Nov 29, 2023 · Updated 2 years ago
hcy71o / SC-CNN
SC-CNN: Effective Speaker Conditioning Method for Zero-Shot Multi-Speaker Text-to-Speech Systems
☆39 · Nov 1, 2023 · Updated 2 years ago
karchkha / MelSpec_GPT_VQVAE
Audio Generation model working with GPT-2 and VQVAE compressed representation of MelSpectrograms
☆18 · Oct 8, 2023 · Updated 2 years ago
RedFerret61 / MarkMelGen
MarkMelGen is a Markov Melody Generation program that takes configuration, lyric, and example music files and creates a tune for the sup…
☆17 · Jan 29, 2026 · Updated 6 months ago
CODEJIN / XiaoiceSing2
☆19 · Feb 2, 2023 · Updated 3 years ago
shaojinding / Adversarial-Many-to-Many-VC
[InterSpeech 2020] "Improving the Speaker Identity of Non-Parallel Many-to-Many Voice Conversion with Adversarial Speaker Recognition" by …
☆39 · Mar 24, 2023 · Updated 3 years ago
kamperh / recipe_swbd_wordembeds
☆22 · Mar 22, 2017 · Updated 9 years ago
JRMeyer / jphones
A Python3 program for converting Japanese words and numbers into phonemes.
☆18 · Apr 24, 2018 · Updated 8 years ago
lifeiteng / VoiceBox
Voicebox: Text-Guided Multilingual Universal Speech Generation at Scale
☆29 · Aug 4, 2023 · Updated 2 years ago
SolomidHero / real-time-voice-conversion
Toolbox for easy and qualitative one-shot voice conversion
☆48 · Dec 5, 2021 · Updated 4 years ago
kamperh / nlp817
Natural Language Processing 817
☆24 · Mar 12, 2026 · Updated 4 months ago
kamperh / couscous
Siamese neural networks for representation learning using Theano.
☆20 · Oct 14, 2015 · Updated 10 years ago
Connum / npm-pinyin2ipa
Converts Mandarin Chinese pinyin notation to IPA (International Phonetic Alphabet) notation
☆19 · Nov 28, 2023 · Updated 2 years ago
hubertsiuzdak / voice-conversion
Voice conversion using deep adversarial learning
☆17 · Oct 29, 2021 · Updated 4 years ago
openaudiolab / LLaST
LLaST: Improved End-to-end Speech Translation System Leveraged by Large Language Models
☆26 · Aug 11, 2024 · Updated last year
Nathan-Roll1 / PSST
Prosodic Speech Segmentation with Transformers
☆28 · Feb 25, 2024 · Updated 2 years ago
WelkinYang / EMPHASIS-pytorch
EMPHASIS: An Emotional Phoneme-based Acoustic Model for Speech Synthesis System
☆15 · Mar 31, 2019 · Updated 7 years ago
grtzsohalf / SpeechNet-codebase
☆21 · Jun 1, 2021 · Updated 5 years ago
bshall / VectorQuantizedCPC
Vector-Quantized Contrastive Predictive Coding for Acoustic Unit Discovery and Voice Conversion
☆142 · Sep 1, 2020 · Updated 5 years ago
hhguo / WaveRNN
Based on https://github.com/fatchord/WaveRNN
☆24 · May 3, 2020 · Updated 6 years ago
b04901014 / UUVC
Official implementation for the paper: A Unified One-Shot Prosody and Speaker Conversion System with Self-Supervised Discrete Speech Unit…
☆83 · Jan 7, 2023 · Updated 3 years ago
mutiann / neural-lexicon-reader
Neural Lexicon Reader: Reduce Pronunciation Errors in End-to-end TTS by Leveraging External Textual Knowledge
☆21 · Jul 25, 2022 · Updated 4 years ago
hertz-pj / dinglingling
dinglingling, your program over!
☆18 · Mar 27, 2020 · Updated 6 years ago
IDEA-Emdoor-Lab / DistilCodec
A Neural Audio Codec (NAC) for Universal Audio
☆47 · May 30, 2025 · Updated last year
winddori2002 / TriAAN-VC
TriAAN-VC: Triple Adaptive Attention Normalization for Any-to-Any Voice Conversion
☆146 · Jan 15, 2024 · Updated 2 years ago
jefflai108 / Semi-Supervsied-Spoken-Language-Understanding-PyTorch
Semi-supervised spoken language understanding (SLU) via self-supervised speech and language model pretraining
☆12 · Mar 23, 2021 · Updated 5 years ago
msamribeiro / deep-cca
Deep Canonical Correlation Analysis (DCCA) implementation using Theano
☆25 · Jan 9, 2017 · Updated 9 years ago
AI-Unicamp / TTS-Objective-Metrics
Objective metrics used in several text-to-speech (TTS) papers.
☆54 · Jun 17, 2025 · Updated last year