unixpickle/vq-voice-swap

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/unixpickle/vq-voice-swap)

unixpickle / vq-voice-swap

Voice swapping with VQ-VAE and diffusion models

☆68

Alternatives and similar repositories for vq-voice-swap

Users that are interested in vq-voice-swap are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

uthree / ddsp-vocoder
View on GitHub
☆12Nov 7, 2024Updated last year
vivjay30 / pnf-sampling
View on GitHub
☆22Jun 8, 2021Updated 5 years ago
CSTR-Edinburgh / snickery
View on GitHub
Hybrid speech synthesiser
☆28Feb 18, 2019Updated 7 years ago
yoyolicoris / variational-diffwave
View on GitHub
☆32Jul 27, 2022Updated 4 years ago
archinetai / aligner-pytorch
View on GitHub
Sequence alignement methods with helpers for PyTorch.
☆24Nov 30, 2022Updated 3 years ago
Proton VPN Special Offer - Get 70% off • Ad
Special partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
pabloppp / Arroz-Con-Cosas
View on GitHub
Experimental LDM uses of Paella's architecture
☆34Jan 26, 2023Updated 3 years ago
sushant-t / tts-trainer
View on GitHub
Generate audio datasets for training Text-To-Speech models, through smart audio splitting with silence detection, and transcription using…
☆30May 27, 2023Updated 3 years ago
nii-yamagishilab / SpeechSPC-mini
View on GitHub
Speech Security and Privacy Compendium - Mini
☆10Jun 18, 2024Updated 2 years ago
vtuber-plan / vcvits
View on GitHub
Non Parallel Voice Conversion based on VITS
☆24Mar 31, 2023Updated 3 years ago
vtuber-plan / FlowVAE
View on GitHub
☆17Dec 12, 2023Updated 2 years ago
audiodemo / voice-conversion
View on GitHub
Vocoder-Free Non-Parallel Conversion of Whispered Speech With Masked Cycle-Consistent Generative Adversarial Networks
☆17Aug 18, 2023Updated 2 years ago
rishikksh20 / MiniMax-TTS-pytorch
View on GitHub
Try to replicate the architecture of MiniMaxTTS mentioned in it's technical report
☆47Sep 2, 2025Updated 10 months ago
adobe-research / AutoToon
View on GitHub
☆25Mar 25, 2023Updated 3 years ago
SonyCSLParis / interactive-spectrogram-inpainting
View on GitHub
Implementation of the framework described in the paper Spectrogram Inpainting for Interactive Generation of Instrument Sounds published a…
☆40Oct 6, 2022Updated 3 years ago
Simple, predictable pricing with DigitalOcean hosting • Ad
Always know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
reppy4620 / x-vits
View on GitHub
☆14Aug 1, 2025Updated 11 months ago
lars76 / fastspeech2-clean
View on GitHub
Clean and modernized implementation of FastSpeech2/LightSpeech using IPA
☆18Aug 16, 2024Updated last year
seahore / PPG-GradVC
View on GitHub
A diffusion-based cross-lingual voice conversion model, as my bachelor's thesis
☆45Jul 24, 2023Updated 3 years ago
duerig / StyleTTS2
View on GitHub
StyleTTS 2 Optimized Training Fork
☆32Feb 2, 2025Updated last year
hcy71o / AutoVocoder
View on GitHub
Autovocoder: Fast Waveform Generation from a Learned Speech Representation using Differentiable Digital Signal Processing
☆71Dec 2, 2022Updated 3 years ago
Aria-K-Alethia / speaking-rate-controllable-hifi-gan
View on GitHub
☆16Apr 4, 2022Updated 4 years ago
biboamy / instrument-disentangle
View on GitHub
☆23Aug 2, 2019Updated 6 years ago
jinny1208 / All-About-Speech
View on GitHub
☆14Apr 2, 2023Updated 3 years ago
shengcanxu / canoSpeech
View on GitHub
text to speech
☆10Mar 19, 2024Updated 2 years ago
Serverless GPU API endpoints on Runpod - Get Bonus Credits • Ad
Skip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
shang0712 / HierTTS
View on GitHub
☆47Apr 16, 2023Updated 3 years ago
roger-tseng / CodecFake
View on GitHub
A deepfake audio dataset for detecting fake speech from codec-based speech synthesis systems, Interspeech 2024
☆22Jul 27, 2024Updated 2 years ago
tuan3w / cnn_vocoder
View on GitHub
A fast cnn-based vocoder
☆78Jun 11, 2020Updated 6 years ago
k2kobayashi / crank
View on GitHub
A toolkit for non-parallel voice conversion based on vector-quantized variational autoencoder
☆171Jul 25, 2024Updated 2 years ago
roholazandie / ryan-tts
View on GitHub
☆18Jan 17, 2022Updated 4 years ago
KdaiP / conformer-RoPE
View on GitHub
Conformer block with Rotary Position Embedding, modified from lucidrains' implement
☆19Sep 13, 2024Updated last year
crowsonkb / cloob-training
View on GitHub
CLOOB training (JAX) and inference (JAX and PyTorch)
☆76May 16, 2022Updated 4 years ago
andabi / voice-disciminator
View on GitHub
A neural network for filtering target speaker's voice from audio written in tensorflow
☆21Jun 21, 2018Updated 8 years ago
lifeiteng / TTS-TextAnalyzer
View on GitHub
TTS Text Analyzer
☆31Jul 20, 2023Updated 3 years ago
End-to-end encrypted email - Proton Mail • Ad
Special offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
rishikksh20 / Zero-Shot-TTS
View on GitHub
Unofficial Implementation of Zero-Shot Text-to-Speech for Text-Based Insertion in Audio Narration
☆34Sep 24, 2021Updated 4 years ago
adelacvg / DPTTS
View on GitHub
An AR+AR TTS attempt.
☆18Jan 13, 2025Updated last year
candlewill / RawNet
View on GitHub
RawNet: Fast End-to-End Neural Vocoder
☆43May 29, 2019Updated 7 years ago
y-chan / hifi-gan-misrnet
View on GitHub
unofficial pytorch implementation of HiFi-GAN with fast MISR.
☆15Mar 21, 2023Updated 3 years ago
johnryan465 / pscan
View on GitHub
☆40Jan 5, 2024Updated 2 years ago
gustavo-beck / wavebender-gan
View on GitHub
☆25Sep 27, 2022Updated 3 years ago
AeroScripts / HiddenEngrams
View on GitHub
Hidden Engrams: Long Term Memory for Transformer Model Inference
☆35Jun 26, 2021Updated 5 years ago