auspicious3000/contentvec

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/auspicious3000/contentvec)

auspicious3000 / contentvec

speech self-supervised representations

☆521

Alternatives and similar repositories for contentvec

Users that are interested in contentvec are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

bshall / soft-vc
View on GitHub
Soft speech units for voice conversion
☆455Mar 14, 2024Updated 2 years ago
bshall / hubert
View on GitHub
HuBERT content encoders for: A Comparison of Discrete and Soft Speech Units for Improved Voice Conversion
☆405Oct 1, 2024Updated last year
adelacvg / NS2VC
View on GitHub
Unofficial implementation of NaturalSpeech2 for Voice Conversion and Text to Speech
☆237Feb 29, 2024Updated 2 years ago
b04901014 / MQTTS
View on GitHub
☆260May 15, 2023Updated 3 years ago
Dream-High / RMVPE
View on GitHub
☆327Jan 25, 2024Updated 2 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
yangdongchao / SoundStorm
View on GitHub
The reproduced code for Google's SoundStorm
☆274Oct 7, 2023Updated 2 years ago
0nutation / USLM
View on GitHub
Unified Speech Language Model for paper "SpeechTokenizer: Unified Speech Tokenizer for Speech Large Language Models"(ICLR 2024)
☆152Sep 14, 2023Updated 2 years ago
MelissaChen15 / control-vc
View on GitHub
This is the implementation for "ControlVC: Zero-Shot Voice Conversion with Time-Varying Controls on Pitch and Rhythm"
☆133Nov 29, 2023Updated 2 years ago
biggytruck / SpeechSplit2
View on GitHub
Official implementation of SpeechSplit2
☆135Oct 22, 2022Updated 3 years ago
auspicious3000 / SpeechSplit
View on GitHub
Unsupervised Speech Decomposition Via Triple Information Bottleneck
☆697Oct 23, 2024Updated last year
chomeyama / SiFiGAN
View on GitHub
Official implementation of the source-filter HiFiGAN vocoder
☆274Jul 29, 2023Updated 2 years ago
jik876 / hifi-gan
View on GitHub
HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis
☆2,355Jul 27, 2024Updated last year
NVIDIA / BigVGAN
View on GitHub
Official PyTorch implementation of BigVGAN (ICLR 2023)
☆1,229Sep 5, 2024Updated last year
maum-ai / phaseaug
View on GitHub
ICASSP 2023 Accepted
☆191May 6, 2024Updated 2 years ago
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
ZhangXInFD / SpeechTokenizer
View on GitHub
This is the code for the SpeechTokenizer presented in the SpeechTokenizer: Unified Speech Tokenizer for Speech Language Models. Samples a…
☆661Jun 9, 2024Updated 2 years ago
X-LANCE / UniCATS-CTX-vec2wav
View on GitHub
[AAAI 2024] Code for CTX-vec2wav in UniCATS
☆130Jun 11, 2024Updated 2 years ago
maxrmorrison / torchcrepe
View on GitHub
Pytorch implementation of the CREPE pitch tracker
☆522May 16, 2025Updated last year
innnky / ar-vits
View on GitHub
text to speech using autoregressive transformer and VITS
☆248Apr 3, 2024Updated 2 years ago
lesterphillip / SVCC23_FastSVC
View on GitHub
Singing Voice Conversion Challenge 2023 Starter Kit: FastSVC Reimplementation
☆116Nov 25, 2023Updated 2 years ago
facebookresearch / speech-resynthesis
View on GitHub
An official reimplementation of the method described in the INTERSPEECH 2021 paper - Speech Resynthesis from Discrete Disentangled Self-S…
☆416Aug 29, 2023Updated 2 years ago
sh-lee-prml / BigVGAN
View on GitHub
Unofficial pytorch implementation of BigVGAN: A Universal Neural Vocoder with Large-Scale Training
☆136Feb 18, 2023Updated 3 years ago
sony / bigvsan
View on GitHub
Pytorch implementation of BigVSAN
☆203Dec 9, 2025Updated 7 months ago
dhchoi99 / NANSY
View on GitHub
☆171Jul 25, 2022Updated 3 years ago
End-to-end encrypted cloud storage - Proton Drive • Ad
Special offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
X-LANCE / VoiceFlow-TTS
View on GitHub
[ICASSP 2024] This is the official code for "VoiceFlow: Efficient Text-to-Speech with Rectified Flow Matching"
☆375Sep 3, 2024Updated last year
auspicious3000 / AutoPST
View on GitHub
Global Rhythm Style Transfer Without Text Transcriptions
☆285Oct 23, 2024Updated last year
quickvc / QuickVC-VoiceConversion
View on GitHub
QuickVC: Any-to-many Voice Conversion Using Inverse Short-time Fourier Transform for Faster Conversion
☆261Jul 13, 2023Updated 3 years ago
yangdongchao / LLM-Codec
View on GitHub
The open source code for LLM-Codec
☆147Aug 18, 2024Updated last year
zhenye234 / CoMoSpeech
View on GitHub
ACM MM 2023 CoMoSpeech: One-Step Speech and Singing Voice Synthesis via Consistency Model
☆214Apr 26, 2024Updated 2 years ago
yangdongchao / UniAudio
View on GitHub
The Open Source Code of UniAudio
☆606Jul 22, 2024Updated last year
sp-nitech / diffsptk
View on GitHub
A differentiable version of SPTK
☆201Updated this week
yl4579 / StyleTTS-VC
View on GitHub
Official Implementation of StyleTTS-VC
☆199Jan 14, 2025Updated last year
innnky / descript-audio-vae
View on GitHub
VAE modified from Descript Audio Codec, which replaces the RVQ with VAE
☆92Apr 2, 2024Updated 2 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
hayeong0 / DDDM-VC
View on GitHub
Official Pytorch Implementation for "DDDM-VC: Decoupled Denoising Diffusion Models with Disentangled Representation and Prior Mixup for V…
☆244Jul 31, 2024Updated last year
yl4579 / PitchExtractor
View on GitHub
Deep Neural Pitch Extractor for Voice Conversion and TTS Training
☆151Aug 22, 2022Updated 3 years ago
liusongxiang / ppg-vc
View on GitHub
PPG-Based Voice Conversion
☆349Jul 22, 2022Updated 3 years ago
yangdongchao / AcademiCodec
View on GitHub
AcademiCodec: An Open Source Audio Codec Model for Academic Research
☆674Dec 27, 2023Updated 2 years ago
zhangyongmao / VISinger2
View on GitHub
VISinger 2: High-Fidelity End-to-End Singing Voice Synthesis Enhanced by Digital Signal Processing Synthesizer
☆355Nov 4, 2024Updated last year
bshall / acoustic-model
View on GitHub
Acoustic models for: A Comparison of Discrete and Soft Speech Units for Improved Voice Conversion
☆104Mar 10, 2026Updated 4 months ago
CNChTu / FCPE
View on GitHub
☆201Oct 14, 2025Updated 9 months ago