xuyaoxun / MuCodec

☆41

Related projects ⓘ

Alternatives and complementary repositories for MuCodec

haiciyang / LaDiffCodec
☆47Updated 4 months ago
hhguo / SoCodec
Ultra-low-bitrate Speech Codec for Speech Language Modeling Applications
☆61Updated last month
gwh22 / LAFMA
☆34Updated 4 months ago
innnky / descript-audio-vae
VAE modified from Descript Audio Codec, which replaces the RVQ with VAE
☆54Updated 7 months ago
justinlovelace / SESD
☆48Updated 2 weeks ago
yangdongchao / SimpleSpeech
The open source code for SimpleSpeech series
☆108Updated last month
cantabile-kwok / vec2wav2.0
Code for vec2wav 2.0, a speech token vocoder for VC. Paper: https://arxiv.org/abs/2409.01995
☆45Updated this week
thuhcsi / SnakeGAN
Please visit https://thuhcsi.github.io/SnakeGAN/
☆36Updated last year
ftshijt / Interspeech2024_DiscreteSpeechChallenge
This is the official train-dev-test release of the Interspeech2024 Discrete Speech Representation Challenge.
☆32Updated 9 months ago
BakerBunker / FreeV
[InterSpeech 24] FreeV: Free Lunch For Vocoders Through Pseudo Inversed Mel Filter
☆78Updated 4 months ago
shang0712 / HierTTS
☆44Updated last year
haoheliu / SemantiCodec
☆40Updated 5 months ago
thuhcsi / DiffVar
☆30Updated last year
Aria-K-Alethia / BigCodec
Official implementation of the paper "BigCodec: Pushing the Limits of Low-Bitrate Neural Speech Codec"
☆82Updated last month
hs-oh-prml / DiffProsody
☆62Updated last year
cpdu / unicats
☆62Updated 9 months ago
scutcsq / Neural-Transducers-for-Two-Stage-Text-to-Speech-via-Semantic-Token-Prediction
Unofficial pytorch reproduction for the paper "Utilizing Neural Transducers for Two-Stage Text-to-Speech via Semantic Token Prediction" (…
☆58Updated 7 months ago
hs-oh-prml / DurFlexEVC
☆50Updated 9 months ago
keonlee9420 / evaluate-zero-shot-tts
Evaluation Protocol for Large-Scale Zero-Shot TTS Literature
☆65Updated last month
asappresearch / simple-tts
Contains the code associated with the ICLR submission for our text-to-speech diffusion model
☆51Updated last year
3loi / NaturalVoices
☆46Updated last week
y-ren16 / TiCodec
☆53Updated 10 months ago
light1726 / SpeechTripleNet
The implementation of paper "SpeechTripleNet: End-to-End Disentangled Speech Representation Learning for Content, Timbre and Prosody"
☆29Updated 11 months ago
francislata / unicats
An unofficial implementation of "UniCATS: A Unified Context-Aware Text-to-Speech Framework with Contextual VQ-Diffusion and Vocoding".
☆22Updated last year
cpdu / vallt
☆35Updated 9 months ago
alessandroragano / scoreq
SCOREQ: Speech COntrastive REgression for Quality Assessment (NeurIPS 2024)
☆37Updated 3 weeks ago
exercise-book-yq / Supercodec
☆40Updated 3 weeks ago
p0p4k / Matcha-TTS-2
E2E TTS using Conditional Flow Matching (Experimental*)
☆66Updated last year
soham97 / PAM
PAM is a no-reference audio quality metric for audio generation tasks
☆48Updated 3 months ago
maxrmorrison / promonet
Prosody and Pronunciation Modification Network
☆43Updated 3 months ago