PKBHY / WaveFM

WaveFM: A High-Fidelity and Efficient Vocoder Based on Flow Matching

☆26

Related projects

Alternatives and complementary repositories for WaveFM

cantabile-kwok / vec2wav2.0
Code for vec2wav 2.0, a speech token vocoder for VC. Paper: https://arxiv.org/abs/2409.01995
☆49 Updated this week
mcf330 / efts2code
Source code of EfficientTTS 2
☆12 Updated 9 months ago
francislata / unicats
An unofficial implementation of "UniCATS: A Unified Context-Aware Text-to-Speech Framework with Contextual VQ-Diffusion and Vocoding".
☆22 Updated last year
exercise-book-yq / Supercodec
☆42 Updated last month
asuni / PitchSqueezer
A robust pitch tracker using synchro-squeezed FFT and frequency-domain autocorrelation
☆34 Updated 10 months ago
lucadellalib / discrete-wavlm-codec
A neural speech codec based on discrete WavLM representations
☆21 Updated 2 months ago
shang0712 / HierTTS
☆44 Updated last year
ftshijt / Interspeech2024_DiscreteSpeechChallenge
This is the official train-dev-test release of the Interspeech2024 Discrete Speech Representation Challenge.
☆32 Updated 9 months ago
gwh22 / LAFMA
☆34 Updated 5 months ago
ogunlao / glowtts_stdp
Glow-TTS with Stochastic Duration Predictor and Stochastic Pitch Predictor
☆18 Updated last year
light1726 / SpeechTripleNet
The implementation of the paper "SpeechTripleNet: End-to-End Disentangled Speech Representation Learning for Content, Timbre and Prosody"
☆29 Updated 11 months ago
jisang93 / VISinger
Unofficial PyTorch implementation of VISinger: Variational Inference with Adversarial Learning for End-to-end Singing Voice Synthesis (IC…
☆15 Updated last year
jjunak-yun / FLowHigh_code
Code for the paper "FLowHigh: Towards efficient and high-quality audio super-resolution with single-step flow matching"
☆19 Updated 2 weeks ago
mubtasimahasan / DM-Codec
Source code for DM-Codec.
☆18 Updated last month
haiciyang / LaDiffCodec
☆47 Updated last week
alessandroragano / scoreq
SCOREQ: Speech COntrastive REgression for Quality Assessment (NeurIPS 2024)
☆38 Updated last month
innnky / descript-audio-vae
VAE modified from Descript Audio Codec, which replaces the RVQ with VAE
☆54 Updated 7 months ago
ex3ndr / supervoice-librilight-preprocessed
60k hours of phoneme-aligned audio from audio books
☆18 Updated 3 months ago
p1an-lin-jung / wv_tts
☆19 Updated 8 months ago
thuhcsi / SnakeGAN
Please visit https://thuhcsi.github.io/SnakeGAN/
☆36 Updated last year
Ereboas / TacoLM
☆16 Updated 6 months ago
slp-rl / SpokenStoryCloze
A spoken version of the textual story cloze benchmark
☆14 Updated last year
huutuongtu / Lightvoc
LightVoc: An Upsampling-Free GAN Vocoder Based on Conformer and Inverse Short-Time Fourier Transform
☆16 Updated 6 months ago
hhguo / SoCodec
Ultra-low-bitrate Speech Codec for Speech Language Modeling Applications
☆63 Updated 2 months ago
shivammehta25 / BetterFastSpeech2
Just another FastSpeech 2 but cleaner code :)
☆25 Updated 4 months ago
fcumlin / DNSMOSPro
☆19 Updated 2 months ago
ajaybati / miipher2.0
Reimplementation of Miipher
☆20 Updated last year
Yip-Jia-Qi / codecformer
☆15 Updated 4 months ago
AlanBaade / SyllableLM
Official Code for SyllableLM: Learning Coarse Semantic Units for Speech Language Models
☆35 Updated last month
ex3ndr / supervoice-hybrid
My hybrid TTS network that combines VALL-E, VoiceBox, SpeechFlow, Seamless and TortoiseTTS into one
☆27 Updated 3 months ago