MaxMax2016 / StreamingHiFiGANLinks

An Open-source Streaming High-fidelity Neural Audio Codec

☆11

Alternatives and similar repositories for StreamingHiFiGAN

Users that are interested in StreamingHiFiGAN are comparing it to the libraries listed below

Sorting:

p1an-lin-jung / wv_tts
☆19Updated last year
audiodemo / voice-conversion
Vocoder-Free Non-Parallel Conversion of Whispered Speech With Masked Cycle-Consistent Generative Adversarial Networks
☆17Updated last year
reppy4620 / convnext_tts
Unofficial implementation of ConvNeXt-TTS powered by lightning
☆17Updated 8 months ago
lexkoro / cfm-vc
☆11Updated 4 months ago
shengcanxu / canoSpeech
text to speech
☆10Updated last year
jisang93 / VISinger
Unofficial pytorch implementation of VISinger: Variational Inference with Adversarial Learning for End-to-end Singing Voice Synthesis (IC…
☆15Updated 2 years ago
huutuongtu / Lightvoc
LIGHTVOC AN UPSAMPLING-FREE GAN VOCODER BASED ON CONFORMER AND INVERSE SHORT-TIME FOURIER TRANSFORM
☆18Updated last year
b-sigpro / sed-hsmm
Onset-and-Offset-Aware Sound Event Detection
☆17Updated 5 months ago
liuhuang31 / g2pw_once
G2pw's inference speed is accelerated by about 8-10 times. Change loop generated predictive data to only once and model loop prediction b…
☆14Updated last year
CODEJIN / XiaoiceSing2
☆19Updated 2 years ago
anton-kashkin / hifi_vc
☆25Updated 2 years ago
meaningTeam / tidy-tunes
Tidy Tunes is an easy-to-use pipeline for mining high-quality audio data for speech generation models. To do so, it chains multiple open …
☆21Updated last week
mcf330 / efts2code
source code of EfficientTTS 2
☆14Updated last year
reppy4620 / x-vits
☆13Updated 8 months ago
ishine / Mutiband-HIFIGAN
Mutiband version of HIFIGAN
☆18Updated 4 years ago
Chengyuann / AutoStyle-TTS
Official PyTorch implementation of "AutoStyle-TTS: Retrieval-Augmented Generation based Automatic Style Matching Text-to-Speech Synthesis…
☆14Updated 4 months ago
amphionspace / tts-evaluation
An evaluation set for large-scale trained TTS models (Coming in Sep 2024)
☆12Updated 10 months ago
cyhuang-tw / robust-vc
☆11Updated 3 years ago
shivammehta25 / BetterFastSpeech2
Just another FastSpeech 2 but cleaner code :)
☆26Updated last year
iamanigeeit / present
☆13Updated 10 months ago
ryota-komatsu / speech_resynth
Speech Resynthesis and Language Modeling
☆20Updated last month
ZehuaKcrissLi / GTR-Voice
☆13Updated 8 months ago
cpii-cai / PunCantonese
A Benchmark Corpus for Low-Resource Cantonese Punctuation Restoration from Speech Transcripts
☆14Updated 7 months ago
ex3ndr / supervoice-hybrid
My hybrid TTS network that combines, VALL-E, VoiceBox, SpeechFlow, Seamless and TortoiseTTS into one
☆26Updated 11 months ago
ogunlao / glowtts_stdp
Glow-TTS with Stochastic Duration Predictor and Stochastic Pitch Predictor
☆18Updated 2 years ago
liuhuang31 / Megatts2_HierSpeechpp
Megatts2 use HierSpeechpp's vocoder
☆18Updated 7 months ago
v-nhandt21 / MusicVoiceConversion
Sing any popular song with your voice
☆11Updated 3 years ago
miccio-dk / NISQA
NISQA - Non-Intrusive Speech Quality and TTS Naturalness Assessment
☆16Updated 3 years ago
ArenAcikgoz / Whisper-Alignment
Forced alignment decoder for Whisper.
☆14Updated last year
pengzhendong / audio-pipeline
☆21Updated 8 months ago