TariqAHassan / HiFiHybrid

Hifi-like Vocoder implemented in PyTorch

☆13

Alternatives and similar repositories for HiFiHybrid

Users who are interested in HiFiHybrid are comparing it to the libraries listed below.

uthree / ddsp-vocoder
☆10 Updated 8 months ago
audiodemo / voice-conversion
Vocoder-Free Non-Parallel Conversion of Whispered Speech With Masked Cycle-Consistent Generative Adversarial Networks
☆17 Updated last year
yoongi43 / VRVQ
Implementation of the paper "Variable Bitrate Residual Vector Quantization for Audio Coding"
☆11 Updated 3 months ago
lexkoro / StyleTTS
☆11 Updated 2 years ago
thuhcsi / PortableTTS
☆12 Updated 2 years ago
cschaefer26 / StyleMelGAN
☆10 Updated last year
shengcanxu / canoSpeech
text to speech
☆10 Updated last year
b-sigpro / sed-hsmm
Onset-and-Offset-Aware Sound Event Detection
☆17 Updated 5 months ago
ryota-komatsu / speech_resynth
Speech Resynthesis and Language Modeling
☆20 Updated last month
arnabdas8901 / StarGAN-VC_PlusPlus
☆11 Updated last year
Infinity-INF / fast-phasr
Phonemes and durations labeling based on whisper small
☆11 Updated last year
ex3ndr / supervoice-vocoder
Production-ready vocoder using BigVSAN
☆11 Updated last year
sarulab-speech / multi-speaker-dgp
Official implementation of DGP-based multi-speaker speech synthesis with PyTorch
☆24 Updated 4 years ago
lexkoro / cfm-vc
☆11 Updated 4 months ago
y-chan / hifi-gan-misrnet
unofficial pytorch implementation of HiFi-GAN with fast MISR.
☆15 Updated 2 years ago
ex3ndr / supervoice-hybrid
My hybrid TTS network that combines VALL-E, VoiceBox, SpeechFlow, Seamless and TortoiseTTS into one
☆26 Updated 11 months ago
meaningTeam / tidy-tunes
Tidy Tunes is an easy-to-use pipeline for mining high-quality audio data for speech generation models. To do so, it chains multiple open …
☆21 Updated last week
madhavlab / wav2tok
Codebase for ICLR '23 paper "wav2tok: Deep Sequence Tokenizer for Audio Retrieval"
☆36 Updated last year
Aria-K-Alethia / speaking-rate-controllable-hifi-gan
☆16 Updated 3 years ago
Naozumi520 / g2pW-Cantonese
Cantonese Grapheme-to-Phoneme Converter based on GitYCC/g2pW
☆13 Updated 7 months ago
sarulab-speech / spatial_voice_conversion
Spatial Voice Conversion: Voice Conversion Preserving Spatial Information and Non-target Signals
☆17 Updated 11 months ago
karchkha / MelSpec_GPT_VQVAE
Audio Generation model working with GPT-2 and VQVAE compressed representation of MelSpectrograms
☆18 Updated last year
Plachtaa / ASTRAL-quantization
speaker-disentangled speech linguistic content quantizer
☆21 Updated 4 months ago
Respaired / RiFornet_Vocoder
a Neural Vocoder supporting Ring Attention, Conformer and NSF.
☆19 Updated 5 months ago
kyegomez / Audio-xLSTMs
Implementation of "Audio xLSTMs: Learning Self-supervised audio representations with xLSTMs" in PyTorch
☆18 Updated 2 weeks ago
amazon-science / iwslt-autodub-task
☆20 Updated last year
nii-yamagishilab / speaker_sex_attribute_privacy
Project for HIDING SPEAKER’S SEX IN SPEECH USING ZERO-EVIDENCE SPEAKER REPRESENTATION IN AN ANALYSIS/SYNTHESIS PIPELINE
☆15 Updated 2 years ago
zjlww / dsp
Digital Speech Processing in PyTorch.
☆14 Updated 2 years ago
reppy4620 / x-vits
☆13 Updated 8 months ago
iamanigeeit / present
☆13 Updated 11 months ago