shehzeen / NeMoLinks

NeMo: a toolkit for conversational AI

☆9

Alternatives and similar repositories for NeMo

Users who are interested in NeMo are comparing it to the repositories listed below.

simplespeech / simplespeechDemo
☆8 Updated 11 months ago
adelacvg / DPTTS
An AR+AR TTS attempt.
☆16 Updated 6 months ago
mushanshanshan / ESLTTS
ESLTTS dataset
☆16 Updated 6 months ago
CODEJIN / XiaoiceSing2
☆19 Updated 2 years ago
xcmyz / CLONE
☆20 Updated 3 years ago
reppy4620 / vocoders
My vocoder experiments
☆30 Updated 2 weeks ago
ex3ndr / supervoice-hybrid
My hybrid TTS network that combines VALL-E, VoiceBox, SpeechFlow, Seamless, and TortoiseTTS into one
☆26 Updated last year
Tikai7 / DiTTO-TTS
DiTTo-TTS: Diffusion Transformers for Scalable Text-to-Speech without Domain-Specific Factors
☆28 Updated 6 months ago
Adibian / ResGrad
Unofficial implementation of ResGrad: Residual Denoising Diffusion Probabilistic Models for Text to Speech
☆18 Updated 6 months ago
declare-lab / HyperTTS
☆36 Updated last year
naver-ai / RapFlow-TTS
☆38 Updated 3 weeks ago
Infinity-INF / fast-phasr
Phoneme and duration labeling based on Whisper small
☆11 Updated last year
yoongi43 / VRVQ
Implementation of the paper "Variable Bitrate Residual Vector Quantization for Audio Coding"
☆11 Updated 4 months ago
MaxMax2016 / max-vc
Singing voice conversion without F0
☆23 Updated 2 years ago
p1an-lin-jung / wv_tts
☆19 Updated last year
vtuber-plan / FlowVAE
☆15 Updated last year
ORI-Muchim / BEGANSing
BEGANSing - Korean SVS + SVC + AudioSR
☆11 Updated last year
rishikksh20 / NU-Wave2-pytorch
NU-Wave 2: A General Neural Audio Upsampling Model for Various Sampling Rates [WIP]
☆25 Updated 3 years ago
shivammehta25 / Diff-TTSG
Diff-TTSG: Denoising probabilistic integrated speech and gesture synthesis
☆39 Updated last year
shivammehta25 / BetterFastSpeech2
Just another FastSpeech 2 but cleaner code :)
☆26 Updated last year
MaxMax2016 / Glow-SVC
4G GPU & 10 Minutes for train
☆12 Updated 2 years ago
PlayVoice / VI-Speaker
Speaker embedding for VI-SVC and VI-SVS, also for VITS; use this to replace the ID to implement voice cloning.
☆30 Updated 2 years ago
choiHkk / Transformer-TTS-V2
☆25 Updated last year
uthree / ddsp-vocoder
☆11 Updated 9 months ago
ex3ndr / supervoice-gpt
GPT-style network for phonemization with durations of text
☆67 Updated last year
y-chan / hifi-gan-misrnet
Unofficial PyTorch implementation of HiFi-GAN with fast MISR.
☆15 Updated 2 years ago
SonyResearch / diffvox
Accompanying repository for the paper "DiffVox: A Differentiable Model for Capturing and Analysing Professional Effects Distributions"
☆29 Updated 3 weeks ago
voidful / vall-e-encodec
☆41 Updated 2 years ago
anton-kashkin / hifi_vc
☆25 Updated 2 years ago
huutuongtu / Lightvoc
LightVoc: An Upsampling-Free GAN Vocoder Based on Conformer and Inverse Short-Time Fourier Transform
☆18 Updated last year