niffler92 / viterbi-pythonLinks

Viterbi algorithm for Automatic Speech Recognition

☆9

Alternatives and similar repositories for viterbi-python

Users that are interested in viterbi-python are comparing it to the libraries listed below

Sorting:

ex3ndr / supervoice-gpt-facodec
GPT for FACodec
☆13Updated last year
speechnovateur / languagecodec_tmp
Temporary anonymous version
☆22Updated last year
desh2608 / kaldi
This is now the official location of the Kaldi project.
☆8Updated 3 years ago
yunyikristy / ttsGAN-ICLR2019
☆25Updated 6 years ago
cschaefer26 / StyleMelGAN
☆10Updated last year
AppleHolic / FastSpeech2
Refactored version of https://github.com/ming024/FastSpeech2
☆14Updated 3 years ago
rishikksh20 / Zero-Shot-TTS
Unofficial Implementation of Zero-Shot Text-to-Speech for Text-Based Insertion in Audio Narration
☆34Updated 3 years ago
xcmyz / Tacotron2-Pytorch
follow NVIDIA, simplify it and support data parallel.
☆13Updated 5 years ago
luomingshuang / k2-speechbrain
In this repository, I try to combine k2 with speechbrain to decode well and fastly.
☆16Updated 2 years ago
sarulab-speech / multi-speaker-dgp
Official implementation of DGP-based multi-speaker speech synthesis with PyTorch
☆24Updated 4 years ago
ivanvovk / compressed-tacotron2-pytorch
Compressed version of Tacotron 2 using Tensor Train + Waveglow.
☆22Updated 5 years ago
revsic / torch-whisper-guided-vc
Torch implementation of Whisper-guided DDPM based Voice Conversion
☆49Updated 2 years ago
wenet-e2e / WeSpeech-AI
Open Source Speech/Text Data on AI
☆18Updated 2 years ago
ndkgit339 / spe-dss
Speech Parameter Estimation Using Differentiable Speech Synthesizer
☆43Updated 2 years ago
hhguo / WaveRNN
Based on https://github.com/fatchord/WaveRNN
☆24Updated 5 years ago
ljuvela / GELP
☆26Updated 4 years ago
r9y9 / kiritan_singing
Labels for kiritan_singing data with extra resources for DNN-based singing voice synthesis (SVS) systems.
☆29Updated last year
choiHkk / FastSpeech2-cwt
with alignment learning and continuous wavelet transform
☆21Updated 2 years ago
vliu15 / adversarial-tts
End-to-end Text-to-Speech with Generative Adversarial Networks
☆20Updated 4 years ago
seungheondoh / hi_kia
wake-up word emotion recognition [APSIPA 2022]
☆17Updated 2 years ago
samsad35 / source-filter-vae
Learning and controlling the source-filter representation of speech with a variational autoencoder
☆45Updated 2 years ago
rishikksh20 / UnivNet-pytorch
UnivNet: A Neural Vocoder with Multi-Resolution Spectrogram Discriminators for High-Fidelity Waveform Generation
☆74Updated 3 years ago
NeuroWave-ai / CUCVAE-TTS
☆25Updated 3 years ago
SpeechColab / PySpeechColab
A library of speech gadgets.
☆13Updated 2 years ago
Aria-K-Alethia / speaking-rate-controllable-hifi-gan
☆16Updated 3 years ago
rhoposit / icassp2021
☆15Updated 4 years ago
Edresson / GE2E-Speaker-Encoder
GE2E Speaker Encoder - Generalized End-To-End Loss for Speaker Verification
☆13Updated 5 years ago
andrebola / contrastive-mir-learning
This repo contains the code to reproduce the paper: "Enriched Music Representations with Multiple Cross-modal Contrastive Learning"
☆15Updated last year
rishikksh20 / iSTFT-Avocodo-pytorch
Ultrafast GAN based Vocoder for Text to Speech
☆50Updated 2 years ago
ajinkyakulkarni14 / ERISHA
ERISHA is a mulitilingual multispeaker expressive speech synthesis framework. It can transfer the expressivity to the speaker's voice for…
☆43Updated 4 years ago