espnet/interspeech2019-tutorial

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/espnet/interspeech2019-tutorial)

espnet / interspeech2019-tutorial

INTERSPEECH 2019 Tutorial Materials

☆194

Alternatives and similar repositories for interspeech2019-tutorial

Users that are interested in interspeech2019-tutorial are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

espnet / icassp2020-tts
View on GitHub
ESPnet-TTS Audio Sample HP
☆21Oct 25, 2019Updated 6 years ago
kan-bayashi / INTERSPEECH19_TUTORIAL
View on GitHub
Interspeech 2019 tutorial materials
☆49Sep 26, 2019Updated 6 years ago
jzlianglu / pykaldi2
View on GitHub
Yet another speech toolkit based on Kaldi and PyTorch
☆173Jul 1, 2020Updated 6 years ago
cywang97 / StreamingTransformer
View on GitHub
☆277Jan 15, 2021Updated 5 years ago
r9y9 / icassp2020-espnet-tts-merlin-baseline
View on GitHub
ICASSP 2020 ESPnet-TTS: Merlin baseline system
☆37Oct 28, 2019Updated 6 years ago
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
YiwenShaoStephen / pychain
View on GitHub
PyTorch implementation of LF-MMI for End-to-end ASR
☆221Jan 14, 2021Updated 5 years ago
npuichigo / waveglow
View on GitHub
A PyTorch implementation of the WaveGlow: A Flow-based Generative Network for Speech Synthesis
☆205Nov 6, 2018Updated 7 years ago
kaituoxu / Speech-Transformer
View on GitHub
A PyTorch implementation of Speech Transformer, an End-to-End ASR with Transformer network on Mandarin Chinese.
☆810Apr 6, 2023Updated 3 years ago
nii-yamagishilab / TSNetVocoder
View on GitHub
☆42Oct 30, 2018Updated 7 years ago
luomingshuang / k2-speechbrain
View on GitHub
In this repository, I try to combine k2 with speechbrain to decode well and fastly.
☆16Jun 17, 2022Updated 4 years ago
tuan3w / cnn_vocoder
View on GitHub
A fast cnn-based vocoder
☆78Jun 11, 2020Updated 6 years ago
freewym / espresso
View on GitHub
Espresso: A Fast End-to-End Neural Speech Recognition Toolkit
☆939Sep 4, 2024Updated last year
espnet / espnet
View on GitHub
End-to-End Speech Processing Toolkit
☆9,897Updated this week
hirofumi0810 / neural_sp
View on GitHub
End-to-end ASR/LM implementation with PyTorch
☆594Aug 30, 2021Updated 4 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
mravanelli / pytorch-kaldi
View on GitHub
pytorch-kaldi is a project for developing state-of-the-art DNN/RNN hybrid speech recognition systems. The DNN part is managed by pytorch,…
☆2,398Mar 14, 2022Updated 4 years ago
nttcslab-sp / kaldiio
View on GitHub
A pure python module for reading and writing kaldi ark files
☆268Mar 6, 2025Updated last year
laboroai / LaboroTVSpeech
View on GitHub
☆90Mar 5, 2021Updated 5 years ago
Helsinki-NLP / prosody
View on GitHub
Helsinki Prosody Corpus and A System for Predicting Prosodic Prominence from Text
☆249Oct 30, 2019Updated 6 years ago
huiw39 / ExtensibleTTS-PyTorch
View on GitHub
An extensible speech synthesis system, build with PyTorch and the original code is from r9y9's https://github.com/r9y9/nnmnkwii_gallery
☆26Jun 24, 2019Updated 7 years ago
xiph / LPCNet
View on GitHub
Efficient neural speech synthesis
☆1,218Sep 21, 2024Updated last year
ksw0306 / ClariNet
View on GitHub
A Pytorch Implementation of ClariNet
☆293Aug 5, 2019Updated 6 years ago
bshall / UniversalVocoding
View on GitHub
A PyTorch implementation of "Robust Universal Neural Vocoding"
☆238Nov 14, 2020Updated 5 years ago
jxzhanggg / nonparaSeq2seqVC_code
View on GitHub
Implementation code of non-parallel sequence-to-sequence VC
☆248Mar 24, 2023Updated 3 years ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
open-speech / kaldi-io
View on GitHub
c++ Kaldi IO lib (static and dynamic).
☆25Nov 26, 2018Updated 7 years ago
wenet-e2e / speech-recognition-papers
View on GitHub
Towards hot directions in industrial end to end speech recognition
☆329Nov 30, 2021Updated 4 years ago
mkotha / WaveRNN
View on GitHub
A WaveRNN implementation
☆201Oct 14, 2019Updated 6 years ago
oxinabox / Kaldi-Notes
View on GitHub
Some notes on Kaldi
☆32Feb 20, 2015Updated 11 years ago
G-Wang / WaveRNN-Pytorch
View on GitHub
Fatcord's Alternative WaveRNN (Faster training)
☆125Mar 29, 2019Updated 7 years ago
YoavRamon / awesome-kaldi
View on GitHub
This is a list of features, scripts, blogs and resources for better using Kaldi ( http://kaldi-asr.org/ )
☆536Feb 9, 2022Updated 4 years ago
syang1993 / FFTNet
View on GitHub
A PyTorch implementation of the FFTNet: a Real-Time Speaker-Dependent Neural Vocoder
☆94Jul 17, 2018Updated 8 years ago
hitachi-speech / EEND
View on GitHub
End-to-End Neural Diarization
☆435Aug 30, 2021Updated 4 years ago
speechio / BigCiDian
View on GitHub
Pronunciation lexicon covering both English and Chinese languages for Automatic Speech Recognition.
☆263Oct 11, 2019Updated 6 years ago
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
tianrengao / SqueezeWave
View on GitHub
☆254Jul 6, 2023Updated 3 years ago
facebookresearch / CPC_audio
View on GitHub
An implementation of the Contrast Predictive Coding (CPC) method to train audio features in an unsupervised fashion.
☆373Oct 12, 2021Updated 4 years ago
athena-team / athena-decoder
View on GitHub
☆76Mar 18, 2022Updated 4 years ago
aishell-foundation / DaCiDian
View on GitHub
DaCiDian is an open-sourced chinese mandarin lexicon for automatic speech recognition(ASR)
☆301Jun 15, 2020Updated 6 years ago
nii-yamagishilab / multi-speaker-tacotron
View on GitHub
VCTK multi-speaker tacotron for ICASSP 2020
☆266Mar 29, 2022Updated 4 years ago
zcaceres / spec_augment
View on GitHub
🔦 A Pytorch implementation of GoogleBrain's SpecAugment: A Simple Data Augmentation Method for Automatic Speech Recognition
☆501Jun 11, 2021Updated 5 years ago
yanggeng1995 / FB-MelGAN
View on GitHub
A pytroch implementation of the FB-MelGAN
☆90May 26, 2020Updated 6 years ago