sooftware / deepspeech2Links

PyTorch implementation of "Deep Speech 2: End-to-End Speech Recognition in English and Mandarin" (ICML, 2016)

☆23

Alternatives and similar repositories for deepspeech2

Users that are interested in deepspeech2 are comparing it to the libraries listed below

Sorting:

sooftware / speech-transformer
Transformer implementation speciaized in speech recognition tasks using Pytorch.
☆64Updated 3 years ago
sooftware / End-to-End-Speech-Recognition-Models
PyTorch implementation of automatic speech recognition models.
☆38Updated 4 years ago
sooftware / jasper
PyTorch implementation of "Jasper: An End-to-End Convolutional Neural Acoustic Model" (INTERSPEECH 2019)
☆32Updated 4 years ago
upskyy / ContextNet
PyTorch implementation of "ContextNet: Improving Convolutional Neural Networks for Automatic Speech Recognition with Global Context" (INT…
☆38Updated 3 years ago
sooftware / Naver-AI-Hackathon-Speech
2019 Clova AI Hackathon : Speech - Rank 12 / Team Kai.Lib
☆22Updated 4 years ago
sooftware / openspeech
Open-Source Toolkit for End-to-End Speech Recognition leveraging PyTorch-Lightning and Hydra.
☆35Updated 3 years ago
sooftware / Fairseq-Listen-Attend-Spell
A Fairseq implementation of Listen, Attend and Spell (LAS), an End-to-End ASR framework.
☆11Updated 4 years ago
speech-paper-reading / speech-paper-reading
Repository for speech paper reading
☆33Updated 3 years ago
zldzmfoq12 / VCtube
A pakage for crawling audio from Youtube
☆42Updated last year
upskyy / Automatic-Speech-Recognition-Models
End-to-End Korean Automatic Speech Recognition leveraging PyTorch and Hydra.
☆10Updated 3 years ago
sooftware / RNN-Transducer
PyTorch implementation of RNN-Transducer(RNN-T).
☆76Updated 4 years ago
seongmin-kye / meta-SR
Pytorch implementation of Meta-Learning for Short Utterance Speaker Recognition with Imbalance Length Pairs (Interspeech, 2020)
☆74Updated 4 years ago
dobby-seo / Pytorch-MHAtt-RNN-KWS
Multi-Head-Attention RNN pytorch implement for keyword spotting
☆21Updated 4 years ago
sooftware / seq2seq
PyTorch implementation of the RNN-based sequence-to-sequence architecture.
☆22Updated 4 years ago
ranchlai / speaker-verification
Speaker verification using ResnetSE (EER=0.0093) and ECAPA-TDNN
☆91Updated 3 years ago
upskyy / Squeezeformer
PyTorch implementation of "Squeezeformer: An Efficient Transformer for Automatic Speech Recognition" (NeurIPS 2022)
☆141Updated 2 years ago
JoungheeKim / K-wav2vec
☆85Updated 2 years ago
fd873630 / RNN-Transducer
RNN-Transducer for korean
☆42Updated 4 years ago
juanmc2005 / SpeakerEmbeddingLossComparison
Companion repository for the paper "A Comparison of Metric Learning Loss Functions for End-to-End Speaker Verification" published at SLSP…
☆60Updated 4 years ago
AI-Research-BD / Keyword-MLP
Official PyTorch implementation of "Attention-Free Keyword Spotting", Mashrur. M. Morshed & Ahmad Omar Ahsan, PML4DC @ ICLR 2022.
☆15Updated 2 years ago
kaistmm / Metric-UD-KWS
Official code for Metric learning for user-defined keyword spotting
☆31Updated last year
joonson / voxceleb_unsupervised
Augmentation adversarial training for self-supervised speaker recognition
☆77Updated 3 years ago
sooftware / tacotron2
Pytorch implementation of "Natural TTS Synthesis by Conditioning WaveNet on Mel Spectrogram Predictions", ICASSP, 2018.
☆19Updated 4 years ago
HanSeokhyeon / Deep_learning_for_Phoneme_recognition
다양한 feature와 deep learning을 이용한 Phoneme Recognition입니다.
☆13Updated 5 years ago
lightning830 / E2E-audio-speech-recognition
Conformer encoder + Transformer decoder with Hybrid CTC/attention
☆12Updated 3 years ago
ynop / py-ctc-decode
CTC Decoder implementation with python only. Also supports language model decoding using KenLM.
☆37Updated last year
sooftware / lightning-asr
Modular and extensible speech recognition library leveraging pytorch-lightning and hydra.
☆45Updated 4 years ago
dobby-seo / korean-speech-recognition-quartznet
Jasper 기반 양자화된 모델인 Quartznet 한국어 음성인식
☆21Updated 3 years ago
lorenlugosch / transducer-tutorial
Example code for a neural transducer model.
☆61Updated last year
skgusrb12 / voice_activity_detection
Pytorch version of Voice Activity Detection (VAD) based on Deep Learning (https://github.com/filippogiruzzi)
☆26Updated 4 years ago