hgstudent / las

TensorFlow 2.0 implementation of Listen, Attend and Spell

☆21

Related projects:

KimJeongSun / SpecAugment_numpy_scipy
fast SpecAugmentation code with numpy and scipy
☆29 Updated 5 years ago
Kyubyong / specAugment
Tensor2tensor experiment with SpecAugment
☆47 Updated 5 years ago
foamliu / Listen-Attend-Spell-v2
PyTorch implementation of Listen Attend and Spell Automatic Speech Recognition (ASR).
☆38 Updated 5 years ago
juanmc2005 / torch-plda
PyTorch implementation of PLDA as described in https://ravisoji.com/assets/papers/ioffe2006probabilistic.pdf
☆15 Updated 3 years ago
TParcollet / E2E-SincNet
E2E-SincNet: Toward fully end-to-end speech recognition
☆29 Updated 4 years ago
YoungloLee / tf2-speech-recognition-transformer
Tensorflow 2 Speech Recognition Code (Transformer)
☆25 Updated 4 years ago
TeaPoly / Conformer-Athena
Dynamic Chunk Streaming and Offline Conformer based on athena-team/Athena.
☆43 Updated last year
1ytic / warp-rna
Recurrent Neural Aligner
☆49 Updated 4 years ago
MarkWuNLP / SemanticMask
The repo contains our code for "Semantic Mask for Transformer based End-to-End Speech Recognition"
☆37 Updated 4 years ago
joonson / voxsrc_2019
VoxSRC Challenge
☆31 Updated 5 years ago
LeBenchmark / Interspeech2021
This repository describes our reproducible framework for assessing self-supervised representation learning from speech
☆51 Updated 2 years ago
danFromTelAviv / key_words_spotting
Implementation of "Efficient Keyword Spotting Using Dilated Convolutions and Gating"
☆35 Updated 4 years ago
wangkenpu / rsrgan
Robust Speech Recognition Using Generative Adversarial Networks (GAN)
☆58 Updated 4 years ago
lifelongeek / AAS_enhancement
This repository contains the code and supplementary result for the paper "Unpaired Speech Enhancement by Acoustic and Adversarial Supervi…
☆28 Updated 4 years ago
HaoranMiao / streaming-attention
Streaming attention networks for end-to-end automatic speech recognition
☆55 Updated 4 years ago
KrishnaDN / Keyword-Transformer
Implementation of the paper "Keyword Transformer: A Self-Attention Model for Keyword Spotting"
☆23 Updated 3 years ago
YiwenShaoStephen / pychain_example
☆48 Updated 3 years ago
TeaPoly / CTC-OptimizedLoss
Computes the MWER (minimum WER) Loss with CTC beam search. Knowledge distillation for CTC loss.
☆56 Updated last year
Dannynis / xvector_pytorch
A PyTorch implementation of x-vector embeddings
☆78 Updated 4 years ago
iamjanvijay / rnnt
An implementation of RNN-Transducer loss in TF-2.0.
☆45 Updated last year
DemisEom / RNNT-pytorch
Implementation of an RNN transducer
☆20 Updated 5 years ago
iamjanvijay / rnnt_decoder_cuda
An efficient implementation of RNN-T Prefix Beam Search in C++/CUDA.
☆65 Updated 3 years ago
vinayak19th / ASR-Low-Resource
A Kaldi/ESPnet-based approach to automatic speech recognition for low-resource languages
☆9 Updated 3 years ago
charlesliucn / awesome-end2end-speech-recognition
💬 A list of end-to-end speech recognition resources, including papers, code, and other materials
☆53 Updated 5 years ago
cornerfarmer / ctc_segmentation
Segment a given audio into utterances using a trained end-to-end ASR model.
☆73 Updated 3 years ago
JRMeyer / multi-task-kaldi
An example directory for running Multi-Task Learning training on Kaldi neural networks. In Kaldi-speak, this is an egs dir for nnet3 trai…
☆54 Updated 4 years ago
xingchensong / Speech-Transformer-tf2.0
Transformer for ASR systems (via TensorFlow 2.0)
☆113 Updated 5 years ago
ynop / py-ctc-decode
CTC decoder implementation in pure Python. Also supports language model decoding using KenLM.
☆36 Updated 4 months ago
jonasvdd / TDNN
PyTorch implementation of a Time Delay Neural Network (TDNN)
☆40 Updated 5 years ago
bond005 / vad
Various algorithms for voice activity detection
☆22 Updated 7 years ago