artem179 / WLASLinks

The implementation of 'Watch, Listen, Attend and Spell’ (WLAS) network that learns to transcribe videos of mouth motion to character on pytorch.

☆11

Alternatives and similar repositories for WLAS

Users that are interested in WLAS are comparing it to the libraries listed below

Sorting:

kefirski / pytorch_TDNN
Time Delayed NN implemented in pytorch
☆81Updated 8 years ago
TomVeniat / SANAS
Stochastic Adaptive Neural Architecture Search
☆65Updated 6 years ago
pandeydivesh15 / AVSR-Deep-Speech
Google Summer of Code 2017 Project: Development of Speech Recognition Module for Red Hen Lab
☆45Updated 7 years ago
ankitshah009 / WALNet-Weak_Label_Analysis
Repository for Weak Label Learning for Audio Events - A closer look. Uses Audioset subset data provided for reproducibility.
☆32Updated last year
craffel / mocha
Example implementation of Monotonic Chunkwise Attention.
☆52Updated 7 years ago
Yolanda-Gao / VoiceGAN
These are the results for VoiceGAN voice transformation. You can hear the audios which are in folder A-AB-ABA/B-BA-BAB
☆50Updated 6 years ago
geyang / char2wav_pytorch
pytorch implementation of lyre.ai's char2wav model
☆32Updated 8 years ago
swshon / dialectID_siam
Dialect identification using Siamese network
☆15Updated 7 years ago
lzuwei / end-to-end-multiview-lipreading
End to End Multiview Lip Reading
☆10Updated 7 years ago
transfer-learning-asr / transfer-learning-asr
Source code for 'Transfer Learning for Speech Recognition on a Budget' published at ACL 2017
☆46Updated 8 years ago
ttsunion / Deep-Expression
An Attention Based Open-Source End to End Speech Synthesis Framework, No CNN, No RNN, No MFCC!!!
☆85Updated 4 years ago
dhgrs / pytorch-UniWaveNet
☆31Updated 6 years ago
AppleHolic / audioset_augmentor
Sound augmentation using Large-scale audio dataset (Audioset)
☆45Updated 4 years ago
PiotrTa / Huawei-Challenge-Speaker-Identification
Trained speaker embedding deep learning models and evaluation pipelines in pytorch and tesorflow for speaker recognition.
☆36Updated 5 years ago
erogol / FFTNet
FFTNet vocoder implementation
☆81Updated 6 years ago
bioidiap / bob.bio.spear
Run speaker recognition algorithms - Mirrored from https://gitlab.idiap.ch/bob/bob.bio.spear
☆19Updated 2 years ago
mravanelli / theano-kaldi-rnn
THEANO-KALDI-RNNs is a project implementing various Recurrent Neural Networks (RNNs) for RNN-HMM speech recognition. The Theano Code is c…
☆33Updated 7 years ago
PengdaLiu / LAS-SpeechRecognition
Listen, Attend and Spell (LAS) framework for speech recognition (see https://arxiv.org/pdf/1508.01211.pdf).
☆32Updated 6 years ago
ajinkyaT / Lip_Reading_in_the_Wild_AVSR
Audio-Visual Speech Recognition using Deep Learning
☆60Updated 6 years ago
cdyangbo / end2endASR
implement end-to-end asr algorithm with tensorflow
☆40Updated 6 years ago
bootphon / features_extraction
audio cfeatures extraction tool from wav to h5features format
☆19Updated 6 years ago
hbredin / TristouNet
TristouNet: Triplet Loss for Speaker Turn Embedding
☆123Updated 8 years ago
arielephrat / vid2speech
Code for "Vid2speech: Speech Reconstruction from Silent Video" ICASSP '17
☆116Updated 8 years ago
nii-yamagishilab / tacotron2
An implementation of Tacotron and Tacotron2
☆81Updated 3 years ago
kaituoxu / Tacotron2
A PyTorch implementation of Tacotron2, an end-to-end text-to-speech(TTS) system described in "Natural TTS Synthesis By Conditioning Waven…
☆52Updated 6 years ago
artbataev / end2end
Losses and decoders for end-to-end ASR and OCR
☆34Updated 4 years ago
LearnedVector / Wav2Letter
Speech Recognition model based off of FAIR research paper built using Pytorch.
☆84Updated 6 years ago
gorinars / dcase16-cnn
Sound event detection in real life audio with CNN submitted to DCASE16
☆22Updated 3 years ago
jcsilva / deep-clustering
☆70Updated 8 years ago
mravanelli / pytorch_MLP_for_ASR
This code implements a basic MLP for speech recognition. The MLP is trained with pytorch, while feature extraction, alignments, and dec…
☆38Updated 7 years ago