lzuwei/end-to-end-multiview-lipreading

End to End Multiview Lip Reading

☆10

Alternatives and similar repositories for end-to-end-multiview-lipreading

Users interested in end-to-end-multiview-lipreading are comparing it to the repositories listed below.

georgesterpu / pyVSR
Python toolkit for Visual Speech Recognition
☆37 · Jun 10, 2020 · Updated 6 years ago
lzuwei / ip-avsr
Audio Visual Speech Recognition
☆23 · Aug 9, 2017 · Updated 8 years ago
artem179 / WLAS
The implementation of the 'Watch, Listen, Attend and Spell' (WLAS) network that learns to transcribe videos of mouth motion to character on p…
☆11 · Mar 23, 2018 · Updated 8 years ago
SSahuDS / Lipreading-Using-Mutimodal-Speech-Recognition
Multimodal Speech Recognition for phoneme level prediction using Audio-Visual data from TCDTIMIT dataset implementing RNNs with LSTMs for…
☆15 · Jul 27, 2023 · Updated 3 years ago
LeeYongHyeok / DCM_vgg_transformer
Dual cross modality attention audio-visual speech recognition model based on vgg transformer with hybrid CTC/attention architecture using…
☆14 · Jul 2, 2020 · Updated 6 years ago
mpc001 / end-to-end-lipreading
PyTorch code for End-to-End Audiovisual Speech Recognition
☆183 · Nov 18, 2022 · Updated 3 years ago
DinoMan / face-processor
Aligns faces to the canonical face in both videos and images
☆17 · Apr 11, 2022 · Updated 4 years ago
xing96 / MIM-lipreading
Code and model for the paper "Mutual Information Maximization for Effective Lip Reading"
☆19 · Sep 4, 2020 · Updated 5 years ago
afperezm / acoustic-images-distillation
Code for the paper: Audio-Visual Model Distillation Using Acoustic Images
☆21 · Mar 24, 2023 · Updated 3 years ago
lelechen63 / 3d_gan
☆34 · Jul 25, 2018 · Updated 8 years ago
ms-dot-k / Visual-Audio-Memory
PyTorch implementation of "Multi-modality Associative Bridging through Memory: Speech Sound Recollected from Face Video" (ICCV2021)
☆22 · Apr 11, 2022 · Updated 4 years ago
ajinkyaT / Lip_Reading_in_the_Wild_AVSR
Audio-Visual Speech Recognition using Deep Learning
☆61 · Nov 14, 2018 · Updated 7 years ago
eastonYi / end-to-end_asr_pytorch
Implementations of CTC, Speech-Transformer and CIF for end-to-end speech recognition with PyTorch
☆23 · Jul 28, 2020 · Updated 6 years ago
georgesterpu / avsr-tf1
Audio-Visual Speech Recognition using Sequence to Sequence Models
☆84 · Jul 10, 2020 · Updated 6 years ago
Bucknalla / lopy-raspberrypi
🎮 Use a Raspberry Pi to control a LoPy over UART
☆12 · Mar 9, 2017 · Updated 9 years ago
prajwalkr / transpotter
Official implementation of Transpotter, published in BMVC 2021
☆16 · Aug 6, 2022 · Updated 3 years ago
wuyinwuxian / Neural_Network_optimization_method
MATLAB code comparing five common neural network optimization algorithms: SGD, SGDM, Adagrad, AdaDelta, and Adam.
☆11 · Mar 23, 2022 · Updated 4 years ago
jarret / raspi-uart-waveshare
A library for interfacing with the 4.3inch UART e-Paper from a Raspberry Pi 2/3 via Python3 with example programs to display QR Codes for…
☆12 · Mar 9, 2019 · Updated 7 years ago
TarekVito / ColorCoherenceVector
Color Coherence Vector is a powerful color-based image retrieval method (Matlab)
☆11 · Feb 27, 2015 · Updated 11 years ago
lshiwjx / deformable-3d-convnets
Deformable 3D ConvNets for Action Recognition
☆10 · Jan 21, 2018 · Updated 8 years ago
saschaschramm / MonteCarloTreeSearch
This project applies Monte Carlo Tree Search (MCTS) to a simple grid world.
☆10 · May 30, 2018 · Updated 8 years ago
simgunz / viterbi-decoder
A matlab+mex implementation of a convolutional encoder and a Viterbi decoder
☆13 · May 1, 2012 · Updated 14 years ago
klauscc / lipnet-replication
A replication of Google DeepMind's paper End-to-End Sentence-level Lipreading
☆28 · Sep 26, 2017 · Updated 8 years ago
Li-Sanze / ID-Card
Given the front and back of an ID card, recognizes all text information on the card
☆10 · Sep 4, 2019 · Updated 6 years ago
tsiangleo / TensorFlowMnist
☆15 · Apr 27, 2017 · Updated 9 years ago
matthijsvk / multimodalSR
Multimodal speech recognition using lipreading (with CNNs) and audio (using LSTMs). Sensor fusion is done with an attention network.
☆69 · Nov 19, 2022 · Updated 3 years ago
smeetrs / deep_avsr
A PyTorch implementation of the Deep Audio-Visual Speech Recognition paper.
☆244 · Feb 15, 2024 · Updated 2 years ago
lilianemomeni / KWS-Net
Seeing Wake Words: Audio-visual Keyword Spotting
☆67 · Sep 16, 2020 · Updated 5 years ago
ski-net / lipnet
LipNet with gluon
☆23 · Nov 22, 2022 · Updated 3 years ago
itsyoavshalev / End-to-End-Lip-Synchronization-with-a-Temporal-AutoEncoder
☆22 · Mar 31, 2022 · Updated 4 years ago
WisleyWang / DC-AI-LipReading
☆11 · May 31, 2020 · Updated 6 years ago
ljw20155136 / Lip-reading-by-CNN-and-LSTM-architecture
#DNN #CNN #LSTM #Classification #Sequential_data #Lip_reading
☆28 · Jun 3, 2018 · Updated 8 years ago
luanshiyinyang / ChineseOCR
End-to-end Chinese scene text recognition.
☆12 · Jun 27, 2022 · Updated 4 years ago
Lenvia / RBM-BP-character-recognition
Recognizes handwritten digits and English characters with an RBM+BP neural network
☆11 · Mar 25, 2023 · Updated 3 years ago
danisbet / machine-lip-reading
Using an LSTM and 4d convolutional network for lip reading
☆12 · May 11, 2018 · Updated 8 years ago
arielephrat / vid2speech
Code for "Vid2speech: Speech Reconstruction from Silent Video" ICASSP '17
☆115 · Feb 15, 2017 · Updated 9 years ago
ichn-hu / DSP-Audio-Collector
Web app created to collect audios for course project
☆10 · Apr 6, 2018 · Updated 8 years ago
Zhong-master / PocketSphinx_Speech_Recognition
PocketSphinx_Speech_Recognition
☆10 · Aug 5, 2021 · Updated 4 years ago
Boyu1997 / mcts-travel-salesman
Monte Carlo tree search (MCTS) on traveling salesman problem (TSP)
☆22 · Apr 27, 2019 · Updated 7 years ago