LipNet with gluon
☆23Nov 22, 2022Updated 3 years ago
Alternatives and similar repositories for lipnet
Users that are interested in lipnet are comparing it to the libraries listed below
Sorting:
- "LipNet: End-to-End Sentence-level Lipreading" in PyTorch☆69Sep 9, 2019Updated 6 years ago
- Automated Lip Reading using Deep Reinforcement Learning☆32Jun 24, 2018Updated 7 years ago
- End-to-end pipeline for lip reading at the word level using a tensorflow CNN implementation.☆35Feb 15, 2020Updated 6 years ago
- The state-of-art PyTorch implementation of the method described in the paper "LipNet: End-to-End Sentence-level Lipreading" (https://arxi…☆235Sep 21, 2022Updated 3 years ago
- A replication of Google DeepMind's paper End-to-End Sentence-level Lipreading☆28Sep 26, 2017Updated 8 years ago
- Chinese words classification using lipnet with pytorch☆40Nov 18, 2019Updated 6 years ago
- Code and model for paper <Mutual Information Maximization for Effective Lip Reading>☆19Sep 4, 2020Updated 5 years ago
- lip_reading_demo_net☆32Oct 22, 2019Updated 6 years ago
- ☆25Oct 28, 2020Updated 5 years ago
- End to End Multiview Lip Reading☆10Jan 26, 2018Updated 8 years ago
- Pytorch code for End-to-End Audiovisual Speech Recognition☆184Nov 18, 2022Updated 3 years ago
- 🎮 Use a Raspberry Pi to control a LoPy over UART☆12Mar 9, 2017Updated 9 years ago
- A pipeline to read lips and generate speech for the read content, i.e Lip to Speech Synthesis.☆94Jul 23, 2025Updated 7 months ago
- ☆65Oct 8, 2018Updated 7 years ago
- Color Coherence Vector is a powerful color-based image retrieval (Matlab)☆11Feb 27, 2015Updated 11 years ago
- There are many studies done to detect anomalies based on logs. Current approaches are mainly divided into three categories: supervised le…☆10Jan 10, 2022Updated 4 years ago
- The implementation of 'Watch, Listen, Attend and Spell’ (WLAS) network that learns to transcribe videos of mouth motion to character on p…☆11Mar 23, 2018Updated 7 years ago
- Automated Lip reading from real-time videos in tensorflow in python☆164Mar 20, 2018Updated 8 years ago
- 这是一个Matlab代码,里面包括五种常见神经网络优化算法的对比。包括SGD、SGDM、Adagrad、AdaDelta、Adam☆11Mar 23, 2022Updated 3 years ago
- #DNN #CNN #LSTM #Classification #Sequential_data #Lip_reading☆28Jun 3, 2018Updated 7 years ago
- Dual cross modality attention audio-visual speech recognition model based on vgg transformer with hybrid CTC/attention architecture using…☆14Jul 2, 2020Updated 5 years ago
- ☆10Sep 19, 2022Updated 3 years ago
- ☆11Apr 12, 2024Updated last year
- ☆15Apr 27, 2017Updated 8 years ago
- Multimodal Speech Recognition for phoneme level prediction using Audio-Visual data from TCDTIMIT dataset implementing RNNs with LSTMs for…☆15Jul 27, 2023Updated 2 years ago
- Deep Learning Study with Gluon☆59Jun 3, 2018Updated 7 years ago
- This repository contains scripts for Human Activity Recognition (HAR) project☆15Jan 23, 2015Updated 11 years ago
- Jupyter notebooks and code for Intro to DL talk at Genesys☆14Aug 14, 2016Updated 9 years ago
- repository for converting SBD labels to SpaceNet version 2 labels☆12May 15, 2017Updated 8 years ago
- An instance segmentation challenge on Basketball images, with a particular focus on occlusion resolution. An opportunity to publish at MM…☆16Aug 8, 2023Updated 2 years ago
- ☆19Apr 1, 2022Updated 3 years ago
- Video Audio Translation Tool - automatically subtitles and dubs videos☆13Mar 16, 2020Updated 6 years ago
- Unofficial implement of "Pix2seq: A Language Modeling Framework for Object Detection" on mmdetection☆34Apr 18, 2022Updated 3 years ago
- deep multi-instance learning for rna protein binding prediction☆10May 21, 2017Updated 8 years ago
- ☆11Aug 6, 2019Updated 6 years ago
- End-To-End SpeechSynthesis system with knowledge distillation☆18Jul 16, 2022Updated 3 years ago
- Composable metric reporters in Python.☆14Jun 6, 2024Updated last year
- Lip Reading in the Wild using ResNet and LSTMs in PyTorch☆58Apr 23, 2018Updated 7 years ago
- ELECTRA MODEL NLP☆13Apr 8, 2020Updated 5 years ago