Using an LSTM and 4d convolutional network for lip reading
☆12 · May 11, 2018 · Updated 7 years ago
Alternatives and similar repositories for machine-lip-reading
Users who are interested in machine-lip-reading are comparing it to the repositories listed below
- End to End Multiview Lip Reading ☆10 · Jan 26, 2018 · Updated 8 years ago
- A Keras implementation of LipNet ☆24 · Oct 30, 2018 · Updated 7 years ago
- A replication of Google DeepMind's paper End-to-End Sentence-level Lipreading ☆28 · Sep 26, 2017 · Updated 8 years ago
- ☆10 · Feb 19, 2021 · Updated 5 years ago
- ☆11 · Jan 20, 2017 · Updated 9 years ago
- ☆13 · Oct 25, 2024 · Updated last year
- A modular, scalable, fast and reliable phishing detection framework ☆11 · Dec 1, 2018 · Updated 7 years ago
- Twitter meets tik tok ☆10 · Jul 25, 2020 · Updated 5 years ago
- A module for normalising text. ☆10 · Nov 6, 2019 · Updated 6 years ago
- An implementation of the Contrast Predictive Coding (CPC) method to train audio features in an unsupervised fashion. ☆10 · Feb 22, 2022 · Updated 4 years ago
- My implementation (PyTorch) for the paper SST: Single-Stream Temporal Action Proposals (http://vision.stanford.edu/pdf/buch2017cvpr.pdf). ☆10 · Dec 8, 2022 · Updated 3 years ago
- ☆12 · Mar 24, 2024 · Updated last year
- ☆11 · Nov 5, 2025 · Updated 4 months ago
- Ruby script to download bulk results from Archive.org's TV News database of closed captions ☆14 · Mar 20, 2013 · Updated 12 years ago
- A video labeling platform for training classification algorithms. ☆15 · Mar 30, 2021 · Updated 4 years ago
- I worked through 100 Pandas Puzzles (actually only 45) and did some data visualizations with the 2010 Denver Census data. ☆12 · Dec 15, 2020 · Updated 5 years ago
- Events about the open source data stack ☆13 · Apr 16, 2022 · Updated 3 years ago
- Code for the paper: How Much Context Does My Attention-Based ASR System Need? ☆10 · Updated this week
- ☆12 · Mar 8, 2020 · Updated 6 years ago
- Official code of paper IntrinsicNGP ☆15 · Sep 25, 2023 · Updated 2 years ago
- ☆12 · Jun 3, 2016 · Updated 9 years ago
- The implementation of 'Watch, Listen, Attend and Spell' (WLAS) network that learns to transcribe videos of mouth motion to character on p… ☆11 · Mar 23, 2018 · Updated 7 years ago
- ☆10 · Jan 5, 2020 · Updated 6 years ago
- Distributed URL Engine for ClickHouse (public service) ☆15 · Jul 8, 2023 · Updated 2 years ago
- Pretrain, finetune and deploy AI models on multiple GPUs, TPUs with zero code changes. ☆13 · Feb 20, 2024 · Updated 2 years ago
- The History of Speech Recognition to the Year 2030 ☆13 · Aug 14, 2021 · Updated 4 years ago
- Video action classification benchmark for common CNN architectures, implemented in PyTorch ☆11 · Jan 31, 2022 · Updated 4 years ago
- (SLT 2024) Learning Video Temporal Dynamics with Cross-Modal Attention for Robust Audio-Visual Speech Recognition ☆13 · Oct 22, 2024 · Updated last year
- ☆10 · Jul 13, 2019 · Updated 6 years ago
- Peng et al. "RED-Net: A Recurrent Encoder–Decoder Network for Video-Based Face Alignment". IJCV, 2018. ☆12 · Jul 19, 2018 · Updated 7 years ago
- Code for the AVLnet (Interspeech 2021) and Cascaded Multilingual (Interspeech 2021) papers. ☆54 · Mar 30, 2022 · Updated 3 years ago
- ☆14 · Dec 29, 2019 · Updated 6 years ago
- Reproducing the Past: A Dataset for Benchmarking Inscription Restoration (ACM MM'24) ☆13 · Oct 15, 2025 · Updated 4 months ago
- WildVSR ☆21 · Dec 13, 2023 · Updated 2 years ago
- ☆12 · Dec 2, 2017 · Updated 8 years ago
- Speech Recognition Scoring Toolkit ☆13 · Sep 30, 2015 · Updated 10 years ago
- Support code for LAEO-Net paper ☆13 · Mar 24, 2021 · Updated 4 years ago
- Dead simple cron service for making HTTP calls on a regular schedule. ☆14 · Jul 11, 2020 · Updated 5 years ago
- Python Notebook for a workshop at Convercon Ireland 2019. The title is How to Curate an NLP Dataset and is about a process to find error… ☆13 · Feb 16, 2020 · Updated 6 years ago