danisbet / machine-lip-reading
Using an LSTM and 4d convolutional network for lip reading
☆12 · May 11, 2018 · Updated 7 years ago
Alternatives and similar repositories for machine-lip-reading
Users who are interested in machine-lip-reading are comparing it to the libraries listed below.
- Automated Lip Reading using Deep Reinforcement Learning ☆32 · Jun 24, 2018 · Updated 7 years ago
- End to End Multiview Lip Reading ☆10 · Jan 26, 2018 · Updated 8 years ago
- A Keras implementation of LipNet ☆24 · Oct 30, 2018 · Updated 7 years ago
- A replication of Google DeepMind's paper End-to-End Sentence-level Lipreading ☆28 · Sep 26, 2017 · Updated 8 years ago
- A CUDA powered audio decoding framework for FLAC. ☆11 · May 22, 2018 · Updated 7 years ago
- ☆10 · Feb 19, 2021 · Updated 4 years ago
- ☆11 · Jan 20, 2017 · Updated 9 years ago
- ☆13 · Oct 25, 2024 · Updated last year
- Ruby script to download bulk results from Archive.org's TV News database of closed captions ☆14 · Mar 20, 2013 · Updated 12 years ago
- ☆12 · Mar 24, 2024 · Updated last year
- ☆11 · Nov 5, 2025 · Updated 3 months ago
- My implementation (PyTorch) for the paper SST: Single-Stream Temporal Action Proposals (http://vision.stanford.edu/pdf/buch2017cvpr.pdf). ☆10 · Dec 8, 2022 · Updated 3 years ago
- Enterprise Solution for Text Classification (using BERT) ☆10 · Dec 26, 2022 · Updated 3 years ago
- A module for normalising text. ☆10 · Nov 6, 2019 · Updated 6 years ago
- Distributed URL Engine for ClickHouse (public service) ☆15 · Jul 8, 2023 · Updated 2 years ago
- The implementation of 'Watch, Listen, Attend and Spell' (WLAS) network that learns to transcribe videos of mouth motion to character on p… ☆11 · Mar 23, 2018 · Updated 7 years ago
- MutRex - A generator of fault detecting strings for regular expressions ☆12 · Mar 18, 2024 · Updated last year
- The History of Speech Recognition to the Year 2030 ☆13 · Aug 14, 2021 · Updated 4 years ago
- Simple tool to import/export Elasticsearch indices into a file, and/or reshard an index ☆19 · Jan 25, 2022 · Updated 4 years ago
- ☆12 · Mar 8, 2020 · Updated 5 years ago
- ☆12 · Jun 3, 2016 · Updated 9 years ago
- Hash Encoding, Point Cloud Reconstruction, Multi-view Reconstruction, CVM2023, (CVMJ) ☆17 · Mar 12, 2024 · Updated last year
- ☆13 · Feb 25, 2025 · Updated 11 months ago
- Pretrain, finetune and deploy AI models on multiple GPUs, TPUs with zero code changes. ☆13 · Feb 20, 2024 · Updated last year
- ☆13 · May 9, 2022 · Updated 3 years ago
- (SLT 2024) Learning Video Temporal Dynamics with Cross-Modal Attention for Robust Audio-Visual Speech Recognition ☆13 · Oct 22, 2024 · Updated last year
- Peng et al. "RED-Net: A Recurrent Encoder–Decoder Network for Video-Based Face Alignment". IJCV, 2018. ☆12 · Jul 19, 2018 · Updated 7 years ago
- ☆10 · Jan 5, 2020 · Updated 6 years ago
- Speech Recognition Scoring Toolkit ☆13 · Sep 30, 2015 · Updated 10 years ago
- ☆12 · Dec 2, 2017 · Updated 8 years ago
- Reproducing the Past: A Dataset for Benchmarking Inscription Restoration (ACM MM'24) ☆13 · Oct 15, 2025 · Updated 4 months ago
- [🏆 IJCV 2025 & ACCV 2024 Best Paper Honorable Mention] Official pytorch implementation of the paper "High-Quality Visually-Guided Sound … ☆28 · Nov 1, 2025 · Updated 3 months ago
- Complete Web Scraping of TED.com for Metadata, Transcript, Audio, Video, Images using Parallel Programming ☆11 · Jun 25, 2020 · Updated 5 years ago
- Python Notebook for a workshop at Convercon Ireland 2019. The title is How to Curate and NLP Dataset and is about a process to find error… ☆13 · Feb 16, 2020 · Updated 5 years ago
- Support code for LAEO-Net paper ☆13 · Mar 24, 2021 · Updated 4 years ago
- Dead simple cron service for making HTTP calls on a regular schedule. ☆14 · Jul 11, 2020 · Updated 5 years ago
- Contains code for C3D, LCN and TSM for action recognition models. ☆10 · May 31, 2020 · Updated 5 years ago
- Reverse engineer patterns for use with SpaCy's DependencyMatcher ☆36 · Feb 8, 2020 · Updated 6 years ago
- Virtual patent marking crawler at iproduct.epfl.ch ☆15 · Sep 13, 2017 · Updated 8 years ago