Using an LSTM and 4d convolutional network for lip reading
☆12May 11, 2018Updated 8 years ago
Alternatives and similar repositories for machine-lip-reading
Users that are interested in machine-lip-reading are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Automated Lip Reading using Deep Reinforcement Learning☆32Jun 24, 2018Updated 7 years ago
- Smartcrop, a multi-pass context-aware cropping tool☆11Jan 31, 2018Updated 8 years ago
- CNN for visual speech recognition☆23Dec 5, 2016Updated 9 years ago
- End to End Multiview Lip Reading☆10Jan 26, 2018Updated 8 years ago
- The official repo of Qwen-VL (通义千问-VL) chat & pretrained large vision language model proposed by Alibaba Cloud.☆17Jun 5, 2024Updated last year
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- create timer videos at any speed.☆15Sep 25, 2023Updated 2 years ago
- Hash Encoding, Point Cloud Reconstruction, Multi-view Reconstruction, CVM2023, (CVMJ)☆17Mar 12, 2024Updated 2 years ago
- AuScope Geomodels Portal back-end☆16May 19, 2026Updated last week
- ☆11Aug 22, 2018Updated 7 years ago
- Pytorch implementation of Light FlowNet☆15Jun 22, 2019Updated 6 years ago
- Official code of paper IntrinsicNGP☆15Sep 25, 2023Updated 2 years ago
- Code base for our paper " Adversarial Scene Editing: Automatic Object Removal from Weak Supervision" appearing in NIPS 2018.☆58Jan 2, 2019Updated 7 years ago
- For all kinds of textual analysis: literary, social media, surveys...☆32Nov 22, 2021Updated 4 years ago
- ☆10Jul 13, 2019Updated 6 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Basic watermarking libraries for images and videos with python 3.☆47Jan 10, 2018Updated 8 years ago
- Retinal image processing with python and opencv☆17May 18, 2019Updated 7 years ago
- Contains code for C3D, LCN and TSM for action recognition models.☆10May 31, 2020Updated 5 years ago
- An implementation of the Contrast Predictive Coding (CPC) method to train audio features in an unsupervised fashion.☆10Feb 22, 2022Updated 4 years ago
- Companion toolkit of the 'Serial Speakers' dataset.☆11Feb 17, 2020Updated 6 years ago
- A replication of Google DeepMind's paper End-to-End Sentence-level Lipreading☆28Sep 26, 2017Updated 8 years ago
- ☆10Feb 19, 2021Updated 5 years ago
- MutRex - A generator of fault detecting strings for regular expressions☆13Mar 18, 2024Updated 2 years ago
- A CUDA powered audio decoding framework for FLAC.☆11May 22, 2018Updated 8 years ago
- Open source password manager - Proton Pass • AdSecurely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
- Code for the paper: How Much Context Does My Attention-Based ASR System Need?☆11Updated this week
- ☆19Sep 18, 2021Updated 4 years ago
- Video action classification benchmark for common CNN architectures, implemented in PyTorch☆11Jan 31, 2022Updated 4 years ago
- ☆13May 9, 2022Updated 4 years ago
- Keras implementation of 'LipNet: End-to-End Sentence-level Lipreading'☆689Nov 22, 2022Updated 3 years ago
- ☆13Feb 25, 2025Updated last year
- The implementation of 'Watch, Listen, Attend and Spell’ (WLAS) network that learns to transcribe videos of mouth motion to character on p…☆11Mar 23, 2018Updated 8 years ago
- DrFAQ is a plug-and-play question answering NLP chatbot that can be generally applied to any organisation's text corpora.☆29Mar 12, 2022Updated 4 years ago
- ☆13Oct 25, 2024Updated last year
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Pretrain, finetune and deploy AI models on multiple GPUs, TPUs with zero code changes.☆13Feb 20, 2024Updated 2 years ago
- ☆11Nov 5, 2025Updated 6 months ago
- Twitter meets tik tok☆10Jul 25, 2020Updated 5 years ago
- My experiments in lip reading using deep learning with the LRW dataset☆54Mar 14, 2021Updated 5 years ago
- Ruby script to download bulk results from Archive.org's TV News database of closed captions☆14Mar 20, 2013Updated 13 years ago
- Text-based media editing interface☆16Aug 9, 2017Updated 8 years ago
- The implementation of g2pL with a new open dataset.☆16May 14, 2023Updated 3 years ago