LipNet with Gluon
☆23Nov 22, 2022Updated 3 years ago
Alternatives and similar repositories for lipnet
Users that are interested in lipnet are comparing it to the libraries listed below.
- End-to-end pipeline for lip reading at the word level using a tensorflow CNN implementation.☆35Feb 15, 2020Updated 6 years ago
- A replication of Google DeepMind's paper End-to-End Sentence-level Lipreading☆28Sep 26, 2017Updated 8 years ago
- Chinese word classification using LipNet with PyTorch☆40Nov 18, 2019Updated 6 years ago
- Code and model for the paper "Mutual Information Maximization for Effective Lip Reading"☆19Sep 4, 2020Updated 5 years ago
- lip_reading_demo_net☆32Oct 22, 2019Updated 6 years ago
- ☆25Oct 28, 2020Updated 5 years ago
- End to End Multiview Lip Reading☆10Jan 26, 2018Updated 8 years ago
- Local File Inclusion (LFI) in FHEM 6.0 allows an attacker to include a file, which can lead to sensitive information disclosure.☆12Jan 20, 2021Updated 5 years ago
- Pytorch code for End-to-End Audiovisual Speech Recognition☆184Nov 18, 2022Updated 3 years ago
- 🎮 Use a Raspberry Pi to control a LoPy over UART☆12Mar 9, 2017Updated 9 years ago
- A pipeline to read lips and generate speech for the read content, i.e., lip-to-speech synthesis.☆94Jul 23, 2025Updated 8 months ago
- ☆65Oct 8, 2018Updated 7 years ago
- Get the Tunisian translation, audio, and a sample sentence for the 20,000 most common English words☆13Jan 20, 2024Updated 2 years ago
- A library for interfacing with the 4.3inch UART e-Paper from a Raspberry Pi 2/3 via Python3 with example programs to display QR Codes for…☆12Mar 9, 2019Updated 7 years ago
- Color Coherence Vector is a powerful color-based image retrieval (Matlab)☆11Feb 27, 2015Updated 11 years ago
- This project applies Monte Carlo Tree Search (MCTS) to a simple grid world.☆10May 30, 2018Updated 7 years ago
- The implementation of 'Watch, Listen, Attend and Spell' (WLAS) network that learns to transcribe videos of mouth motion to character on p…☆11Mar 23, 2018Updated 8 years ago
- Automated Lip reading from real-time videos in tensorflow in python☆164Mar 20, 2018Updated 8 years ago
- Matlab code comparing five common neural network optimization algorithms: SGD, SGDM, Adagrad, AdaDelta, and Adam☆11Mar 23, 2022Updated 4 years ago
- Dual cross modality attention audio-visual speech recognition model based on vgg transformer with hybrid CTC/attention architecture using…☆14Jul 2, 2020Updated 5 years ago
- ☆15Apr 27, 2017Updated 8 years ago
- Pytorch implementation of deep fill v2 (original by Jiayu et al.)☆10Jun 26, 2019Updated 6 years ago
- Deep Learning Study with Gluon☆59Jun 3, 2018Updated 7 years ago
- Aranizer: A Custom Tokenizer based on SentencePiece and BPE tailored for Arabic Language Modeling☆22Aug 4, 2024Updated last year
- Conformer encoder + Transformer decoder with Hybrid CTC/attention☆12Nov 11, 2021Updated 4 years ago
- ☆37Dec 23, 2020Updated 5 years ago
- Descript Audio Codec - VAE Variant (.dac-vae): High-Fidelity Audio Compression with Variational Autoencoder☆36Aug 30, 2025Updated 7 months ago
- Unofficial implementation of "Pix2seq: A Language Modeling Framework for Object Detection" on mmdetection☆34Apr 18, 2022Updated 3 years ago
- End-To-End SpeechSynthesis system with knowledge distillation☆18Jul 16, 2022Updated 3 years ago
- Composable metric reporters in Python.☆14Jun 6, 2024Updated last year
- Lip Reading in the Wild using ResNet and LSTMs in PyTorch☆58Apr 23, 2018Updated 7 years ago
- A research project exploring fine-tuning BERT-style models for text generation☆40Nov 30, 2025Updated 4 months ago
- Monte Carlo tree search (MCTS) on traveling salesman problem (TSP)☆22Apr 27, 2019Updated 6 years ago
- Converting VIS json label to VOS format☆12Feb 16, 2021Updated 5 years ago
- An introduction to Natural Language Processing (NLP) course☆46Jan 1, 2022Updated 4 years ago
- Keras implementation of 'LipNet: End-to-End Sentence-level Lipreading'☆688Nov 22, 2022Updated 3 years ago
- CNN for visual speech recognition☆23Dec 5, 2016Updated 9 years ago
- Implementations of CTC, Speech-Transformer, and CIF for end-to-end speech recognition with PyTorch☆23Jul 28, 2020Updated 5 years ago
- Lip Reading - Cross Audio-Visual Recognition using 3D Architectures☆1,905Nov 7, 2022Updated 3 years ago