A PyTorch implementation of the 'Watch, Listen, Attend and Spell' (WLAS) network, which learns to transcribe videos of mouth motion to characters.
☆11Mar 23, 2018Updated 7 years ago
Alternatives and similar repositories for WLAS
Users who are interested in WLAS are comparing it to the repositories listed below
- End to End Multiview Lip Reading☆10Jan 26, 2018Updated 8 years ago
- Peng et al. "RED-Net: A Recurrent Encoder–Decoder Network for Video-Based Face Alignment". IJCV, 2018.☆12Jul 19, 2018Updated 7 years ago
- Python toolkit for Visual Speech Recognition☆38Jun 10, 2020Updated 5 years ago
- Code and model for the paper "Mutual Information Maximization for Effective Lip Reading"☆19Sep 4, 2020Updated 5 years ago
- Add motion-based magic to your React Native apps! ThinkSys Mediapipe Plugin offers real-time pose detection for iOS, with easy integratio…☆32Jan 19, 2026Updated last month
- 🎮 Use a Raspberry Pi to control a LoPy over UART☆12Mar 9, 2017Updated 8 years ago
- VoxSRC Challenge☆31Jun 11, 2019Updated 6 years ago
- ☆10Apr 16, 2020Updated 5 years ago
- python library☆12Nov 25, 2025Updated 3 months ago
- Audio-Visual Speech Recognition using Sequence to Sequence Models☆83Jul 10, 2020Updated 5 years ago
- Listen, Attend and Spell (LAS) framework for speech recognition (see https://arxiv.org/pdf/1508.01211.pdf).☆32Jun 27, 2019Updated 6 years ago
- Color Coherence Vector is a powerful color-based image retrieval method (Matlab)☆11Feb 27, 2015Updated 11 years ago
- ☆10Dec 16, 2018Updated 7 years ago
- C library for speech pre-processing.☆12Jun 7, 2019Updated 6 years ago
- golang package to provide lightweight internal pub/sub for goroutines☆29Jan 23, 2014Updated 12 years ago
- Tool for Evaluating Multilingual WS-353 and SimLex-999☆10Dec 15, 2016Updated 9 years ago
- Matlab code comparing five common neural network optimization algorithms: SGD, SGDM, Adagrad, AdaDelta, and Adam☆11Mar 23, 2022Updated 3 years ago
- Neural-Network Guided Expression Transformation☆13Apr 15, 2018Updated 7 years ago
- Octave port of the Fast Image Source Model by Eric A. Lehmann. Used for room acoustic modeling and impulse response simulation.☆12Aug 2, 2017Updated 8 years ago
- Add Rain Streak Mask on Unpaired Images Using GAN☆10Sep 12, 2020Updated 5 years ago
- Portal Tutorial☆11Feb 3, 2018Updated 8 years ago
- Curating Cognitive Behavioral Therapy☆13Dec 21, 2023Updated 2 years ago
- Efficient minimax optimization for deep adversarial learning, and more.☆10Mar 28, 2019Updated 6 years ago
- ☆10Feb 19, 2021Updated 5 years ago
- ☆10Oct 2, 2017Updated 8 years ago
- [CVPR 2019] Official Matlab implementation of OSD: Unsupervised image matching and object discovery as optimization.☆12Nov 4, 2021Updated 4 years ago
- Simple to use monitoring server application written in Go, extendable with scripts.☆11Aug 4, 2020Updated 5 years ago
- A CUDA powered audio decoding framework for FLAC.☆11May 22, 2018Updated 7 years ago
- Personal project. Pipeline to extract clothes from a picture. Models not provided.☆12Jun 13, 2018Updated 7 years ago
- Listen to a Redis PubSub channel and then rebroadcast over WebSockets.☆12Jun 23, 2016Updated 9 years ago
- This is a fork of code.google.com/p/snappy-go.☆11Mar 8, 2015Updated 10 years ago
- My solutions to the tasks of the course "DAT208x Introduction to Python for Data Science"☆11Dec 29, 2016Updated 9 years ago
- The missing UIKit Toolbox. Easy to use extensions for UI, Layout and Animation.☆12Mar 21, 2019Updated 6 years ago
- Postman collections for Redfish requests against HPE servers☆13Apr 18, 2021Updated 4 years ago
- ☆11Apr 20, 2024Updated last year
- A synthetic training data generator for a text recognition CNN☆10Jul 8, 2019Updated 6 years ago
- An application of stacked denoising autoencoders to multi-modal (images and audio) abstract feature discovery☆12Oct 23, 2013Updated 12 years ago
- DOneLogin Android: Facial verification for Two-Factor Authentication (2FA) on the Android platform☆11Mar 30, 2021Updated 4 years ago
- Docker Bind 1.9 image with Webmin Interface☆11Oct 29, 2020Updated 5 years ago