georgesterpu/pyVSR

Python toolkit for Visual Speech Recognition

☆37

Alternatives and similar repositories for pyVSR

Users interested in pyVSR are comparing it to the repositories listed below.

lzuwei / end-to-end-multiview-lipreading
End to End Multiview Lip Reading
☆10 | Jan 26, 2018 | Updated 8 years ago
artem179 / WLAS
The implementation of 'Watch, Listen, Attend and Spell' (WLAS) network that learns to transcribe videos of mouth motion to character on p…
☆11 | Mar 23, 2018 | Updated 8 years ago
georgesterpu / avsr-tf1
Audio-Visual Speech Recognition using Sequence to Sequence Models
☆84 | Jul 10, 2020 | Updated 6 years ago
euancrabtree / Lipreading-PyTorch
Lip Reading in the Wild using ResNet and LSTMs in PyTorch
☆57 | Apr 23, 2018 | Updated 8 years ago
afourast / deep_lip_reading
Code and models for evaluating a state-of-the-art lip reading network
☆196 | Mar 24, 2023 | Updated 3 years ago
LeeYongHyeok / DCM_vgg_transformer
Dual cross modality attention audio-visual speech recognition model based on vgg transformer with hybrid CTC/attention architecture using…
☆14 | Jul 2, 2020 | Updated 6 years ago
sailordiary / LipNet-PyTorch
"LipNet: End-to-End Sentence-level Lipreading" in PyTorch
☆70 | Sep 9, 2019 | Updated 6 years ago
tomkocse / sim-rir-preparation
Script to simulate room impulse responses
☆16 | Sep 29, 2016 | Updated 9 years ago
mpc001 / end-to-end-lipreading
Pytorch code for End-to-End Audiovisual Speech Recognition
☆183 | Nov 18, 2022 | Updated 3 years ago
VIPL-Audio-Visual-Speech-Understanding / deep-face-speechreading
Visual speech recognition with face inputs: code and models for F&G 2020 paper "Can We Read Speech Beyond the Lips? Rethinking RoI Select…
☆19 | Apr 12, 2021 | Updated 5 years ago
ajinkyaT / Lip_Reading_in_the_Wild_AVSR
Audio-Visual Speech Recognition using Deep Learning
☆61 | Nov 14, 2018 | Updated 7 years ago
Faur / TIMIT
Framewise phoneme classification on the TIMIT dataset using neural networks
☆19 | Jul 14, 2016 | Updated 10 years ago
joseph-zhong / LipReading
Speech Recognition without audio input
☆144 | May 5, 2026 | Updated 2 months ago
tstafylakis / Lipreading-ResNet
Torch code for using Residual Networks with LSTMs for Lipreading
☆99 | Oct 8, 2018 | Updated 7 years ago
tstafylakis / Speaker-Embeddings-Correlation-Pooling
Original implementation of the pooling method introduced in "Speaker embeddings by modeling channel-wise correlations"
☆11 | Sep 20, 2021 | Updated 4 years ago
joonson / voxsrc_2019
VoxSRC Challenge
☆31 | Jun 11, 2019 | Updated 7 years ago
matthijsvk / TCDTIMITprocessing
processing and extracting of face and mouth image files out of the TCDTIMIT database
☆47 | Sep 22, 2020 | Updated 5 years ago
matthijsvk / multimodalSR
Multimodal speech recognition using lipreading (with CNNs) and audio (using LSTMs). Sensor fusion is done with an attention network.
☆69 | Nov 19, 2022 | Updated 3 years ago
Bucknalla / lopy-raspberrypi
🎮 Use a Raspberry Pi to control a LoPy over UART
☆12 | Mar 9, 2017 | Updated 9 years ago
prajwalkr / transpotter
Official implementation of Transpotter, published in BMVC 2021
☆16 | Aug 6, 2022 | Updated 3 years ago
anicolson / matlab_feat
Functions for creating speech features in MATLAB.
☆14 | Jul 7, 2020 | Updated 6 years ago
VIPL-Audio-Visual-Speech-Understanding / learn-an-effective-lip-reading-model-without-pains
The PyTorch Code and Model In "Learn an Effective Lip Reading Model without Pains", (https://arxiv.org/abs/2011.07557), which reaches the…
☆168 | Sep 12, 2025 | Updated 10 months ago
etzinis / biased_separation
Code for the paper: Unified Gradient Reweighting for Model Biasing with Applications to Source Separation
☆14 | Nov 16, 2020 | Updated 5 years ago
kaist-ami / 3d-talking-head-av-guidance
[INTERSPEECH'24] Official repository for "Enhancing Speech-Driven 3D Facial Animation with Audio-Visual Guidance from Lip Reading Expert"
☆19 | Jun 25, 2025 | Updated last year
jarret / raspi-uart-waveshare
A library for interfacing with the 4.3inch UART e-Paper from a Raspberry Pi 2/3 via Python3 with example programs to display QR Codes for…
☆12 | Mar 9, 2019 | Updated 7 years ago
onolab-tmu / asp-tutorial-2022
Ono laboratory audio signal processing exercise for beginners.
☆19 | May 10, 2023 | Updated 3 years ago
bsxfan / meta-embeddings
Meta-embeddings are a probabilistic generalization of embeddings in machine learning.
☆23 | Nov 23, 2018 | Updated 7 years ago
TarekVito / ColorCoherenceVector
Color Coherence Vector is a powerful color-based image retrieval (Matlab)
☆11 | Feb 27, 2015 | Updated 11 years ago
topherbuckley / fast_ISM
Octave port of the Fast Image Source Model by Eric A. Lehmann. Used for room acoustic modeling and impulse response simulation.
☆12 | Aug 2, 2017 | Updated 8 years ago
lshiwjx / deformable-3d-convnets
Deformable 3D ConvNets for Action Recognition
☆10 | Jan 21, 2018 | Updated 8 years ago
saschaschramm / MonteCarloTreeSearch
This project applies Monte Carlo Tree Search (MCTS) to a simple grid world.
☆10 | May 30, 2018 | Updated 8 years ago
afourast / avobjects
Implementation for ECCV20 paper "Self-Supervised Learning of audio-visual objects from video"
☆114 | Nov 16, 2020 | Updated 5 years ago
nesl / WiFislam
Android WiFi capturing and indoor localization using SLAM
☆13 | Oct 10, 2013 | Updated 12 years ago
dalinvip / PyTorch_Chinese_word_segmentation
Chinese word segmentation with the neural seq2seq model implement in pytorch
☆10 | Dec 13, 2017 | Updated 8 years ago
Li-Sanze / ID-Card
Given the front and back of an ID card, recognize all of the text information on the card
☆10 | Sep 4, 2019 | Updated 6 years ago
MlWoo / WaveRNN-TF
☆15 | Oct 11, 2019 | Updated 6 years ago
lelechen63 / 3d_gan
☆34 | Jul 25, 2018 | Updated 7 years ago
shenzhun / speech-recognition-using-pocketsphinx
Using pocketsphinx, cmuclmtk and NLTK to build speech recognition system
☆13 | Sep 23, 2013 | Updated 12 years ago
hassanhub / LipReading
☆64 | Oct 8, 2018 | Updated 7 years ago