jingyunx/Deformation-Flow-Based-Two-stream-Network-for-Lip-Reading

jingyunx / Deformation-Flow-Based-Two-stream-Network-for-Lip-Reading

☆15

Alternatives and similar repositories for Deformation-Flow-Based-Two-stream-Network-for-Lip-Reading

Users that are interested in Deformation-Flow-Based-Two-stream-Network-for-Lip-Reading are comparing it to the libraries listed below.

ms-dot-k / Visual-Audio-Memory
PyTorch implementation of "Multi-modality Associative Bridging through Memory: Speech Sound Recollected from Face Video" (ICCV2021)
☆22 · Apr 11, 2022 · Updated 4 years ago
VIPL-Audio-Visual-Speech-Understanding / learn-an-effective-lip-reading-model-without-pains
The PyTorch Code and Model In "Learn an Effective Lip Reading Model without Pains", (https://arxiv.org/abs/2011.07557), which reaches the…
☆168 · Sep 12, 2025 · Updated 10 months ago
ms-dot-k / Multi-head-Visual-Audio-Memory
PyTorch implementation of "Distinguishing Homophenes using Multi-Head Visual-Audio Memory" (AAAI2022)
☆27 · Mar 9, 2024 · Updated 2 years ago
mpc001 / end-to-end-lipreading
Pytorch code for End-to-End Audiovisual Speech Recognition
☆183 · Nov 18, 2022 · Updated 3 years ago
VIPL-Audio-Visual-Speech-Understanding / deep-face-speechreading
Visual speech recognition with face inputs: code and models for F&G 2020 paper "Can We Read Speech Beyond the Lips? Rethinking RoI Select…
☆19 · Apr 12, 2021 · Updated 5 years ago
deeplsd / Merkel-Podcast-Corpus
This dataset is presented in the paper Merkel Podcast Corpus: A Multimodal Dataset Compiled from 16 Years of Angela Merkel's Weekly Video…
☆12 · Sep 21, 2022 · Updated 3 years ago
prajwalkr / vtp
Official Implementation of Visual Transformer Pooling for Lip reading
☆41 · Aug 8, 2022 · Updated 3 years ago
NirHeaven / D3D
The proposed method in LRW-1000: A Naturally-Distributed Large-Scale Benchmark for Lip Reading in the Wild
☆26 · Nov 23, 2018 · Updated 7 years ago
Exgc / OpenSR
The official implementation of OpenSR (ACL2023 Oral)
☆17 · Nov 29, 2023 · Updated 2 years ago
Droliven / diverse_sampling
Official project of DiverseSampling (ACMMM2022 Paper)
☆16 · Feb 25, 2023 · Updated 3 years ago
tgc1997 / event-based-lip-reading
Multi-grained Spatio-Temporal Features Perceived Network for Event-based Lip-Reading (CVPR 2022)
☆16 · Jun 18, 2022 · Updated 4 years ago
Idiot-Coke / -Micro-expression-recognition-based-on-spatiotemporal-features
Micro-expression recognition based on spatiotemporal features
☆13 · May 31, 2020 · Updated 6 years ago
sailordiary / LipNet-PyTorch
"LipNet: End-to-End Sentence-level Lipreading" in PyTorch
☆70 · Sep 9, 2019 · Updated 6 years ago
VIPL-Audio-Visual-Speech-Understanding / LRW1000--CAS-VSR-W1k
DenseNet3D Model In "LRW-1000: A Naturally-Distributed Large-Scale Benchmark for Lip Reading in the Wild", https://arxiv.org/abs/1810.069…
☆123 · Mar 13, 2026 · Updated 4 months ago
liuzhejun / XWbank_LipReading
Approach and code shared from the preliminary and final rounds of the 2019 "Chuang Qingchun · Jiaozi Cup" XWBank University FinTech Challenge
☆28 · Dec 11, 2019 · Updated 6 years ago
lzuwei / end-to-end-multiview-lipreading
End to End Multiview Lip Reading
☆10 · Jan 26, 2018 · Updated 8 years ago
LUMIA-Group / Leveraging-Self-Supervised-Learning-for-AVSR
Official PyTorch implementation of paper Leveraging Unimodal Self Supervised Learning for Multimodal Audio-Visual Speech Recognition (ACL…
☆67 · Jul 13, 2022 · Updated 4 years ago
siddharthdivi / Unifying-Distillation-with-Personalization-in-Federated-Learning
Repository that contains the code for the paper titled, 'Unifying Distillation with Personalization in Federated Learning'.
☆14 · May 31, 2021 · Updated 5 years ago
tianyuan168326 / EAN-Pytorch
Official PyTorch Implementation of paper EAN: Event Adaptive Network for Efficient Action Recognition https://arxiv.org/abs/2107.10771
☆33 · Oct 24, 2023 · Updated 2 years ago
zhaoyi2 / CVTE_chain_model_finetune
Fine-tune the chain model based on the CVTE open-source model without training any GMM for frame alignment
☆12 · Aug 6, 2020 · Updated 5 years ago
JazminVidal / gop-ft
Transfer learning approach to pronunciation scoring
☆12 · Jan 17, 2024 · Updated 2 years ago
yzfly / TCM
TCM: Temporal Correlation Module
☆17 · Apr 24, 2021 · Updated 5 years ago
zgzxy001 / STMT
Code for the CVPR'23 paper: "STMT: A Spatial-Temporal Mesh Transformer for MoCap-Based Action Recognition"
☆21 · Dec 9, 2024 · Updated last year
Zzzzz1 / CSKD
Official code for Cumulative Spatial Knowledge Distillation for Vision Transformers (ICCV-2023) https://openaccess.thecvf.com/content/ICC…
☆15 · Nov 5, 2023 · Updated 2 years ago
itsyoavshalev / End-to-End-Lip-Synchronization-with-a-Temporal-AutoEncoder
☆22 · Mar 31, 2022 · Updated 4 years ago
guotaowang / STANet
☆16 · Sep 20, 2022 · Updated 3 years ago
arthurhero / deep_fill_2_pytorch
Pytorch implementation of deep fill v2 (original by Jiayu et al.)
☆10 · Jun 26, 2019 · Updated 7 years ago
workerman-php / redis
Asynchronous redis client for PHP based on workerman.
☆21 · Jun 2, 2026 · Updated last month
Pliploop / SemiSupCon
Semi-Supervised Contrastive Learning for music classification - towards HIL-representation learning.
☆17 · Jul 24, 2024 · Updated 2 years ago
google-deepmind / brave
A JAX implementation of Broaden Your Views for Self-Supervised Video Learning, or BraVe for short.
☆50 · Updated this week
OliverRensu / TENet-Triple-Excitation-Network-for-Video-Salient-Object-Detection
☆15 · Aug 21, 2020 · Updated 5 years ago
AndyTang15 / FLAG3D
☆19 · Jun 22, 2026 · Updated last month
xweiyuan / Face-de-occlusion-using-3D-morphable-model-and-generative-adversarial-network
Face de-occlusion using 3D morphable model and generative adversarial network
☆34 · Oct 22, 2021 · Updated 4 years ago
Atom-101 / FourierFeat-Siren
Pytorch implementation and comparison of Fourier Feature Networks and Sinusoidal Representation Networks
☆13 · Jun 27, 2020 · Updated 6 years ago
zhaoxing2022 / MMN-VSOD
☆15 · Jan 9, 2024 · Updated 2 years ago
guoyongcs / TAPADL
Code of "Robustifying Token Attention for Vision Transformers"
☆20 · Dec 31, 2023 · Updated 2 years ago
asindel / ArtFacePoints
High-resolution facial landmark detection in artworks
☆23 · Dec 17, 2023 · Updated 2 years ago
mpc001 / Visual_Speech_Recognition_for_Multiple_Languages
Visual Speech Recognition for Multiple Languages
☆479 · Aug 17, 2023 · Updated 2 years ago
wisebobo / doc_ocr_by_template
This is an OCR program designed for travel document. It can now support 23 types of documents with pre-defined template. You can add what…
☆10 · Nov 22, 2022 · Updated 3 years ago