Chinese word classification using LipNet with PyTorch
☆40Nov 18, 2019Updated 6 years ago
Alternatives and similar repositories for LipNet_ChineseWordsClassification
Users who are interested in LipNet_ChineseWordsClassification are comparing it to the libraries listed below.
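- Approaches and code shared from the preliminary and final rounds of the 2019 “创青春·交子杯” 新网银行 (XW Bank) University FinTech Challenge☆28Dec 11, 2019Updated 6 years ago
- Code shared from the AI algorithm track of the 2019 “创青春.交子杯” 新网银行 (XW Bank) University FinTech Challenge☆89Jul 15, 2020Updated 5 years ago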
- The state-of-the-art PyTorch implementation of the method described in the paper "LipNet: End-to-End Sentence-level Lipreading" (https://arxi…☆237Sep 21, 2022Updated 3 years ago
- "LipNet: End-to-End Sentence-level Lipreading" in PyTorch☆69Sep 9, 2019Updated 6 years ago
- CNN for visual speech recognition☆23Dec 5, 2016Updated 9 years ago
- Skeleton Graph Convolution Network is based on the Deep Graph Library and inspired by ST-GCN network designing.☆16May 11, 2019Updated 7 years ago
- ☆11May 31, 2020Updated 5 years ago
- An OpenCV demo on detecting whether a person is speaking or not.☆23Mar 21, 2012Updated 14 years ago
- lip_reading_demo_net☆32Oct 22, 2019Updated 6 years ago
- ☆64Oct 8, 2018Updated 7 years ago
- sk-cnn, proposed in "Skeleton based action recognition with convolutional neural network" (PR 2016), implemented here in Keras☆19Apr 10, 2018Updated 8 years ago
- This repo is used for generating fake labeled positive videos for the SVD dataset.☆10Aug 16, 2020Updated 5 years ago
- A PyTorch implementation of fine-grained few-shot classification using triplet loss☆11Feb 24, 2019Updated 7 years ago
- Simulation of movement of a human character using forward and inverse kinematics☆13May 22, 2016Updated 10 years ago
- The proposed method in LRW-1000: A Naturally-Distributed Large-Scale Benchmark for Lip Reading in the Wild☆26Nov 23, 2018Updated 7 years ago
- Automated Lip Reading using Deep Reinforcement Learning☆32Jun 24, 2018Updated 7 years ago
- Audio-Visual Speech Recognition using Deep Learning☆61Nov 14, 2018Updated 7 years ago
- Speech Recognition without audio input☆143May 5, 2026Updated 2 weeks ago
- Optimized Syncnet and Chinese enhanced version, EN and CN checkpoints released☆11Nov 8, 2021Updated 4 years ago
- Code and models for evaluating a state-of-the-art lip reading network☆196Mar 24, 2023Updated 3 years ago
- PyTorch implementation of Human Action Recognition Based on Spatial-Temporal Attention at ICLR 2019☆14Dec 12, 2018Updated 7 years ago
- The code for AAAI 2025 “Large Language Models Are Read/Write Policy-Makers for Simultaneous Generation”☆15Jan 3, 2025Updated last year
- [2026'ICLR] Official Code for SurfSplat☆76Apr 21, 2026Updated last month
- My experiments in lip reading using deep learning with the LRW dataset☆54Mar 14, 2021Updated 5 years ago
- ☆12Sep 19, 2021Updated 4 years ago
- [CVPR 2026 Findings] SwiftVGGT: A Scalable Visual Geometry Grounded Transformer for Large-Scale Scenes☆85Nov 25, 2025Updated 5 months ago
- Companion code for Awe the Audience: How the Narrative Trajectories Affect Audience Perception in Public Speaking☆14Jan 6, 2018Updated 8 years ago
- ☆19Jan 18, 2019Updated 7 years ago
- Official source code for the paper "Tailored Design of Audio-Visual Speech Recognition Models using Branchformers"☆14Feb 24, 2025Updated last year
- An unofficial (PyTorch) implementation for the paper Deep Lip Reading: A comparison of models and an online application.☆10May 13, 2020Updated 6 years ago
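- My study notes and summary on deep reinforcement learning☆73Mar 18, 2026Updated 2 months ago
- An FTP server written in C++. It implements user login, directory listing, file upload, file download, file deletion, file renaming, and other main features. The project mainly involves sockets and the FTP protocol☆14Jul 16, 2019Updated 6 years ago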
- A PyTorch (supports batch and channel) implementation of GoogleBrain's SpecAugment: A Simple Data Augmentation Method for Automatic Speech…☆12Jul 24, 2024Updated last year
- Code for Self-and-Collaborative Attention Network from "SCAN: Self-and-Collaborative Attention Network for Video Person Re-identification…☆26Jun 1, 2019Updated 6 years ago
- ☆13May 13, 2017Updated 9 years ago
- ☆18May 6, 2019Updated 7 years ago
- Sparse Label Smoothing Regularization for Person Re-Identification☆41May 13, 2019Updated 7 years ago
- The speaker-labeled information of LRW dataset, which is the outcome of the paper "Speaker-adaptive Lip Reading with User-dependent Paddi…☆10Oct 12, 2023Updated 2 years ago
- Audio-Visual Speech Recognition☆24Jul 7, 2025Updated 10 months ago