Chinese words classification using lipnet with pytorch
☆40Nov 18, 2019Updated 6 years ago
Alternatives and similar repositories for LipNet_ChineseWordsClassification
Users that are interested in LipNet_ChineseWordsClassification are comparing it to the libraries listed below
Sorting:
- 2019年“创青春.交子杯”新网银行高校金融科技挑战赛-AI算法赛道比赛_代码分享☆88Jul 15, 2020Updated 5 years ago
- LipNet with gluon☆23Nov 22, 2022Updated 3 years ago
- The state-of-art PyTorch implementation of the method described in the paper "LipNet: End-to-End Sentence-level Lipreading" (https://arxi…☆235Sep 21, 2022Updated 3 years ago
- Skeleton Graph Convolution Network is based on the Deep Graph Library and inspired by ST-GCN network designing.☆16May 11, 2019Updated 6 years ago
- lip_reading_demo_net☆32Oct 22, 2019Updated 6 years ago
- An OpenCV demo on detecting whether a person is speaking or not.☆23Mar 21, 2012Updated 14 years ago
- ☆65Oct 8, 2018Updated 7 years ago
- sk-cnn is proposed in Skeleton based action recognition with convolutional neural network(PR 2016). Here implemented in Keras☆19Apr 10, 2018Updated 7 years ago
- The proposed method in LRW-1000: A Naturally-Distributed Large-Scale Benchmark for Lip Reading in the Wild☆26Nov 23, 2018Updated 7 years ago
- Automated Lip Reading using Deep Reinforcement Learning☆32Jun 24, 2018Updated 7 years ago
- Code for our EMNLP 2022 paper: Generative Entity Typing with Curriculum Learning.☆13Aug 19, 2023Updated 2 years ago
- Modification of YOLOv3 by applying EfficientNet as a backbone instead of Darknet53☆14Aug 2, 2019Updated 6 years ago
- Speech Recognition without audio input☆144Jan 14, 2019Updated 7 years ago
- Code and models for evaluating a state-of-the-art lip reading network☆196Mar 24, 2023Updated 2 years ago
- #DNN #CNN #LSTM #Classification #Sequential_data #Lip_reading☆28Jun 3, 2018Updated 7 years ago
- Keras Filter Response Normalization Layer.☆15Mar 30, 2020Updated 5 years ago
- PyTorch implementation of Human Action Recognition Based on Spatial-Temporal Attention at ICLR 2019☆14Dec 12, 2018Updated 7 years ago
- Use human pose information to help action recognition, explored with attention-pooling method, C3D method and two-stream architecture, im…☆18Jun 7, 2018Updated 7 years ago
- A replication of Google DeepMind's paper End-to-End Sentence-level Lipreading☆28Sep 26, 2017Updated 8 years ago
- The code for AAAI 2025 “Large Language Models Are Read/Write Policy-Makers for Simultaneous Generation”☆15Jan 3, 2025Updated last year
- A version of Obamanet that you won't go insane setting up.☆17Nov 21, 2022Updated 3 years ago
- ☆13Nov 6, 2021Updated 4 years ago
- My experiments in lip reading using deep learning with the LRW dataset☆53Mar 14, 2021Updated 5 years ago
- ☆11Sep 1, 2024Updated last year
- ☆22Dec 15, 2023Updated 2 years ago
- Companion code for Awe the Audience: How the Narrative Trajectories Affect Audience Perception in Public Speaking☆14Jan 6, 2018Updated 8 years ago
- Unofficial implementation for SOLO instance segmentation☆25Mar 29, 2020Updated 5 years ago
- Fast instruction tuning with Llama2☆11Apr 8, 2024Updated last year
- Building Pytorch Server with Flask☆31Mar 12, 2018Updated 8 years ago
- An unofficial (PyTorch) implementation for the paper Deep Lip Reading: A comparison of models and an online application.☆10May 13, 2020Updated 5 years ago
- 视频点击预测大赛-TOP1方案☆89Jan 20, 2022Updated 4 years ago
- I-Vector Speaker recognition system implemented with MSRIT in matlab☆15Jan 12, 2016Updated 10 years ago
- A Pytorch (support batch and channel) implementation of GoogleBrain's SpecAugment: A Simple Data Augmentation Method for Automatic Speech…☆12Jul 24, 2024Updated last year
- Deep Variational Information Bottleneck (DVIB) in PyTorch.☆10Apr 25, 2020Updated 5 years ago
- Code for Self-and-Collaborative Attention Network from "SCAN: Self-and-Collaborative Attention Network for Video Person Re-identification…☆26Jun 1, 2019Updated 6 years ago
- ☆14Jul 27, 2022Updated 3 years ago
- ☆12Sep 14, 2020Updated 5 years ago
- A structured parsing technique for NER☆15May 26, 2023Updated 2 years ago
- This repository contains the speaker labeled information of VoxCeleb2 and LRS3 audio-visual datasets. (AAAI 2025)☆13Sep 6, 2024Updated last year