allenye66 / Computer-Vision-Lip-Reading-2.0
A speech recognition system using 3D CNNs. The final model achieves 97.4% training accuracy and a 99.2% testing accuracy and the system can accurately recognize spoken words from a set of pre-defined words in real-time.
☆42Updated last year
Alternatives and similar repositories for Computer-Vision-Lip-Reading-2.0:
Users that are interested in Computer-Vision-Lip-Reading-2.0 are comparing it to the libraries listed below
- A pipeline to read lips and generate speech for the read content, i.e Lip to Speech Synthesis.☆79Updated 3 years ago
- ☆178Updated 7 months ago
- End-to-end pipeline for lip reading at the word level using a tensorflow CNN implementation.☆33Updated 5 years ago
- Identify a voice as male or female.☆33Updated 7 years ago
- Deep Visual Speech Recognition in arabic words☆16Updated last year
- Official Code implementation for the ICLR paper "LipVoicer: Generating Speech from Silent Videos Guided by Lip Reading"☆59Updated 5 months ago
- [Interspeech 2024] SyncVSR: Data-Efficient Visual Speech Recognition with End-to-End Crossmodal Audio Token Synchronization☆46Updated 2 months ago
- Designed and Developed end-to-end scalable Deep Learning Project. It is a detection system trained using InceptionV3(CNN model) + GRU(S…☆38Updated 6 months ago
- Auto-AVSR: Lip-Reading Sentences Project☆310Updated last month
- Detecting Anxiety and Depression using facial emotion recognition and speech emotion recognition. Written in pythonPython☆56Updated 3 years ago
- Project Made during Virtual Summer Internship under leadingindia.ai and BENNETT UNIVERSITY.☆92Updated 2 years ago
- ☆15Updated 3 years ago
- We'll look into audio categorization using deep learning principles like Artificial Neural Networks (ANN), 1D Convolutional Neural Networ…☆41Updated 2 years ago
- ☆33Updated 4 years ago
- Face Emotion Recognition using Machine Learning Python☆30Updated 11 months ago
- [Interspeech 2023] Intelligible Lip-to-Speech Synthesis with Speech Units☆27Updated 3 months ago
- This project is a real-time deepfake detection system implemented in PyTorch. Deepfakes are manipulated videos or images that use artific…☆19Updated last year
- Automated Lip reading from real-time videos in tensorflow in python☆160Updated 6 years ago
- Python library & framework to build custom translators for the hearing-impaired and translate between Sign Language & Text using Artifici…☆197Updated 4 months ago
- Speech Emotion Detection using SVM, Decision Tree, Random Forest, MLP, CNN with different architectures☆34Updated last year
- Detecting Deepfakes Without Seeing Any☆151Updated 6 months ago
- Code for the paper "Real Time Speech Emotion Recognition using Machine Learning"☆22Updated 3 years ago
- An elegant store for buying artistic products from all over India.☆10Updated last year
- Voice stress analysis (VSA) aims to differentiate between stressed and non-stressed outputs in response to stimuli (e.g., questions posed…☆91Updated 3 years ago
- An implementation of the paper titled "Arabic Speech Emotion Recognition Employing Wav2vec2.0 and HuBERT Based on BAVED Dataset" https://…☆13Updated 3 years ago
- Detecting depression in a conversation using Convolutional Neral Network☆68Updated 3 years ago
- Developed and trained Gated-CNN models to detect types of stutter in speech and SVM classifier to suggest new therapies to the user accor…☆20Updated 3 years ago
- A Sign Language Learning Platform where who know sign language can come and practice Sign Language and also people who don't know can lea…☆35Updated 6 months ago
- Sign Language Translator enables the hearing impaired user to communicate efficiently in sign language, and the application will translat…☆38Updated 10 months ago