paul-shuvo / gazetimationLinks
Gaze estimation from 2D image
☆12Updated last year
Alternatives and similar repositories for gazetimation
Users that are interested in gazetimation are comparing it to the libraries listed below
Sorting:
- A large-scale publicly-available visual-thermal-audio dataset designed to encourage research in the general areas of user authentication,…☆87Updated 6 months ago
- Code for paper "Hearing Lips in Noise: Universal Viseme-Phoneme Mapping and Transfer for Robust Audio-Visual Speech Recognition"☆28Updated 2 years ago
- Official code for the paper "Visual Speech Enhancement Without A Real Visual Stream" published at WACV 2021☆108Updated last year
- A complete end-to-end Deep Learning system to generate high quality human like speech in English for Korean Drama (WIP)☆13Updated 3 years ago
- Automated Lip Reading using Deep Reinforcement Learning☆32Updated 7 years ago
- Video chat apps with computer vision filters built on top of Streamlit☆50Updated 2 years ago
- ☆21Updated 3 years ago
- Modify-Anything is based on yolov5,yolov8 for video and image detection. Segment-anything,lama_cleaner is applied to segment, modify, era…☆17Updated 2 years ago
- End-to-end pipeline for lip reading at the word level using a tensorflow CNN implementation.☆35Updated 5 years ago
- InfAnFace: Bridging the infant-adult domain gap in facial landmark estimation in the wild (ICPR2022)☆12Updated 3 weeks ago
- ☆27Updated 4 years ago
- SpeechYOLO Interspeech 2019☆46Updated 3 years ago
- Pytorch Implementation of the Explainable Conditional Adversarial Autoencoder using Saliency Maps and SHAP (J. of Imaging - MDPI)☆12Updated 10 months ago
- A pipeline to read lips and generate speech for the read content, i.e Lip to Speech Synthesis.☆93Updated 6 months ago
- ☆10Updated 4 years ago
- Facial Expression Recognition (FER) for Mental Health Detection applies AI models like Swin Transformer, CNN, and ViT for detecting emot…☆29Updated last year
- Identify the emotion of multiple speakers in an Audio Segment☆178Updated 2 years ago
- Speech to Facial Animation using GANs☆40Updated 4 years ago
- mirror of VoxCeleb dataset - a large-scale speaker identification dataset☆74Updated 6 years ago
- A python library for face detection and features extraction based on mediapipe library☆48Updated last year
- SERVER: Multi-modal Speech Emotion Recognition using Transformer-based and Vision-based Embeddings☆15Updated 2 years ago
- The Additive Margin MobileNet1D is a new light weight deep learning model for Speaker Recognition which is based on the MobileNetV2 archi…☆30Updated 2 years ago
- [InterSpeech'2023] "Betray Oneself: A Novel Audio DeepFake Detection Model via Mono-to-Stereo Conversion"☆13Updated last year
- The official code for our paper "A Framework for Integrating Gesture Generation Models into Interactive Conversational Agents", published…☆36Updated 4 years ago
- Learning Lip Sync of Obama from Speech Audio☆66Updated 5 years ago
- Python scripts using the Mediapipe models for Halloween.☆43Updated 4 years ago
- A simple voice conversion tool☆19Updated 3 years ago
- a PyTorch implementation of Lip2Wav☆51Updated 3 years ago
- One-shot TTS with Improved Unseen Speaker and Style Transfer☆37Updated 3 years ago
- ☆16Updated 6 years ago