zhliuworks / EyeLipCropper
✂️ EyeLipCropper is a Python tool that crops the eye and mouth ROIs (regions of interest) from a given video.
☆14 Updated 3 years ago
Alternatives and similar repositories for EyeLipCropper
Users who are interested in EyeLipCropper are comparing it to the repositories listed below:
- Baseline system for CNVSRC2023 (Chinese Continuous Visual Speech Recognition Challenge 2023) ☆22 Updated last year
- Official repository for the paper VocaLiST: An Audio-Visual Synchronisation Model for Lips and Voices ☆66 Updated last year
- ☆17 Updated 6 months ago
- Tools for downloading VoxCeleb2 dataset ☆30 Updated last year
- Implementation of the paper: Channel-wise Gated Res2Net: Towards Robust Detection of Synthetic Speech Attacks (INTERSPEECH 2021) ☆31 Updated 3 years ago
- ☆37 Updated 6 months ago
- This repository includes the code to reproduce our paper "End-to-End Spectro-Temporal Graph Attention Networks for Speaker Verification A… ☆89 Updated last year
- The PyTorch Code and Model In "Learn an Effective Lip Reading Model without Pains", (https://arxiv.org/abs/2011.07557), which reaches the… ☆160 Updated 2 years ago
- PyTorch implementation of "Lip to Speech Synthesis with Visual Context Attentional GAN" (NeurIPS2021) ☆24 Updated last year
- ☆23 Updated last year
- Project page for "Improving Few-shot Learning for Talking Face System with TTS Data Augmentation" for ICASSP2023 ☆86 Updated last year
- ☆18 Updated last year
- Disentangled Speech Embeddings using Cross-Modal Self-Supervision ☆160 Updated 5 years ago
- ICASSP 2023: 'Speaker recognition with two-step multi-modal deep cleansing' ☆41 Updated 2 years ago
- Multi-Stage Face-Voice Association Learning with Keynote Speaker Diarization (ACM MM 2024) ☆21 Updated 10 months ago
- INTERSPEECH2023: Target Active Speaker Detection with Audio-visual Cues ☆52 Updated 2 years ago
- ☆23 Updated last year
- Implementation of the paper: Replay and Synthetic Speech Detection with Res2Net architecture (ICASSP 2021) https://arxiv.org/abs/2010.150… ☆82 Updated 3 years ago
- Official PyTorch implementation of paper Leveraging Unimodal Self Supervised Learning for Multimodal Audio-Visual Speech Recognition (ACL… ☆65 Updated 2 years ago
- This repository presents a subset of our proposed FSD dataset for song deepfake detection. ☆24 Updated 8 months ago
- ☆13 Updated 11 months ago
- Official repository for the paper Multimodal Transformer Distillation for Audio-Visual Synchronization (ICASSP 2024). ☆24 Updated last year
- ☆15 Updated last month
- AVT2-DWF: Improving Deepfake Detection with Audio-Visual Fusion and Dynamic Weighting Strategies ☆18 Updated last year
- This repository includes the code to reproduce our paper "RawBoost: A Raw Data Boosting and Augmentation Method applied to Automatic Spea… ☆61 Updated last year
- Code for the Active Speakers in Context Paper (CVPR2020) ☆54 Updated 4 years ago
- Code for Audio-Visual Target Speaker Extraction with Selective Auditory Attention (TASLP) ☆19 Updated 3 months ago
- A Python library for computing the Mel-Cepstral Distance (Mel-Cepstral Distortion, MCD) between two inputs. This implementation is based … ☆53 Updated 3 weeks ago
- Official Implementation of Visual Transformer Pooling for Lip reading ☆40 Updated 2 years ago
- This is the pytorch implementation of our work titled "An Efficient Temporary Deepfake Location Approach Based Embeddings for Partially S… ☆18 Updated 7 months ago