zhliuworks / EyeLipCropperLinks
✂️ EyeLipCropper is a Python tool to crop eyes and mouth ROIs of the given video.
☆14Updated 3 years ago
Alternatives and similar repositories for EyeLipCropper
Users that are interested in EyeLipCropper are comparing it to the libraries listed below
Sorting:
- Disentangled Speech Embeddings using Cross-Modal Self-Supervision☆160Updated 5 years ago
- Baseline system for CNVSRC2023 (Chinese Continuous Visual Speech Recognition Challenge 2023)☆22Updated last year
- ☆18Updated last year
- A PyTorch implementation of the Deep Audio-Visual Speech Recognition paper.☆234Updated last year
- Multi-Stage Face-Voice Association Learning with Keynote Speaker Diarization (ACM MM 2024)☆20Updated 11 months ago
- Tools for downloading VoxCeleb2 dataset☆30Updated last year
- The PyTorch Code and Model In "Learn an Effective Lip Reading Model without Pains", (https://arxiv.org/abs/2011.07557), which reaches the…☆163Updated 2 years ago
- ☆169Updated last year
- ☆39Updated 7 months ago
- Audio-Visual Speech Separation with Cross-Modal Consistency☆232Updated last year
- Implementation of the paper: Channel-wise Gated Res2Net: Towards Robust Detection of Synthetic Speech Attacks (INTERSPEECH 2021)☆31Updated 3 years ago
- ☆16Updated 2 months ago
- Code for Audio-Visual Target Speaker Extraction with Selective Auditory Attention (TASLP)☆24Updated 4 months ago
- Official PyTorch implementation of paper Leveraging Unimodal Self Supervised Learning for Multimodal Audio-Visual Speech Recognition (ACL…☆65Updated 3 years ago
- ICASSP 2023: 'Speaker recognition with two-step multi-modal deep cleansing'☆42Updated 2 years ago
- Project page for "Improving Few-shot Learning for Talking Face System with TTS Data Augmentation" for ICASSP2023☆86Updated last year
- Voice Face Association Learning Paper List☆16Updated 2 years ago
- ☆150Updated 2 years ago
- ☆81Updated last month
- A ResNet Speaker Recognition&Verification Demo☆26Updated 3 years ago
- Visual speech recognition with face inputs: code and models for F&G 2020 paper "Can We Read Speech Beyond the Lips? Rethinking RoI Select…☆19Updated 4 years ago
- [ICASSP 2023] Official Tensorflow implementation of "Temporal Modeling Matters: A Novel Temporal Emotional Modeling Approach for Speech E…☆175Updated last year
- Speech Representation Disentanglement with Adversarial Mutual Information Learning for One-shot Voice Conversion (Interspeech 2022)☆116Updated last year
- A summary of speech data augment algorithms☆69Updated 4 years ago
- ☆17Updated 7 months ago
- Pytorch implementation☆9Updated 5 years ago
- Official Implementation of Visual Transformer Pooling for Lip reading☆40Updated 2 years ago
- PyTorch implementation of "Lip to Speech Synthesis with Visual Context Attentional GAN" (NeurIPS2021)☆25Updated last year
- Deep-Learning-Based Audio-Visual Speech Enhancement and Separation☆210Updated 2 years ago
- ICASSP'22 Training Strategies for Improved Lip-Reading; ICASSP'21 Towards Practical Lipreading with Distilled and Efficient Models; ICASS…☆419Updated 2 years ago