Official repository for the paper VocaLiST: An Audio-Visual Synchronisation Model for Lips and Voices
☆74Apr 7, 2024Updated 2 years ago
Alternatives and similar repositories for vocalist
Users that are interested in vocalist are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Official repository for the paper Multimodal Transformer Distillation for Audio-Visual Synchronization (ICASSP 2024).☆29Apr 3, 2024Updated 2 years ago
- Source code for "Sparse in Space and Time: Audio-visual Synchronisation with Trainable Selectors." (Spotlight at the BMVC 2022)☆56Jan 29, 2024Updated 2 years ago
- deep-learning based audio-visual lip bometrics☆15May 9, 2023Updated 2 years ago
- This is the release code for CVPR2022 paper "Voice-Face Homogeneity Tells Deepfake".☆15Mar 7, 2022Updated 4 years ago
- Out of time: automated lip sync in the wild☆881Apr 11, 2026Updated last week
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- ☆429Nov 1, 2023Updated 2 years ago
- PyTorch implementation of "StyleSync: High-Fidelity Generalized and Personalized Lip Sync in Style-based Generator"☆215Aug 8, 2023Updated 2 years ago
- [ECCV 2022] StyleHEAT: A framework for high-resolution editable talking face generation☆658Mar 26, 2023Updated 3 years ago
- Audio-Visual Speech Recognition using Sequence to Sequence Models☆84Jul 10, 2020Updated 5 years ago
- Disentangled Speech Embeddings using Cross-Modal Self-Supervision☆166Apr 12, 2020Updated 6 years ago
- ☆18Jun 14, 2025Updated 10 months ago
- Official implementation of A cappella: Audio-visual Singing VoiceSeparation, from BMVC21☆17May 14, 2022Updated 3 years ago
- A self-supervised learning framework for audio-visual speech☆981Dec 7, 2023Updated 2 years ago
- Audio-Visual Active Speaker Detection with PyTorch on AVA-ActiveSpeaker dataset☆72Jan 18, 2022Updated 4 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- ☆528Dec 26, 2023Updated 2 years ago
- Optimized Syncnet and Chinese enhanced version, EN and CN checkpoints released☆11Nov 8, 2021Updated 4 years ago
- The MAVD represents Mandarin Audio-Visual dataset with Depth information. MAVD has a rich variety of modal data, including audio, RGB ima…☆20Apr 22, 2024Updated last year
- Textless Speech-to-Music Retrieval Using Emotion Similarity [ICASSP23]☆17Aug 16, 2023Updated 2 years ago
- [NeurIPS 2024] This is the official repo of the paper "Lips Are Lying: Spotting the Temporal Inconsistency between Audio and Visual in Li…☆137Feb 9, 2025Updated last year
- Parallel and High-Fidelity Text-to-Lip Generation; AAAI 2022 ; Official code☆109May 1, 2022Updated 3 years ago
- [ICME 2025] DiffusionTalker: Efficient and Compact Speech-Driven 3D Talking Head via Personalizer-Guided Distillation☆24Mar 25, 2025Updated last year
- Official pytorch implementation for Learning to Listen: Modeling Non-Deterministic Dyadic Facial Motion (CVPR 2022)☆127Aug 18, 2024Updated last year
- Official implementation of Transpotter, published in BMVC 2021☆16Aug 6, 2022Updated 3 years ago
- Deploy open-source AI quickly and easily - Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- ICASSP'22 Training Strategies for Improved Lip-Reading; ICASSP'21 Towards Practical Lipreading with Distilled and Efficient Models; ICASS…☆433May 18, 2023Updated 2 years ago
- Official Pytorch Implementation of SPECTRE: Visual Speech-Aware Perceptual 3D Facial Expression Reconstruction from Videos☆297Mar 24, 2025Updated last year
- Official code for the paper "Visual Speech Enhancement Without A Real Visual Stream" published at WACV 2021☆107May 27, 2024Updated last year
- FACIAL: Synthesizing Dynamic Talking Face With Implicit Attribute Learning. ICCV, 2021.☆383Jun 30, 2022Updated 3 years ago
- The speaker-labeled information of LRW dataset, which is the outcome of the paper "Speaker-adaptive Lip Reading with User-dependent Paddi…☆10Oct 12, 2023Updated 2 years ago
- Unoffical LivePortrait Training Script [ 🚧 Under Construction]☆39Jan 28, 2025Updated last year
- Lips Don't Lie: A Generalisable and Robust Approach to Face Forgery Detection (CVPR 2021)☆144Feb 1, 2024Updated 2 years ago
- This is the official source for our ACM MM 2023 paper "SelfTalk: A Self-Supervised Commutative Training Diagram to Comprehend 3D Talking …☆143Dec 5, 2023Updated 2 years ago
- The Official PyTorch Implementation for Face2Face^ρ (ECCV2022)☆227May 6, 2023Updated 2 years ago
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- SLMGAN: Exploiting Speech Language Model Representations for Unsupervised Zero-Shot Voice Conversion in GANs☆16Jul 19, 2023Updated 2 years ago
- ☆22Mar 31, 2022Updated 4 years ago
- the dataset and code for "Flow-guided One-shot Talking Face Generation with a High-resolution Audio-visual Dataset"☆424May 12, 2024Updated last year
- Visual Speech Recongnition☆20Dec 24, 2024Updated last year
- Code for Pose-Controllable Talking Face Generation by Implicitly Modularized Audio-Visual Representation (CVPR 2021)☆959Jan 6, 2024Updated 2 years ago
- ☆102Oct 30, 2025Updated 5 months ago
- ☆24Feb 20, 2024Updated 2 years ago