☆22 Mar 31, 2022 Updated 4 years ago
Alternatives and similar repositories for End-to-End-Lip-Synchronization-with-a-Temporal-AutoEncoder
Users that are interested in End-to-End-Lip-Synchronization-with-a-Temporal-AutoEncoder are comparing it to the libraries listed below.
- Automatically generate a lip-synced avatar based off of a transcript and audio ☆15 Feb 17, 2023 Updated 3 years ago
- ☆24 Oct 8, 2021 Updated 4 years ago
- ☆14 Jun 16, 2023 Updated 2 years ago
- Code & demo for the animation of still facial landmarks from an initial pose. ☆15 Jan 19, 2023 Updated 3 years ago
- This dataset is presented in the paper Merkel Podcast Corpus: A Multimodal Dataset Compiled from 16 Years of Angela Merkel's Weekly Video… ☆12 Sep 21, 2022 Updated 3 years ago
- Python implementation of the paper "Dynamic Temporal Alignment of Speech to Lips" ☆32 May 16, 2019 Updated 6 years ago
- Talking head animation ☆28 Dec 8, 2023 Updated 2 years ago
- Face Landmark-based Speaker-Independent Audio-Visual Speech Enhancement in Multi-Talker Environments ☆111 Mar 19, 2024 Updated 2 years ago
- ☆10 Jan 26, 2021 Updated 5 years ago
- FACIAL: Synthesizing Dynamic Talking Face With Implicit Attribute Learning. ICCV, 2021. ☆383 Jun 30, 2022 Updated 3 years ago
- ☆72 Jun 4, 2023 Updated 2 years ago
- Source code for "Sparse in Space and Time: Audio-visual Synchronisation with Trainable Selectors." (Spotlight at the BMVC 2022) ☆56 Jan 29, 2024 Updated 2 years ago
- SyncNet for Time Synchronization ☆30 Mar 13, 2023 Updated 3 years ago
- Automatic audiovisual translation with lip-syncing ☆10 Dec 21, 2019 Updated 6 years ago
- ☆11 Sep 7, 2020 Updated 5 years ago
- AlignNet: A Unifying Approach to Audio-Visual Alignment (WACV 2020) ☆34 Jan 10, 2021 Updated 5 years ago
- End to End Multiview Lip Reading ☆10 Jan 26, 2018 Updated 8 years ago
- SLMGAN: Exploiting Speech Language Model Representations for Unsupervised Zero-Shot Voice Conversion in GANs ☆16 Jul 19, 2023 Updated 2 years ago
- Official SpiceDB client library for Ruby ☆21 Apr 28, 2026 Updated last week
- NeurIPS 2022 ☆39 Nov 23, 2022 Updated 3 years ago
- Official repository of "SplatArmor: Articulated Gaussian splatting for animatable humans from monocular RGB videos" ☆20 Nov 29, 2023 Updated 2 years ago
- Official Pytorch Implementation of 3DV2021 paper: SAFA: Structure Aware Face Animation. ☆184 Oct 24, 2022 Updated 3 years ago
- ☆15 Dec 11, 2021 Updated 4 years ago
- Official repository for Diffused Heads: Diffusion Models Beat GANs on Talking-Face Generation ☆490 Apr 15, 2024 Updated 2 years ago
- Official repository of Tapir Lab.'s Lip-Sync Method ☆10 Oct 3, 2023 Updated 2 years ago
- ☆30 Jun 30, 2020 Updated 5 years ago
- Inference of resemble denoiser ☆30 Mar 11, 2024 Updated 2 years ago
- PITS: Variational Pitch Inference for End-to-end Pitch-controllable TTS without External Pitch Predictor ☆17 Apr 13, 2023 Updated 3 years ago
- Code and model for paper <Mutual Information Maximization for Effective Lip Reading> ☆19 Sep 4, 2020 Updated 5 years ago
- Inference service based on DINet, for running inference on video streams and videos ☆17 Nov 8, 2023 Updated 2 years ago
- Code for One-shot Talking Face Generation from Single-speaker Audio-Visual Correlation Learning (AAAI 2022) ☆359 Jan 16, 2023 Updated 3 years ago
- Official implementation of 'Out-of-domain GAN inversion via Invertibility Decomposition for Photo-Realistic Human Face Manipulation' ☆23 Feb 29, 2024 Updated 2 years ago
- [CVPR 2023] Official PyTorch implementation of MoStGAN-V ☆24 Jun 15, 2023 Updated 2 years ago
- Wav2Lip-Emotion extends Wav2Lip to modify facial expressions of emotions via L1 reconstruction and pre-trained emotion objectives. We als… ☆98 May 23, 2022 Updated 3 years ago
- Official repository for the paper VocaLiST: An Audio-Visual Synchronisation Model for Lips and Voices ☆74 Apr 7, 2024 Updated 2 years ago
- ☆15 Apr 29, 2025 Updated last year
- the dataset and code for "Flow-guided One-shot Talking Face Generation with a High-resolution Audio-visual Dataset" ☆107 May 12, 2024 Updated last year
- [ICASSP 2024] DiffDub: Person-generic visual dubbing using inpainting renderer with diffusion auto-encoder ☆69 Jul 21, 2024 Updated last year
- Webpage of "Portrait4D-v2: Pseudo Multi-View Data Creates Better 4D Head Synthesizer" ☆12 Jul 2, 2024 Updated last year