☆22Mar 31, 2022Updated 4 years ago
Alternatives and similar repositories for End-to-End-Lip-Synchronization-with-a-Temporal-AutoEncoder
Users that are interested in End-to-End-Lip-Synchronization-with-a-Temporal-AutoEncoder are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Automatically generate a lip-synced avatar based off of a transcript and audio☆15Feb 17, 2023Updated 3 years ago
- ☆24Oct 8, 2021Updated 4 years ago
- ☆14Jun 16, 2023Updated 2 years ago
- Code & demo for the animation of still facial landmarks from an initial pose.☆15Jan 19, 2023Updated 3 years ago
- This dataset is presented in the paper Merkel Podcast Corpus: A Multimodal Dataset Compiled from 16 Years of Angela Merkel's Weekly Video…☆12Sep 21, 2022Updated 3 years ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- Python implementation of the paper " Dynamic Temporal Alignment of Speech to Lips"☆32May 16, 2019Updated 6 years ago
- Talking head animation☆28Dec 8, 2023Updated 2 years ago
- Face Landmark-based Speaker-Independent Audio-Visual Speech Enhancement in Multi-Talker Environments☆111Mar 19, 2024Updated 2 years ago
- FACIAL: Synthesizing Dynamic Talking Face With Implicit Attribute Learning. ICCV, 2021.☆383Jun 30, 2022Updated 3 years ago
- ☆24Feb 20, 2024Updated 2 years ago
- ☆72Jun 4, 2023Updated 2 years ago
- Source code for "Sparse in Space and Time: Audio-visual Synchronisation with Trainable Selectors." (Spotlight at the BMVC 2022)☆56Jan 29, 2024Updated 2 years ago
- SyncNet for Time Synchronization☆30Mar 13, 2023Updated 3 years ago
- Automatic audiovisual translation with lip-syncing☆10Dec 21, 2019Updated 6 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- AlignNet: A Unifying Approach to Audio-Visual Alignment (WACV 2020)☆34Jan 10, 2021Updated 5 years ago
- ☆11Sep 7, 2020Updated 5 years ago
- End to End Multiview Lip Reading☆10Jan 26, 2018Updated 8 years ago
- The official repository of "Encode-in-Style: Latent-based Video Encoding using StyleGAN2"☆47Feb 15, 2023Updated 3 years ago
- SLMGAN: Exploiting Speech Language Model Representations for Unsupervised Zero-Shot Voice Conversion in GANs☆16Jul 19, 2023Updated 2 years ago
- NeurIPS 2022☆39Nov 23, 2022Updated 3 years ago
- Optimized Syncnet and Chinese enhanced version, EN and CN checkpoints released☆11Nov 8, 2021Updated 4 years ago
- Official repository of "SplatArmor: Articulated Gaussian splatting for animatable humans from monocular RGB videos"☆20Nov 29, 2023Updated 2 years ago
- Official Pytorch Implementation of 3DV2021 paper: SAFA: Structure Aware Face Animation.☆184Oct 24, 2022Updated 3 years ago
- Serverless GPU API endpoints on Runpod - Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- Official repository for Diffused Heads: Diffusion Models Beat GANs on Talking-Face Generation☆490Apr 15, 2024Updated 2 years ago
- ☆15Dec 11, 2021Updated 4 years ago
- Official repository of Tapir Lab.'s Lip-Sync Method☆10Oct 3, 2023Updated 2 years ago
- ☆30Jun 30, 2020Updated 5 years ago
- Inference of resemble denoiser☆30Mar 11, 2024Updated 2 years ago
- 基于DINet的推理服务,推理视频流和视频☆17Nov 8, 2023Updated 2 years ago
- Code and model for paper <Mutual Information Maximization for Effective Lip Reading>☆19Sep 4, 2020Updated 5 years ago
- Code for One-shot Talking Face Generation from Single-speaker Audio-Visual Correlation Learning (AAAI 2022)☆359Jan 16, 2023Updated 3 years ago
- Official implementation of 'Out-of-domain GAN inversion via Invertibility Decomposition for Photo-Realistic Human Face Manipulation'☆23Feb 29, 2024Updated 2 years ago
- Serverless GPU API endpoints on Runpod - Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- [CVPR 2023] Official PyTorch implementation of MoStGAN-V☆24Jun 15, 2023Updated 2 years ago
- Wav2Lip-Emotion extends Wav2Lip to modify facial expressions of emotions via L1 reconstruction and pre-trained emotion objectives. We als…☆97May 23, 2022Updated 3 years ago
- Official repository for the paper VocaLiST: An Audio-Visual Synchronisation Model for Lips and Voices☆74Apr 7, 2024Updated 2 years ago
- ☆15Apr 29, 2025Updated 11 months ago
- the dataset and code for "Flow-guided One-shot Talking Face Generation with a High-resolution Audio-visual Dataset"☆107May 12, 2024Updated last year
- [ICASSP 2024] DiffDub: Person-generic visual dubbing using inpainting renderer with diffusion auto-encoder☆69Jul 21, 2024Updated last year
- Webpage of "Portrait4D-v2: Pseudo Multi-View Data Creates Better 4D Head Synthesizer"☆12Jul 2, 2024Updated last year