michaelzhang-ai / Speech2VideoView external linksLinks
Code for ACCV 2020 "Speech2Video Synthesis with 3D Skeleton Regularization and Expressive Body Poses"
☆100Apr 8, 2021Updated 4 years ago
Alternatives and similar repositories for Speech2Video
Users that are interested in Speech2Video are comparing it to the libraries listed below
Sorting:
- A modified version of vid2vid for Speech2Video, Text2Video Paper☆36Jun 4, 2023Updated 2 years ago
- ☆208Mar 10, 2021Updated 4 years ago
- ICASSP 2022: "Text2Video: text-driven talking-head video synthesis with phonetic dictionary".☆441Jun 4, 2023Updated 2 years ago
- ☆15Oct 28, 2019Updated 6 years ago
- Code for "Audio-driven Talking Face Video Generation with Learning-based Personalized Head Pose" (Arxiv 2020) and "Predicting Personalize…☆774Dec 15, 2023Updated 2 years ago
- Official github repo for paper "What comprises a good talking-head video generation?: A Survey and Benchmark"☆91Dec 8, 2022Updated 3 years ago
- Code for SEEG: Semantic Energized Co-speech Gesture Generation☆33Dec 3, 2022Updated 3 years ago
- AudioDVP:Photorealistic Audio-driven Video Portraits☆301Feb 27, 2024Updated last year
- Code for Pose-Controllable Talking Face Generation by Implicitly Modularized Audio-Visual Representation (CVPR 2021)☆961Jan 6, 2024Updated 2 years ago
- This codebase demonstrates how to synthesize realistic 3D character animations given an arbitrary speech signal and a static character me…☆1,247Aug 20, 2024Updated last year
- A repository for generating stylized talking 3D and 3D face☆279Nov 11, 2021Updated 4 years ago
- ☆521Aug 14, 2025Updated 6 months ago
- Talking Face Generation by Conditional Recurrent Adversarial Network☆61Dec 6, 2019Updated 6 years ago
- An improved version of APB2Face: Real-Time Audio-Guided Multi-Face Reenactment☆84Oct 7, 2021Updated 4 years ago
- Code for Talking Face Generation by Adversarially Disentangled Audio-Visual Representation (AAAI 2019)☆816May 11, 2021Updated 4 years ago
- This repository contains a PyTorch implementation of "AD-NeRF: Audio Driven Neural Radiance Fields for Talking Head Synthesis".☆1,068Oct 27, 2023Updated 2 years ago
- Code for MeshTalk: 3D Face Animation from Speech using Cross-Modality Disentanglement☆399Oct 3, 2022Updated 3 years ago
- Mocap Dataset of “Write-a-speaker: Text-based Emotional and Rhythmic Talking-head Generation”☆161Oct 15, 2021Updated 4 years ago
- code for training the models from the paper "Learning Individual Styles of Conversational Gestures"☆394Mar 6, 2024Updated last year
- PyTorch implementation of our graph convolutional network (GCN) for human motion generation from music. Also with paired dance-music data…☆90Jan 28, 2024Updated 2 years ago
- ☆11Dec 30, 2022Updated 3 years ago
- This repository contains the codes for LipGAN. LipGAN was published as a part of the paper titled "Towards Automatic Face-to-Face Transla…☆613Jun 22, 2025Updated 7 months ago
- This is the official PyTorch implementation of the CVPR 2020 paper "TransMoMo: Invariance-Driven Unsupervised Video Motion Retargeting".☆405Mar 18, 2021Updated 4 years ago
- ☆967Sep 10, 2023Updated 2 years ago
- This github contains the network architectures of NeuralVoicePuppetry.☆179Jun 12, 2020Updated 5 years ago
- Official Repository for the paper Style Transfer for Co-Speech Gesture Animation: A Multi-Speaker Conditional-Mixture Approach published …☆30Jun 24, 2024Updated last year
- Live Speech Portraits: Real-Time Photorealistic Talking-Head Animation (SIGGRAPH Asia 2021)☆1,283Jun 19, 2023Updated 2 years ago
- Code for paper 'Audio-Driven Emotional Video Portraits'.☆314Mar 16, 2022Updated 3 years ago
- CVPR 2019☆259May 24, 2023Updated 2 years ago
- Human Video Generation Paper List☆476Mar 2, 2024Updated last year
- ☆487Aug 8, 2023Updated 2 years ago
- [ECCV 2022] StyleHEAT: A framework for high-resolution editable talking face generation☆658Mar 26, 2023Updated 2 years ago
- official repo for AAAI ALOHA chatbot☆29Dec 28, 2023Updated 2 years ago
- wav2lip in a Vector Quantized (VQ) space☆27Jun 20, 2023Updated 2 years ago
- This repository contains the code for my master thesis on Emotion-Aware Facial Animation☆147Dec 8, 2022Updated 3 years ago
- Speech Gesture Generation from the Trimodal Context of Text, Audio, and Speaker Identity (SIGGRAPH Asia 2020)☆273Dec 14, 2021Updated 4 years ago
- FLNet: Landmark Driven Fetching and Learning Network for Faithful Talking Facial Animation Synthesis☆26Nov 19, 2019Updated 6 years ago
- ☆94Aug 7, 2021Updated 4 years ago
- Collection of works from VIPL-AVSU☆50Aug 2, 2025Updated 6 months ago