Code and dataset for photorealistic Codec Avatars driven from audio
☆2,858Sep 15, 2024Updated last year
Alternatives and similar repositories for audio2photoreal
Users that are interested in audio2photoreal are comparing it to the libraries listed below
Sorting:
- Official implementations for paper: DreamTalk: When Expressive Talking Head Generation Meets Diffusion Probabilistic Models☆1,792Jan 15, 2024Updated 2 years ago
- Character Animation (AnimateAnyone, Face Reenactment)☆3,492May 31, 2024Updated last year
- AniPortrait: Audio-Driven Synthesis of Photorealistic Portrait Animation☆5,018Jul 2, 2024Updated last year
- Unofficial Implementation of Animate Anyone☆2,932Jul 9, 2024Updated last year
- Official repo for VGen: a holistic video generation ecosystem for video generation building on diffusion models☆3,154Jan 10, 2025Updated last year
- V-Express aims to generate a talking head video under the control of a reference image, an audio, and a sequence of V-Kps images.☆2,365Jan 24, 2025Updated last year
- PhotoMaker [CVPR 2024]☆10,125Oct 31, 2024Updated last year
- [CVPR 2024] Official repository for "MagicAnimate: Temporally Consistent Human Image Animation using Diffusion Model"☆10,908Aug 29, 2025Updated 6 months ago
- [CVPR 2024] This is the official source for our paper "SyncTalk: The Devil is in the Synchronization for Talking Head Synthesis"☆1,620Sep 18, 2025Updated 5 months ago
- Official implementation of DreaMoving☆1,801Jan 9, 2024Updated 2 years ago
- [CVPR 2024] Make-Your-Anchor: A Diffusion-based 2D Avatar Generation Framework.☆357Feb 22, 2026Updated 2 weeks ago
- StreamDiffusion: A Pipeline-Level Solution for Real-Time Interactive Generation☆10,671Dec 4, 2024Updated last year
- VideoCrafter2: Overcoming Data Limitations for High-Quality Video Diffusion Models☆5,035Jan 9, 2026Updated 2 months ago
- Official implementation of "En3D: An Enhanced Generative Model for Sculpting 3D Humans from 2D Synthetic Data", CVPR 2024; 3D Avatar Gene…☆533Nov 25, 2024Updated last year
- GeneFace: Generalized and High-Fidelity 3D Talking Face Synthesis; ICLR 2023; Official code☆2,659Oct 18, 2024Updated last year
- MuseV: Infinite-length and High Fidelity Virtual Human Video Generation with Visual Conditioned Parallel Denoising☆2,827Jun 28, 2024Updated last year
- Animate Anyone: Consistent and Controllable Image-to-Video Synthesis for Character Animation☆14,799Sep 20, 2025Updated 5 months ago
- Instant voice cloning by MIT and MyShell. Audio foundation model.☆36,049Apr 19, 2025Updated 10 months ago
- Real3D-Portrait: One-shot Realistic 3D Talking Portrait Synthesis; ICLR 2024 Spotlight; Official code☆1,093Oct 18, 2024Updated last year
- [SIGGRAPH Asia 2022] VideoReTalking: Audio-based Lip Synchronization for Talking Head Video Editing In the Wild☆7,218Aug 5, 2024Updated last year
- VividTalk: One-Shot Audio-Driven Talking Head Generation Based on 3D Hybrid Prior☆803Dec 5, 2023Updated 2 years ago
- [ACM MM 2024] This is the official code for "AniTalker: Animate Vivid and Diverse Talking Faces through Identity-Decoupled Facial Motion …☆1,606Aug 15, 2024Updated last year
- [CVPR 2023] SadTalker:Learning Realistic 3D Motion Coefficients for Stylized Audio-Driven Single Image Talking Face Animation☆13,629Jun 26, 2024Updated last year
- [CVPR 2024] Official repository for "Gaussian Head Avatar: Ultra High-fidelity Head Avatar via Dynamic Gaussians"☆856Jul 12, 2024Updated last year
- Code of SIGGRAPH 2023 Conference paper: StyleAvatar: Real-time Photo-realistic Portrait Avatar from a Single Video☆488Aug 6, 2023Updated 2 years ago
- This is the official source for our ICCV 2023 paper "EmoTalk: Speech-Driven Emotional Disentanglement for 3D Face Animation"☆411Feb 23, 2024Updated 2 years ago
- MusePose: a Pose-Driven Image-to-Video Framework for Virtual Human Generation☆2,656Mar 5, 2025Updated last year
- Official implementations for paper: Anydoor: zero-shot object-level image customization☆4,219Apr 8, 2024Updated last year
- 📖 A curated list of resources dedicated to talking face.☆1,538Dec 23, 2024Updated last year
- Official implementation code of the paper <AnyText: Multilingual Visual Text Generation And Editing>☆4,839Mar 7, 2025Updated last year
- InstantID: Zero-shot Identity-Preserving Generation in Seconds 🔥☆11,924Jul 18, 2024Updated last year
- FaceChain is a deep-learning toolchain for generating your Digital-Twin.☆9,499Jun 6, 2025Updated 9 months ago
- Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junio…☆9,706May 27, 2025Updated 9 months ago
- [ECCV 2024, Oral] DynamiCrafter: Animating Open-domain Images with Video Diffusion Priors☆2,995Sep 8, 2024Updated last year
- ☆724Feb 9, 2024Updated 2 years ago
- Emote Portrait Alive: Generating Expressive Portrait Videos with Audio2Video Diffusion Model under Weak Conditions☆7,638Aug 21, 2024Updated last year
- Official code for ICCV 2023 paper: "Efficient Emotional Adaptation for Audio-Driven Talking-Head Generation".☆300May 30, 2025Updated 9 months ago
- [CVPR 2023] MetaPortrait: Identity-Preserving Talking Head Generation with Fast Personalized Adaptation☆546May 21, 2023Updated 2 years ago
- Official implementation of AnimateDiff.☆12,045Jul 31, 2024Updated last year