facebookresearch / audio2photorealView external linksLinks
Code and dataset for photorealistic Codec Avatars driven from audio
☆2,855Sep 15, 2024Updated last year
Alternatives and similar repositories for audio2photoreal
Users that are interested in audio2photoreal are comparing it to the libraries listed below
Sorting:
- Official implementations for paper: DreamTalk: When Expressive Talking Head Generation Meets Diffusion Probabilistic Models☆1,793Jan 15, 2024Updated 2 years ago
- Character Animation (AnimateAnyone, Face Reenactment)☆3,487May 31, 2024Updated last year
- AniPortrait: Audio-Driven Synthesis of Photorealistic Portrait Animation☆5,020Jul 2, 2024Updated last year
- Unofficial Implementation of Animate Anyone☆2,930Jul 9, 2024Updated last year
- Official repo for VGen: a holistic video generation ecosystem for video generation building on diffusion models☆3,153Jan 10, 2025Updated last year
- V-Express aims to generate a talking head video under the control of a reference image, an audio, and a sequence of V-Kps images.☆2,365Jan 24, 2025Updated last year
- PhotoMaker [CVPR 2024]☆10,118Oct 31, 2024Updated last year
- [CVPR 2024] Official repository for "MagicAnimate: Temporally Consistent Human Image Animation using Diffusion Model"☆10,906Aug 29, 2025Updated 5 months ago
- [CVPR 2024] This is the official source for our paper "SyncTalk: The Devil is in the Synchronization for Talking Head Synthesis"☆1,614Sep 18, 2025Updated 4 months ago
- Official implementation of DreaMoving☆1,802Jan 9, 2024Updated 2 years ago
- [CVPR 2024] Make-Your-Anchor: A Diffusion-based 2D Avatar Generation Framework.☆357Jan 28, 2025Updated last year
- StreamDiffusion: A Pipeline-Level Solution for Real-Time Interactive Generation☆10,652Dec 4, 2024Updated last year
- VideoCrafter2: Overcoming Data Limitations for High-Quality Video Diffusion Models☆5,025Jan 9, 2026Updated last month
- Official implementation of "En3D: An Enhanced Generative Model for Sculpting 3D Humans from 2D Synthetic Data", CVPR 2024; 3D Avatar Gene…☆534Nov 25, 2024Updated last year
- GeneFace: Generalized and High-Fidelity 3D Talking Face Synthesis; ICLR 2023; Official code☆2,660Oct 18, 2024Updated last year
- MuseV: Infinite-length and High Fidelity Virtual Human Video Generation with Visual Conditioned Parallel Denoising☆2,824Jun 28, 2024Updated last year
- Animate Anyone: Consistent and Controllable Image-to-Video Synthesis for Character Animation☆14,800Sep 20, 2025Updated 4 months ago
- Instant voice cloning by MIT and MyShell. Audio foundation model.☆35,918Apr 19, 2025Updated 9 months ago
- Real3D-Portrait: One-shot Realistic 3D Talking Portrait Synthesis; ICLR 2024 Spotlight; Official code☆1,089Oct 18, 2024Updated last year
- [SIGGRAPH Asia 2022] VideoReTalking: Audio-based Lip Synchronization for Talking Head Video Editing In the Wild☆7,212Aug 5, 2024Updated last year
- VividTalk: One-Shot Audio-Driven Talking Head Generation Based on 3D Hybrid Prior☆804Dec 5, 2023Updated 2 years ago
- [ACM MM 2024] This is the official code for "AniTalker: Animate Vivid and Diverse Talking Faces through Identity-Decoupled Facial Motion …☆1,604Aug 15, 2024Updated last year
- [CVPR 2023] SadTalker:Learning Realistic 3D Motion Coefficients for Stylized Audio-Driven Single Image Talking Face Animation☆13,590Jun 26, 2024Updated last year
- [CVPR 2024] Official repository for "Gaussian Head Avatar: Ultra High-fidelity Head Avatar via Dynamic Gaussians"☆856Jul 12, 2024Updated last year
- Code of SIGGRAPH 2023 Conference paper: StyleAvatar: Real-time Photo-realistic Portrait Avatar from a Single Video☆488Aug 6, 2023Updated 2 years ago
- This is the official source for our ICCV 2023 paper "EmoTalk: Speech-Driven Emotional Disentanglement for 3D Face Animation"☆408Feb 23, 2024Updated last year
- MusePose: a Pose-Driven Image-to-Video Framework for Virtual Human Generation☆2,654Mar 5, 2025Updated 11 months ago
- Official implementations for paper: Anydoor: zero-shot object-level image customization☆4,222Apr 8, 2024Updated last year
- 📖 A curated list of resources dedicated to talking face.☆1,539Dec 23, 2024Updated last year
- Official implementation code of the paper <AnyText: Multilingual Visual Text Generation And Editing>☆4,841Mar 7, 2025Updated 11 months ago
- InstantID: Zero-shot Identity-Preserving Generation in Seconds 🔥☆11,909Jul 18, 2024Updated last year
- FaceChain is a deep-learning toolchain for generating your Digital-Twin.☆9,501Jun 6, 2025Updated 8 months ago
- Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junio…☆9,687May 27, 2025Updated 8 months ago
- [ECCV 2024, Oral] DynamiCrafter: Animating Open-domain Images with Video Diffusion Priors☆2,991Sep 8, 2024Updated last year
- ☆724Feb 9, 2024Updated 2 years ago
- Emote Portrait Alive: Generating Expressive Portrait Videos with Audio2Video Diffusion Model under Weak Conditions☆7,641Aug 21, 2024Updated last year
- Official code for ICCV 2023 paper: "Efficient Emotional Adaptation for Audio-Driven Talking-Head Generation".☆300May 30, 2025Updated 8 months ago
- [CVPR 2023] MetaPortrait: Identity-Preserving Talking Head Generation with Fast Personalized Adaptation☆547May 21, 2023Updated 2 years ago
- Official implementation of AnimateDiff.☆12,018Jul 31, 2024Updated last year