facebookresearch / audio2photorealLinks
Code and dataset for photorealistic Codec Avatars driven from audio
☆2,807Updated 8 months ago
Alternatives and similar repositories for audio2photoreal
Users that are interested in audio2photoreal are comparing it to the libraries listed below
Sorting:
- Official implementation of DreaMoving☆1,800Updated last year
- Foundational model for human-like, expressive TTS☆4,133Updated 10 months ago
- V-Express aims to generate a talking head video under the control of a reference image, an audio, and a sequence of V-Kps images.☆2,338Updated 4 months ago
- Official implementations for paper: DreamTalk: When Expressive Talking Head Generation Meets Diffusion Probabilistic Models☆1,730Updated last year
- Character Animation (AnimateAnyone, Face Reenactment)☆3,394Updated last year
- [ECCV 2024, Oral] DynamiCrafter: Animating Open-domain Images with Video Diffusion Priors☆2,868Updated 9 months ago
- Convert your videos to densepose and use it on MagicAnimate☆1,092Updated last year
- Official repo for VGen: a holistic video generation ecosystem for video generation building on diffusion models☆3,108Updated 4 months ago
- Unofficial Implementation of Animate Anyone☆2,928Updated 10 months ago
- AniPortrait: Audio-Driven Synthesis of Photorealistic Portrait Animation☆4,950Updated 11 months ago
- [ACM MM 2024] This is the official code for "AniTalker: Animate Vivid and Diverse Talking Faces through Identity-Decoupled Facial Motion …☆1,577Updated 9 months ago
- [ECCV 2024] Champ: Controllable and Consistent Human Image Animation with 3D Parametric Guidance☆4,204Updated 10 months ago
- MusePose: a Pose-Driven Image-to-Video Framework for Virtual Human Generation☆2,549Updated 3 months ago
- ☆728Updated last year
- The official implementation of HierSpeech++☆1,220Updated last year
- VividTalk: One-Shot Audio-Driven Talking Head Generation Based on 3D Hybrid Prior☆789Updated last year
- Let us democratise high-resolution generation! (CVPR 2024)☆2,012Updated last year
- Inference and training library for high-quality TTS models.☆5,285Updated 5 months ago
- [ICML 2024] MagicPose(also known as MagicDance): Realistic Human Poses and Facial Expressions Retargeting with Identity-aware Diffusion☆758Updated 11 months ago
- Unofficial Implementation of Animate Anyone by Novita AI☆776Updated last year
- [SIGGRAPH Asia 2023] Rerender A Video: Zero-Shot Text-Guided Video-to-Video Translation☆2,984Updated last year
- Hallo: Hierarchical Audio-Driven Visual Synthesis for Portrait Image Animation☆8,483Updated 8 months ago
- Official implementation of "En3D: An Enhanced Generative Model for Sculpting 3D Humans from 2D Synthetic Data", CVPR 2024; 3D Avatar Gene…☆509Updated 6 months ago
- MuseV: Infinite-length and High Fidelity Virtual Human Video Generation with Visual Conditioned Parallel Denoising☆2,721Updated 11 months ago
- [SIGGRAPH Asia 2022] VideoReTalking: Audio-based Lip Synchronization for Talking Head Video Editing In the Wild☆7,059Updated 10 months ago
- Official implementations for paper: Anydoor: zero-shot object-level image customization☆4,149Updated last year
- Generative models for conditional audio generation☆3,308Updated this week
- GeneFace: Generalized and High-Fidelity 3D Talking Face Synthesis; ICLR 2023; Official code☆2,605Updated 7 months ago
- WhisperFusion builds upon the capabilities of WhisperLive and WhisperSpeech to provide a seamless conversations with an AI.☆1,608Updated 10 months ago
- High-quality multi-lingual text-to-speech library by MyShell.ai. Support English, Spanish, French, Chinese, Japanese and Korean.☆6,140Updated 5 months ago