facebookresearch / audio2photorealLinks
Code and dataset for photorealistic Codec Avatars driven from audio
☆2,822Updated 10 months ago
Alternatives and similar repositories for audio2photoreal
Users that are interested in audio2photoreal are comparing it to the libraries listed below
Sorting:
- Official implementations for paper: DreamTalk: When Expressive Talking Head Generation Meets Diffusion Probabilistic Models☆1,746Updated last year
- Official repo for VGen: a holistic video generation ecosystem for video generation building on diffusion models☆3,113Updated 6 months ago
- Official implementation of DreaMoving☆1,801Updated last year
- V-Express aims to generate a talking head video under the control of a reference image, an audio, and a sequence of V-Kps images.☆2,342Updated 5 months ago
- Character Animation (AnimateAnyone, Face Reenactment)☆3,417Updated last year
- AniPortrait: Audio-Driven Synthesis of Photorealistic Portrait Animation☆4,978Updated last year
- [ACM MM 2024] This is the official code for "AniTalker: Animate Vivid and Diverse Talking Faces through Identity-Decoupled Facial Motion …☆1,584Updated 11 months ago
- Let us democratise high-resolution generation! (CVPR 2024)☆2,020Updated last year
- MusePose: a Pose-Driven Image-to-Video Framework for Virtual Human Generation☆2,563Updated 4 months ago
- MuseV: Infinite-length and High Fidelity Virtual Human Video Generation with Visual Conditioned Parallel Denoising☆2,746Updated last year
- Convert your videos to densepose and use it on MagicAnimate☆1,095Updated last year
- Mora: More like Sora for Generalist Video Generation☆1,565Updated 9 months ago
- ☆2,453Updated last year
- [SIGGRAPH Asia 2023] Rerender A Video: Zero-Shot Text-Guided Video-to-Video Translation☆2,985Updated last year
- Official implementations for paper: Anydoor: zero-shot object-level image customization☆4,168Updated last year
- VividTalk: One-Shot Audio-Driven Talking Head Generation Based on 3D Hybrid Prior☆792Updated last year
- Unofficial Implementation of Animate Anyone☆2,934Updated last year
- Foundational model for human-like, expressive TTS☆4,135Updated 11 months ago
- Unofficial Implementation of Animate Anyone by Novita AI☆778Updated last year
- [ECCV 2024, Oral] DynamiCrafter: Animating Open-domain Images with Video Diffusion Priors☆2,900Updated 10 months ago
- Text-to-Audio/Music Generation☆2,467Updated 9 months ago
- [ICCV 2023] StableVideo: Text-driven Consistency-aware Diffusion Video Editing☆1,436Updated last year
- The official implementation of HierSpeech++☆1,224Updated last year
- Inference and training library for high-quality TTS models.☆5,342Updated 7 months ago
- MagicEdit: High-Fidelity Temporally Coherent Video Editing☆1,799Updated last year
- [CVPR 2024] This is the official source for our paper "SyncTalk: The Devil is in the Synchronization for Talking Head Synthesis"☆1,529Updated 3 weeks ago
- ☆30Updated last year
- Zero-Shot Speech Editing and Text-to-Speech in the Wild☆8,319Updated 4 months ago
- MARS5 speech model (TTS) from CAMB.AI☆2,780Updated 11 months ago
- [ICML 2024] Mastering Text-to-Image Diffusion: Recaptioning, Planning, and Generating with Multimodal LLMs (RPG)☆1,814Updated 5 months ago