facebookresearch / audio2photoreal
Code and dataset for photorealistic Codec Avatars driven from audio
☆2,832 · Updated 11 months ago
Alternatives and similar repositories for audio2photoreal
Users who are interested in audio2photoreal are comparing it to the repositories listed below:
- Official implementations for paper: DreamTalk: When Expressive Talking Head Generation Meets Diffusion Probabilistic Models ☆1,757 · Updated last year
- Official repo for VGen: a holistic video generation ecosystem for video generation building on diffusion models ☆3,128 · Updated 7 months ago
- V-Express aims to generate a talking head video under the control of a reference image, an audio, and a sequence of V-Kps images. ☆2,352 · Updated 7 months ago
- Official implementation of DreaMoving ☆1,802 · Updated last year
- AniPortrait: Audio-Driven Synthesis of Photorealistic Portrait Animation ☆4,999 · Updated last year
- Character Animation (AnimateAnyone, Face Reenactment) ☆3,430 · Updated last year
- [ACM MM 2024] This is the official code for "AniTalker: Animate Vivid and Diverse Talking Faces through Identity-Decoupled Facial Motion … ☆1,585 · Updated last year
- MusePose: a Pose-Driven Image-to-Video Framework for Virtual Human Generation ☆2,586 · Updated 5 months ago
- Convert your videos to densepose and use it on MagicAnimate ☆1,098 · Updated last year
- ☆2,458 · Updated last year
- MuseV: Infinite-length and High Fidelity Virtual Human Video Generation with Visual Conditioned Parallel Denoising ☆2,768 · Updated last year
- Let us democratise high-resolution generation! (CVPR 2024) ☆2,025 · Updated last year
- Official implementations for paper: Anydoor: zero-shot object-level image customization ☆4,176 · Updated last year
- Mora: More like Sora for Generalist Video Generation ☆1,567 · Updated 10 months ago
- Unofficial Implementation of Animate Anyone ☆2,935 · Updated last year
- [ECCV 2024, Oral] DynamiCrafter: Animating Open-domain Images with Video Diffusion Priors ☆2,930 · Updated 11 months ago
- Text-to-Audio/Music Generation ☆2,486 · Updated 11 months ago
- [ICCV 2023] StableVideo: Text-driven Consistency-aware Diffusion Video Editing ☆1,437 · Updated last year
- Foundational model for human-like, expressive TTS ☆4,155 · Updated last year
- VividTalk: One-Shot Audio-Driven Talking Head Generation Based on 3D Hybrid Prior ☆793 · Updated last year
- Official repo for "Mini-Gemini: Mining the Potential of Multi-modality Vision Language Models" ☆3,310 · Updated last year
- MagicEdit: High-Fidelity Temporally Coherent Video Editing ☆1,801 · Updated 2 years ago
- Official Pytorch Implementation for "TokenFlow: Consistent Diffusion Features for Consistent Video Editing" presenting "TokenFlow" (ICLR … ☆1,675 · Updated 6 months ago
- High-Quality Human Motion Video Generation with Confidence-aware Pose Guidance ☆2,437 · Updated last month
- [ICML 2024] Mastering Text-to-Image Diffusion: Recaptioning, Planning, and Generating with Multimodal LLMs (RPG) ☆1,822 · Updated 6 months ago
- [CVPR 2024] This is the official source for our paper "SyncTalk: The Devil is in the Synchronization for Talking Head Synthesis" ☆1,543 · Updated 3 weeks ago
- Emote Portrait Alive: Generating Expressive Portrait Videos with Audio2Video Diffusion Model under Weak Conditions ☆7,649 · Updated last year
- Official implementation code of the paper <AnyText: Multilingual Visual Text Generation And Editing> ☆4,754 · Updated 5 months ago
- InstantStyle: Free Lunch towards Style-Preserving in Text-to-Image Generation 🔥 ☆1,961 · Updated 11 months ago
- [WIP] Layer Diffusion for WebUI (via Forge) ☆4,093 · Updated last year