LorenzoGianassi / Land-Diffuser
The Land-Diffuser is a novel application of the Denoising Diffusion Probabilistic Model (DDPM) in the realm of 3D Talking Head generation from raw audio inputs.
☆13Updated last year
Alternatives and similar repositories for Land-Diffuser
Users who are interested in Land-Diffuser are comparing it to the repositories listed below.
- ☆15Updated 6 months ago
- ☆12Updated last year
- Dual-Branch Network for Portrait Image Quality Assessment☆17Updated last month
- [ICIAP 2023] Learning Landmarks Motion from Speech for Speaker-Agnostic 3D Talking Heads Generation☆62Updated last year
- Official Code for "Intelligent Painter: Picture Composition With Resampling Diffusion Model" (ICIP 2023)☆16Updated 2 years ago
- [CVPR '23] Unite and Conquer: Plug & Play Multi-Modal Synthesis using Diffusion Models☆36Updated last year
- ☆16Updated last year
- ☆33Updated 2 years ago
- Official repository for Polarity Sampling, CVPR 2022 ORAL☆13Updated 3 years ago
- Diffusion Cocktail: Mixing Domain-Specific Diffusion Models for Diversified Image Generations☆25Updated last year
- ☆10Updated 2 years ago
- Code Release for the paper "Make-A-Story: Visual Memory Conditioned Consistent Story Generation" in CVPR 2023☆43Updated 2 years ago
- GANalyzer: Analysis and Manipulation of GANs Latent Space for Controllable Face Synthesis☆39Updated last year
- DiffBlender: Scalable and Composable Multimodal Text-to-Image Diffusion Models☆45Updated last year
- [CVPR 2023] Official PyTorch implementation of MoStGAN-V☆24Updated 2 years ago
- [AAAI 2024] Style2Talker - Official PyTorch Implementation☆46Updated 3 months ago
- DoodleFormer: Creative Sketch Drawing with Transformers (ECCV22)☆29Updated 3 years ago
- Wire Removal Video Datasets 2 (WRV2)☆46Updated 3 months ago
- Adaptive Nonlinear Latent Transformation for Conditional Face Editing (ICCV 2023)☆37Updated 2 years ago
- My Implementation of "Structure and Content-Guided Video Synthesis with Diffusion Models" by RunwayML☆25Updated last year
- Official code for SeMani (CVPR 2020 oral and Journal extension)☆23Updated last year
- ☆47Updated 2 years ago
- [ICLR 24] MaGIC: Multi-modality Guided Image Completion☆52Updated last year
- [ICCV 2023] Controllable Person Image Synthesis with Pose‑Constrained Latent Diffusion☆43Updated 2 years ago
- Fine-grained Image Editing by Pixel-wise Guidance Using Diffusion Models☆40Updated 2 years ago
- ☆33Updated 2 years ago
- Face Parsing via SegNeXt, trained on CelebAMask-HQ☆15Updated last year
- Official PyTorch implementation of "Learning to Generate Semantic Layouts for Higher Text-Image Correspondence in Text-to-Image Synthesis…☆45Updated 2 years ago
- This repository is for The Power of Sound (TPoS): Audio Reactive Video Generation with Stable Diffusion (ICCV 2023)☆23Updated last year
- Implementation of Muse: Text-to-Image Generation via Masked Generative Transformers, in Pytorch☆32Updated last year