poloclub / ClickDiffusion
ClickDiffusion: Harnessing LLMs for Interactive Precise Image Editing
☆66 Updated 7 months ago
Alternatives and similar repositories for ClickDiffusion:
Users who are interested in ClickDiffusion are comparing it to the libraries listed below:
- ☆20 Updated 3 weeks ago
- (AAAI'25) Training-and-prompt Free General Painterly Image Harmonization Using image-wise attention sharing ☆54 Updated last month
- Official implementation for "pOps: Photo-Inspired Diffusion Operators" ☆75 Updated 5 months ago
- Recaption large (Web)Datasets with vllm and save the artifacts. ☆44 Updated last month
- ☆80 Updated 4 months ago
- A streamlined implementation of Grounding DINO and SAM for advanced image segmentation. This lightweight solution simplifies the integrat… ☆63 Updated 3 months ago
- Implementation of "SCEdit: Efficient and Controllable Image Diffusion Generation via Skip Connection Editing" ☆84 Updated last year
- Website source code for our ACM MM'23 paper "Hierarchical Masked 3D Diffusion Model for Video Outpainting". ☆42 Updated 8 months ago
- Gradio app to track objects in video and add visual effects ☆16 Updated 3 months ago
- [ICLR 2024] Official repo. for Compose and Conquer: Diffusion-Based 3D Depth Aware Composable Image Synthesis ☆105 Updated last year
- Official implementation of "Art-Free Generative Models: Art Creation Without Graphic Art Knowledge" ☆30 Updated 3 weeks ago
- [ACM MM24] Official implementation of ACM MM 2024 paper: "ZePo: Zero-Shot Portrait Stylization with Faster Sampling" ☆36 Updated 4 months ago
- ☆3 Updated 3 months ago
- Official Implementation of GrounDiT (NeurIPS 2024) ☆46 Updated last month
- The official implementation of paper: DreamMix: Decoupling Object Attributes for Enhanced Editability in Customized Image Inpainting ☆108 Updated 2 weeks ago
- TextBoost: Towards One-Shot Personalization of Text-to-Image Models via Fine-tuning Text Encoder ☆48 Updated this week
- Official Implementation of weights2weights ☆135 Updated last month
- Edge Weight Prediction For Category-Agnostic Pose Estimation ☆35 Updated last month
- ☆33 Updated 11 months ago
- INF-LLaVA: Dual-perspective Perception for High-Resolution Multimodal Large Language Model ☆41 Updated 5 months ago
- Video-LLaVA fine-tune for CinePile evaluation ☆46 Updated 5 months ago
- ☆66 Updated 3 months ago
- ☆65 Updated last year
- Official PyTorch implementation of "6DoF Head Pose Estimation through Explicit Bidirectional Interaction with Face Geometry," ECCV 2024 ☆68 Updated 6 months ago
- ☆30 Updated 3 months ago
- A one-stop library to standardize the inference and evaluation of all the conditional video generation models. ☆47 Updated this week
- Paint by Inpaint: Learning to Add Image Objects by Removing Them First ☆93 Updated 4 months ago
- Implementation of the text to video model LUMIERE from the paper: "A Space-Time Diffusion Model for Video Generation" by Google Research ☆51 Updated 2 months ago
- ☆65 Updated 2 months ago