poloclub / ClickDiffusion
ClickDiffusion: Harnessing LLMs for Interactive Precise Image Editing
☆69Updated last year
Alternatives and similar repositories for ClickDiffusion
Users who are interested in ClickDiffusion are comparing it to the repositories listed below.
- (AAAI'25) Training-and-prompt Free General Painterly Image Harmonization Using image-wise attention sharing☆54Updated 5 months ago
- Collection of scripts to build small-scale datasets for fine-tuning video generation models.☆58Updated 2 months ago
- A one-stop library to standardize the inference and evaluation of all the conditional video generation models.☆48Updated 3 months ago
- ☆3Updated 8 months ago
- ☆84Updated 9 months ago
- Gradio app to track objects in video and add visual effects☆16Updated 2 weeks ago
- Implementation of the text to video model LUMIERE from the paper: "A Space-Time Diffusion Model for Video Generation" by Google Research☆50Updated 4 months ago
- [ACM MM24] Official implementation of ACM MM 2024 paper: "ZePo: Zero-Shot Portrait Stylization with Faster Sampling"☆41Updated 9 months ago
- Official implementation of Add-SD: Rational Generation without Manual Reference.☆27Updated 9 months ago
- Distilling Diversity and Control in Diffusion Models☆40Updated last month
- InteractiveVideo: User-Centric Controllable Video Generation with Synergistic Multimodal Instructions☆128Updated last year
- ☆22Updated 5 months ago
- Recaption large (Web)Datasets with vllm and save the artifacts.☆52Updated 6 months ago
- Official implementation for "pOps: Photo-Inspired Diffusion Operators"☆81Updated 10 months ago
- Scripts to teach Flux the task of image editing from language with the Flux Control framework.☆82Updated 2 months ago
- [ICLR 2024] Official repo. for Compose and Conquer: Diffusion-Based 3D Depth Aware Composable Image Synthesis☆104Updated last year
- Official implementation of MagicFace: Training-free Universal-Style Human Image Customized Synthesis.☆63Updated 5 months ago
- Official Implementation of GrounDiT (NeurIPS 2024)☆53Updated 5 months ago
- INF-LLaVA: Dual-perspective Perception for High-Resolution Multimodal Large Language Model☆42Updated 10 months ago
- Paper: "From Text to Pose to Image: Improving Diffusion Model Control and Quality"☆51Updated 6 months ago
- ☆22Updated 6 months ago
- ☆28Updated 10 months ago
- A streamlined implementation of Grounding DINO and SAM for advanced image segmentation. This lightweight solution simplifies the integrat…☆64Updated 8 months ago
- ☆68Updated 11 months ago
- Website source code for our ACM MM'23 paper "Hierarchical Masked 3D Diffusion Model for Video Outpainting".☆41Updated last year
- Use Florence 2 to auto-label data for use in training fine-tuned object detection models.☆64Updated 9 months ago
- The official implementation of paper: DreamMix: Decoupling Object Attributes for Enhanced Editability in Customized Image Inpainting☆119Updated 5 months ago
- A Gradio component that can be used to annotate images with bounding boxes.☆52Updated 3 months ago
- ☆30Updated last year
- Pytorch implementation of MIMO, Controllable Character Video Synthesis with Spatial Decomposed Modeling, from Alibaba Intelligence Group