poloclub / ClickDiffusion
ClickDiffusion: Harnessing LLMs for Interactive Precise Image Editing
☆67Updated 10 months ago
Alternatives and similar repositories for ClickDiffusion:
Users that are interested in ClickDiffusion are comparing it to the libraries listed below
- (AAAI'25) Training-and-pormpt Free General Painterly Image Harmonization Using image-wise attention sharing☆55Updated 3 months ago
- Distilling Diversity and Control in Diffusion Models☆35Updated 2 weeks ago
- ☆3Updated 6 months ago
- ☆22Updated 3 months ago
- ☆33Updated last year
- [ICLR 2024] Official repo. for Compose and Conquer: Diffusion-Based 3D Depth Aware Composable Image Synthesis☆104Updated last year
- Collection of scripts to build small-scale datasets for fine-tuning video generation models.☆52Updated 3 weeks ago
- Edge Weight Prediction For Category-Agnostic Pose Estimation☆41Updated 4 months ago
- Multimodal Representation Alignment for Image Generation: Text-Image Interleaved Control Is Easier Than You Think!☆98Updated last month
- InteractiveVideo: User-Centric Controllable Video Generation with Synergistic Multimodal Instructions☆129Updated last year
- [ACM MM24] Official implementation of ACM MM 2024 paper: "ZePo: Zero-Shot Portrait Stylization with Faster Sampling"☆39Updated 7 months ago
- Website source code for our ACM MM'23 paper "Hierarchical Masked 3D Diffusion Model for Video Outpainting".☆42Updated 11 months ago
- INF-LLaVA: Dual-perspective Perception for High-Resolution Multimodal Large Language Model☆42Updated 8 months ago
- Official implementation for "pOps: Photo-Inspired Diffusion Operators"☆80Updated 8 months ago
- Official Implementation of GrounDiT (NeurIPS 2024)☆50Updated 4 months ago
- Implementation of the text to video model LUMIERE from the paper: "A Space-Time Diffusion Model for Video Generation" by Google Research☆51Updated 2 months ago
- ☆83Updated 7 months ago
- Scaling Vision Pre-Training to 4K Resolution☆110Updated 2 weeks ago
- [ECCV 2024] Official PyTorch implementation of "Getting it Right: Improving Spatial Consistency in Text-to-Image Models"☆100Updated 9 months ago
- A streamlined implementation of Grounding DINO and SAM for advanced image segmentation. This lightweight solution simplifies the integrat…☆63Updated 6 months ago
- Pusa: Thousands-handed Video Diffusion Model☆11Updated 2 weeks ago
- A one-stop library to standardize the inference and evaluation of all the conditional video generation models.☆48Updated 2 months ago
- Official PyTorch implementation of TokenSet.☆113Updated 3 weeks ago
- [NeurIPS 2024] Official Implementation of Attention Interpolation of Text-to-Image Diffusion☆97Updated 4 months ago
- [NeurIPS 2024] Official PyTorch Implementation of "FlowTurbo: Towards Real-time Flow-Based Image Generation with Velocity Refiner"☆66Updated 6 months ago
- Scripts to teach Flux the task of image editing from language with the Flux Control framework.☆64Updated 3 weeks ago
- Official Implementation of weights2weights☆140Updated last month
- [arXiv] On-device Sora: Enabling Diffusion-Based Text-to-Video Generation for Mobile Devices☆110Updated 2 months ago
- ☆30Updated 6 months ago
- ☆65Updated last year