ProjectNUWA / StrokeNUWA
☆14Updated 7 months ago
Related projects: ⓘ
- Implementation and dataset for paper "Can MLLMs Perform Text-to-Image In-Context Learning?"☆22Updated last month
- ☆15Updated 11 months ago
- Code for Commonsense-T2I Challenge: Can Text-to-Image Generation Models Understand Commonsense? [COLM 2024]☆11Updated last month
- [ICLR 2024] Official pytorch implementation of "Denoising Task Routing for Diffusion Models"☆16Updated 7 months ago
- [ICML 2024] On Discrete Prompt Optimization for Diffusion Models - Google☆20Updated last month
- Code and Data for Paper: SELMA: Learning and Merging Skill-Specific Text-to-Image Experts with Auto-Generated Data☆30Updated 6 months ago
- Visual Instruction-guided Explainable Metric. Code for "Towards Explainable Metrics for Conditional Image Synthesis Evaluation" (ACL 2024…☆22Updated last month
- How well can Text-to-Image Generative Models understand Ethical Natural Language Interventions?☆13Updated last year
- The official implementation of the paper "Towards Safe Self-Distillation of Internet-Scale Text-to-Image Diffusion Models" (ICML 2023 Wor…☆15Updated 6 months ago
- Official implementation of the paper The Hidden Language of Diffusion Models☆66Updated 7 months ago
- VPEval Codebase from Visual Programming for Text-to-Image Generation and Evaluation (NeurIPS 2023)☆42Updated 9 months ago
- Official code repo for "Editing Implicit Assumptions in Text-to-Image Diffusion Models"☆81Updated last year
- [Neurips 2023] Official pytorch implementation of "Addressing Negative Transfer in Diffusion Models"☆14Updated 2 months ago
- ☆17Updated 4 months ago
- ☆30Updated 11 months ago
- [ECCV 2024] Official pytorch implementation of "Switch Diffusion Transformer: Synergizing Denoising Tasks with Sparse Mixture-of-Experts"☆30Updated 2 months ago
- Forget-Me-Not: Learning to Forget in Text-to-Image Diffusion Models, 2023☆107Updated last year
- Official PyTorch implementation of "Learning to Generate Semantic Layouts for Higher Text-Image Correspondence in Text-to-Image Synthesis…☆39Updated 10 months ago
- (arXiv.2405.18406) RACCooN: Remove, Add, and Change Video Content with Auto-Generated Narratives☆26Updated 3 months ago
- The official implementation of Diffusion-KTO: Aligning Diffusion Models by Optimizing Human Utility☆17Updated last month
- The official repository of "Energy-Based Cross Attention for Bayesian Context Update in Text-to-Image Diffusion Models".☆46Updated 5 months ago
- MMIU: Multimodal Multi-image Understanding for Evaluating Large Vision-Language Models☆35Updated last week
- Code and data for EMNLP 2023 paper "Grounding Visual Illusions in Language: Do Vision-Language Models Perceive Illusions Like Humans?"☆10Updated 7 months ago
- Generating figures from research papers, using textual captions from the paper.☆14Updated last year
- fixed official code for paper "A Closer Look at Parameter-Efficient Tuning in Diffusion Models".☆39Updated last year
- ☆30Updated 7 months ago
- Official code for ICLR 2024 paper Do Generated Data Always Help Contrastive Learning?☆25Updated 5 months ago
- Official implementation of our paper "Finetuned Multimodal Language Models are High-Quality Image-Text Data Filters".☆40Updated 2 months ago
- Video Generation, Physical Commonsense, Semantic Adherence, VideoCon-Physics☆33Updated last month
- ☆37Updated 5 months ago