ssundaram21 / dreamsim
DreamSim: Learning New Dimensions of Human Visual Similarity using Synthetic Data (NeurIPS 2023 Spotlight) / When Does Perceptual Alignment Benefit Vision Representations? (NeurIPS 2024)
☆408Updated last week
Related projects
Alternatives and complementary repositories for dreamsim
- [NeurIPS'23] Emergent Correspondence from Image Diffusion☆619Updated 6 months ago
- PyTorch implementation of InstructDiffusion, a unifying and generic framework for aligning computer vision tasks with human instructions.☆394Updated 6 months ago
- ☆456Updated last year
- PyTorch implementation of CLIP Maximum Mean Discrepancy (CMMD) for evaluating image generation models.☆99Updated 7 months ago
- [ECCV 2024] Official Repository for DiffiT: Diffusion Vision Transformers for Image Generation☆459Updated 3 weeks ago
- 🚀 Cross attention map tools for huggingface/diffusers☆155Updated last week
- [WACV 2024] Training-Free Layout Control with Cross-Attention Guidance☆240Updated 8 months ago
- Official PyTorch implementation of the paper "In-Context Learning Unlocked for Diffusion Models"☆379Updated 7 months ago
- Learning from synthetic data - code and models☆303Updated 10 months ago
- Official implementation of "Controlling Text-to-Image Diffusion by Orthogonal Finetuning".☆281Updated last month
- Official Implementation for "Cross-Image Attention for Zero-Shot Appearance Transfer"☆339Updated 6 months ago
- [CVPR 2024] PAIR Diffusion: A Comprehensive Multimodal Object-Level Image Editor☆506Updated 7 months ago
- Smooth Diffusion: Crafting Smooth Latent Spaces in Diffusion Models arXiv 2023 / CVPR 2024☆319Updated last month
- AlignProp uses direct reward backpropagation for the alignment of large-scale text-to-image diffusion models. Our method is 25x more samp…☆242Updated 3 weeks ago
- [CVPR 2024] Official implementation of CVPR 2024 paper: "Inversion-Free Image Editing with Natural Language"☆290Updated 5 months ago
- Official PyTorch implementation of the paper: "An Edit Friendly DDPM Noise Space: Inversion and Manipulations". CVPR 2024.☆284Updated 4 months ago
- ☆223Updated last year
- [ICML 2024 Spotlight] FiT: Flexible Vision Transformer for Diffusion Model☆389Updated 2 weeks ago
- DiffSeg is an unsupervised zero-shot segmentation method using attention information from a stable-diffusion model. This repo implements …☆271Updated 4 months ago
- ☆446Updated 9 months ago
- The official PyTorch Implementation for ElasticDiffusion: Training-free Arbitrary Size Image Generation (CVPR 2024)☆154Updated 7 months ago
- Masked Diffusion Transformer is the SOTA for image synthesis. (ICCV 2023)☆528Updated 7 months ago
- ☆113Updated 8 months ago
- ☆170Updated 7 months ago
- Official repo for Asyrp: Diffusion Models already have a Semantic Latent Space (ICLR 2023)☆254Updated last year
- [NeurIPS 2023] T2I-CompBench: A Comprehensive Benchmark for Open-world Compositional Text-to-image Generation☆212Updated 2 weeks ago
- Official Implementation for "Attend-and-Excite: Attention-Based Semantic Guidance for Text-to-Image Diffusion Models" (SIGGRAPH 2023)☆703Updated 9 months ago
- Training-Free Structured Diffusion Guidance for Compositional Text-to-Image Synthesis☆310Updated last year
- Better Aligning Text-to-Image Models with Human Preference. ICCV 2023☆267Updated last year
- [ICCV 2023] Efficient Diffusion Training via Min-SNR Weighting Strategy☆229Updated 7 months ago