pinterest / atg-research
☆63 Updated 2 months ago
Alternatives and similar repositories for atg-research
Users who are interested in atg-research are comparing it to the libraries listed below.
- Official implementation of "Divide & Bind Your Attention for Improved Generative Semantic Nursing" (BMVC 2023 Oral) ☆36 Updated last year
- 🦾 EvalGIM (pronounced as "EvalGym") is an evaluation library for generative image models. It enables easy-to-use, reproducible automatic… ☆74 Updated 4 months ago
- [ICCV 2023] Unsupervised Compositional Concepts Discovery with Text-to-Image Generative Models ☆84 Updated last year
- Visual Instruction-guided Explainable Metric. Code for "Towards Explainable Metrics for Conditional Image Synthesis Evaluation" (ACL 2024… ☆38 Updated 5 months ago
- [CVPR 2024] Official PyTorch implementation of "ECLIPSE: Revisiting the Text-to-Image Prior for Efficient Image Generation" ☆63 Updated last year
- Visual Programming for Text-to-Image Generation and Evaluation (NeurIPS 2023) ☆56 Updated last year
- Code and Data for Paper: SELMA: Learning and Merging Skill-Specific Text-to-Image Experts with Auto-Generated Data ☆34 Updated last year
- ☆70 Updated 5 months ago
- Official codebase for Margin-aware Preference Optimization for Aligning Diffusion Models without Reference (MaPO). ☆75 Updated 11 months ago
- Diffusion-Sharpening: Fine-tuning Diffusion Models with Denoising Trajectory Sharpening ☆58 Updated 2 months ago
- ☆17 Updated 9 months ago
- [ICLR 2025] Source code for paper "A Spark of Vision-Language Intelligence: 2-Dimensional Autoregressive Transformer for Efficient Finegr… ☆76 Updated 5 months ago
- 🤗 Unofficial huggingface/diffusers-based implementation of the paper "Training-Free Layout Control with Cross-Attention Guidance". ☆41 Updated last year
- [ICML 2024] Compositional Image Decomposition with Diffusion Models ☆50 Updated 10 months ago
- Official Implementation of Nabla-GFlowNet (ICLR 2025) ☆19 Updated last week
- A curated list of papers and resources for text-to-image evaluation. ☆29 Updated last year
- Reward Guided Latent Consistency Distillation ☆23 Updated 7 months ago
- T2VScore: Towards A Better Metric for Text-to-Video Generation ☆80 Updated last year
- The official repository of "Energy-Based Cross Attention for Bayesian Context Update in Text-to-Image Diffusion Models". ☆50 Updated last year
- The official implementation of Diffusion-KTO: Aligning Diffusion Models by Optimizing Human Utility ☆48 Updated 3 months ago
- ☆23 Updated 10 months ago
- Code for Paper 'Redefining Temporal Modeling in Video Diffusion: The Vectorized Timestep Approach' ☆17 Updated 7 months ago
- Training code for CLIP-FlanT5 ☆26 Updated 9 months ago
- Davidsonian Scene Graph (DSG) for Text-to-Image Evaluation (ICLR 2024) ☆87 Updated 5 months ago
- [NeurIPS 2024] EvolveDirector: Approaching Advanced Text-to-Image Generation with Large Vision-Language Models. ☆47 Updated 7 months ago
- Code for the paper "If at First You Don't Succeed, Try, Try Again: Faithful Diffusion-based Text-to-Image Generation by Selection" ☆27 Updated last year
- ☆30 Updated 3 months ago
- Official implementation of the paper The Hidden Language of Diffusion Models ☆72 Updated last year
- Official code for Inference-Time Scaling for Flow Models via Stochastic Generation and Rollover Budget Forcing ☆51 Updated 2 weeks ago
- We propose to generate a series of geometric shapes with target colors to disentangle (or peel off) the target colors from the shapes. B… ☆58 Updated 7 months ago