1jsingh / Divide-Evaluate-and-Refine
Repo for our NeurIPS 2023 paper on: Divide, Evaluate, and Refine: Evaluating and Improving Text-to-Image Alignment with Iterative VQA Feedback
☆27Nov 11, 2023Updated 2 years ago
Alternatives and similar repositories for Divide-Evaluate-and-Refine
Users who are interested in Divide-Evaluate-and-Refine are comparing it to the repositories listed below
- CIFAR-10-Warehouse: Towards Broad and More Realistic Testbeds in Model Generalization Analysis☆18Jul 15, 2024Updated last year
- Hierarchical multi-agent workflow for prompt optimization☆14Jun 12, 2024Updated last year
- Taylor videos and Taylor-transformed skeletons (ICML 2024).☆16Jul 25, 2024Updated last year
- This repository includes various baseline techniques for label-free model evaluation task for the VDU2023 competition.☆19Mar 8, 2023Updated 2 years ago
- ☆43May 30, 2025Updated 8 months ago
- [WACV 2024] Official Implementation of TIAM - A Metric for Evaluating Alignment in Text-to-Image Generation☆19Feb 3, 2025Updated last year
- [ICLR 2024 Spotlight] Bounding Box Stability against Feature Dropout Reflects Detector Generalization across Environments☆20Aug 19, 2025Updated 5 months ago
- TIFA: Accurate and Interpretable Text-to-Image Faithfulness Evaluation with Question Answering☆181Apr 29, 2024Updated last year
- Source code for "A Dense Reward View on Aligning Text-to-Image Diffusion with Preference" (ICML'24).☆40May 9, 2024Updated last year
- ☆16Dec 30, 2021Updated 4 years ago
- Code for WACV 2023 paper "VLC-BERT: Visual Question Answering with Contextualized Commonsense Knowledge"☆21May 8, 2023Updated 2 years ago
- [ICML'21] Estimate the accuracy of the classifier in various environments through self-supervision☆27Sep 2, 2021Updated 4 years ago
- Reward Guided Latent Consistency Distillation☆26Oct 9, 2024Updated last year
- Official PyTorch implementation of CVPRW 2022 paper "Attention Consistency on Visual Corruptions for Single-Source Domain Generalization"☆29Feb 22, 2023Updated 2 years ago
- Code for our CVPR-2021 paper on Combining Semantic Guidance and Deep Reinforcement Learning For Generating Human Level Paintings.☆27Jan 6, 2022Updated 4 years ago
- How Good is Google Bard's Visual Understanding? An Empirical Study on Open Challenges☆30Sep 24, 2023Updated 2 years ago
- VisualGPTScore for visio-linguistic reasoning☆27Oct 7, 2023Updated 2 years ago
- Code for 'Why is Winoground Hard? Investigating Failures in Visuolinguistic Compositionality', EMNLP 2022☆31May 29, 2023Updated 2 years ago
- [NeurIPS 2023 & TPAMI] T2I-CompBench (++) for Compositional Text-to-image Generation Evaluation☆331Dec 24, 2025Updated last month
- Diffusion Model as a Noise-Aware Latent Reward Model for Step-Level Preference Optimization☆61Sep 19, 2025Updated 4 months ago
- Code for our paper: Learning Camera Movement Control from Real-World Drone Videos☆34Apr 16, 2025Updated 10 months ago
- [CVPR 2020] A generative model with latent factors that are independent and localized.☆12Mar 27, 2025Updated 10 months ago
- Official Implementation of Diffusion Step Annealing (DiSA) in Autoregressive Image Generation☆144May 27, 2025Updated 8 months ago
- Automatic model evaluation (AutoEval) in CVPR'21&TPAMI'22☆37Oct 20, 2022Updated 3 years ago
- [CVPR 2025] Aesthetic Post-Training Diffusion Models from Generic Preferences with Step-by-step Preference Optimization☆264Apr 7, 2025Updated 10 months ago
- Code for EMNLP 2021 main conference paper "Dynamic Knowledge Distillation for Pre-trained Language Models"☆41Aug 9, 2022Updated 3 years ago
- Implementation for the paper "Unified Multimodal Model with Unlikelihood Training for Visual Dialog"☆13May 12, 2023Updated 2 years ago
- [ACM MM 2023] The released code of paper "Deconfounded Visual Question Generation with Causal Inference"☆11Sep 3, 2024Updated last year
- The official implementation of COOPER: A Unified Model for Cooperative Perception and Reasoning in Spatial Intelligence.☆28Dec 30, 2025Updated last month
- AAAI 2025: Anywhere: A Multi-Agent Framework for User-Guided, Reliable, and Diverse Foreground-Conditioned Image Generation☆44May 28, 2024Updated last year
- [NeurIPS 2024] 💫CoMat: Aligning Text-to-Image Diffusion Model with Image-to-Text Concept Matching☆168Nov 18, 2024Updated last year
- ☆46Oct 27, 2023Updated 2 years ago
- ☆18Nov 20, 2024Updated last year
- Bachelor Thesis LaTeX Template for Beijing Jiaotong University (unofficial)☆10Jun 20, 2022Updated 3 years ago
- ☆12Nov 23, 2021Updated 4 years ago
- [ICML2023] Instant Soup Cheap Pruning Ensembles in A Single Pass Can Draw Lottery Tickets from Large Models. Ajay Jaiswal, Shiwei Liu, Ti…☆11Nov 28, 2023Updated 2 years ago
- SpatialThinker: Reinforcing 3D Reasoning in Multimodal LLMs via Spatial Rewards☆28Jan 28, 2026Updated 2 weeks ago
- ☆13Mar 2, 2025Updated 11 months ago
- ☆10Jul 5, 2024Updated last year