1jsingh / Divide-Evaluate-and-Refine
Repo for our NeurIPS 2023 paper on: Divide, Evaluate, and Refine: Evaluating and Improving Text-to-Image Alignment with Iterative VQA Feedback
☆26Updated last year
Alternatives and similar repositories for Divide-Evaluate-and-Refine:
Users that are interested in Divide-Evaluate-and-Refine are comparing it to the libraries listed below
- ☆57Updated last year
- VisualGPTScore for visio-linguistic reasoning☆26Updated last year
- Official code for CVPR 2024 paper: Discriminative Probing and Tuning for Text-to-Image Generation☆27Updated last month
- [NeurIPS2023] Official implementation and model release of the paper "What Makes Good Examples for Visual In-Context Learning?"☆170Updated 10 months ago
- ☆56Updated 2 years ago
- (NeurIPS 2024 Spotlight) TOPA: Extend Large Language Models for Video Understanding via Text-Only Pre-Alignment☆26Updated 3 months ago
- Code Release of F-LMM: Grounding Frozen Large Multimodal Models☆60Updated 5 months ago
- VPEval Codebase from Visual Programming for Text-to-Image Generation and Evaluation (NeurIPS 2023)☆44Updated last year
- ☆75Updated last month
- [ECCV2024] Learning Video Context as Interleaved Multimodal Sequences☆32Updated this week
- Official Implementation of VideoDPO☆37Updated last week
- PyTorch code for "Contrastive Region Guidance: Improving Grounding in Vision-Language Models without Training"☆31Updated 10 months ago
- Code and Models for "GeneCIS A Benchmark for General Conditional Image Similarity"☆55Updated last year
- Offical PyTorch implementation of Clover: Towards A Unified Video-Language Alignment and Fusion Model (CVPR2023)☆40Updated last year
- Visual Programming for Text-to-Image Generation and Evaluation (NeurIPS 2023)☆54Updated last year
- Augmenting with Language-guided Image Augmentation (ALIA)☆70Updated last year
- Task Residual for Tuning Vision-Language Models (CVPR 2023)☆68Updated last year
- [ICCV 2023] Generative Prompt Model for Weakly Supervised Object Localization☆54Updated last year
- Implementation of the paper Knowledge-Enhanced Dual-stream Zero-shot Composed Image Retrieval (CVPR 2024)☆14Updated 2 months ago
- Positive-Augmented Contrastive Learning for Image and Video Captioning Evaluation. CVPR 2023☆58Updated 2 months ago
- Official Implementation of ICLR'24: Kosmos-G: Generating Images in Context with Multimodal Large Language Models☆59Updated 7 months ago
- [ECCV 2024] ControlCap: Controllable Region-level Captioning☆64Updated 2 months ago
- [ACM MM 22] Correspondence Matters for Video Referring Expression Comprehension☆15Updated 2 years ago
- HalluciDoctor: Mitigating Hallucinatory Toxicity in Visual Instruction Data (Accepted by CVPR 2024)☆43Updated 6 months ago
- ☆14Updated this week
- Turning to Video for Transcript Sorting☆48Updated last year
- NegCLIP.☆29Updated last year
- ☆97Updated 8 months ago
- ☆55Updated 8 months ago
- ☆28Updated last year