ldynx / SAVELinks
☆25Updated last year
Alternatives and similar repositories for SAVE
Users that are interested in SAVE are comparing it to the libraries listed below
Sorting:
- ☆62Updated 2 years ago
- [ICLR 2025] - Cross the Gap: Exposing the Intra-modal Misalignment in CLIP via Modality Inversion☆59Updated 2 months ago
- Repo for our NeurIPS 2023 paper on: Divide, Evaluate, and Refine: Evaluating and Improving Text-to-Image Alignment with Iterative VQA Fee…☆27Updated 2 years ago
- [CVPR 2023] Zero-shot Generative Model Adaptation via Image-specific Prompt Learning☆83Updated 2 years ago
- ☆47Updated last year
- Code for the paper "If at First You Don't Succeed, Try, Try Again: Faithful Diffusion-based Text-to-Image Generation by Selection"☆27Updated 2 years ago
- ☆56Updated 5 months ago
- Code for CVPR 2024 Oral "Neural Lineage"☆17Updated last year
- VisualGPTScore for visio-linguistic reasoning☆27Updated 2 years ago
- Implementation and dataset for paper "Can MLLMs Perform Text-to-Image In-Context Learning?"☆42Updated 8 months ago
- [WACV 2024] Official Implementation of TIAM - A Metric for Evaluating Alignment in Text-to-Image Generation☆19Updated last year
- Structuring Hour-Long Videos into Navigable Chapters and Hierarchical Summaries☆34Updated 2 months ago
- Augmenting with Language-guided Image Augmentation (ALIA)☆80Updated 2 years ago
- [CVPR 2023 & IJCV 2025] Positive-Augmented Contrastive Learning for Image and Video Captioning Evaluation☆64Updated 6 months ago
- Official code for CVPR 2024 paper: Discriminative Probing and Tuning for Text-to-Image Generation☆33Updated 10 months ago
- [ECCV’24] Official repository for "BEAF: Observing Before-AFter Changes to Evaluate Hallucination in Vision-language Models"☆21Updated 10 months ago
- [ECCV 2024] Official code for "Pseudo-RIS: Distinctive Pseudo-supervision Generation for Referring Image Segmentation"☆18Updated 6 months ago
- [NeurIPS 2024] Official implementation of "Unlocking the Capabilities of Masked Generative Models for Image Synthesis via Self-Guidance"☆17Updated last year
- Official Repository of Personalized Visual Instruct Tuning☆34Updated 11 months ago
- ☆82Updated last year
- Code for "CLIP Behaves like a Bag-of-Words Model Cross-modally but not Uni-modally"☆19Updated 11 months ago
- Code and data for the paper "Emergent Visual-Semantic Hierarchies in Image-Text Representations" (ECCV 2024)☆33Updated last year
- Code and data setup for the paper "Are Diffusion Models Vision-and-language Reasoners?"☆33Updated last year
- Compress conventional Vision-Language Pre-training data☆53Updated 2 years ago
- Official implementation of TCL (CVPR 2023)☆120Updated 2 years ago
- ☆33Updated 3 years ago
- RichHF-18K dataset contains rich human feedback labels we collected for our CVPR'24 paper: https://arxiv.org/pdf/2312.10240, along with t…☆153Updated last year
- Adapting LLaMA Decoder to Vision Transformer☆30Updated last year
- Unified layout planning and image generation, ICCV2025☆40Updated 2 weeks ago
- (CVPR 2024) "Unsegment Anything by Simulating Deformation"☆29Updated last year