ldynx / SAVE
☆25 Updated last year
Alternatives and similar repositories for SAVE
Users who are interested in SAVE are comparing it to the repositories listed below
- ☆61 Updated 2 years ago
- [ICLR 2025] - Cross the Gap: Exposing the Intra-modal Misalignment in CLIP via Modality Inversion ☆56 Updated 7 months ago
- [NeurIPS 2024] Official implementation of "Unlocking the Capabilities of Masked Generative Models for Image Synthesis via Self-Guidance" ☆16 Updated 11 months ago
- ☆55 Updated 3 months ago
- Repo for our NeurIPS 2023 paper on: Divide, Evaluate, and Refine: Evaluating and Improving Text-to-Image Alignment with Iterative VQA Fee… ☆26 Updated 2 years ago
- [CVPR 2023] Zero-shot Generative Model Adaptation via Image-specific Prompt Learning ☆83 Updated 2 years ago
- [WACV 2024] Official Implementation of TIAM - A Metric for Evaluating Alignment in Text-to-Image Generation ☆19 Updated 9 months ago
- Augmenting with Language-guided Image Augmentation (ALIA) ☆80 Updated 2 years ago
- ☆47 Updated last year
- Code for the paper "If at First You Don't Succeed, Try, Try Again: Faithful Diffusion-based Text-to-Image Generation by Selection" ☆27 Updated 2 years ago
- [ECCV’24] Official repository for "BEAF: Observing Before-AFter Changes to Evaluate Hallucination in Vision-language Models" ☆21 Updated 8 months ago
- Code for "CLIP Behaves like a Bag-of-Words Model Cross-modally but not Uni-modally" ☆16 Updated 9 months ago
- [ECCV 2024] Official PyTorch implementation of "HYPE: Hyperbolic Entailment Filtering for Underspecified Images and Texts" ☆20 Updated last year
- [ECCV 2024] Official PyTorch implementation of LUT "Learning with Unmasked Tokens Drives Stronger Vision Learners" ☆13 Updated 11 months ago
- Implementation and dataset for paper "Can MLLMs Perform Text-to-Image In-Context Learning?" ☆41 Updated 5 months ago
- VisualGPTScore for visio-linguistic reasoning ☆27 Updated 2 years ago
- [NeurIPS2023] Official implementation and model release of the paper "What Makes Good Examples for Visual In-Context Learning?" ☆179 Updated last year
- How well can Text-to-Image Generative Models understand Ethical Natural Language Interventions? ☆13 Updated 2 years ago
- Official Repository of Personalized Visual Instruct Tuning ☆32 Updated 8 months ago
- [CVPR 2023 & IJCV 2025] Positive-Augmented Contrastive Learning for Image and Video Captioning Evaluation ☆64 Updated 4 months ago
- Official Pytorch implementation of "CompoDiff: Versatile Composed Image Retrieval With Latent Diffusion" (TMLR 2024) ☆88 Updated 9 months ago
- Code for CVPR 2024 Oral "Neural Lineage" ☆17 Updated last year
- Official implementation of TCL (CVPR 2023) ☆119 Updated 2 years ago
- Repository for the paper: dense and aligned captions (dac) promote compositional reasoning in vl models ☆27 Updated 2 years ago
- Code and datasets for "Text encoders are performance bottlenecks in contrastive vision-language models". Coming soon! ☆11 Updated 2 years ago
- RichHF-18K dataset contains rich human feedback labels we collected for our CVPR'24 paper: https://arxiv.org/pdf/2312.10240, along with t… ☆146 Updated last year
- Create generated datasets and train robust classifiers ☆36 Updated 2 years ago
- Official PyTorch implementation of Extract Free Dense Misalignment from CLIP (AAAI'25) ☆25 Updated 7 months ago
- Unified layout planning and image generation, ICCV2025 ☆34 Updated 7 months ago
- Code and data for the paper "Emergent Visual-Semantic Hierarchies in Image-Text Representations" (ECCV 2024) ☆32 Updated last year