ldynx / SAVELinks
☆27Updated 8 months ago
Alternatives and similar repositories for SAVE
Users that are interested in SAVE are comparing it to the libraries listed below
Sorting:
- ☆59Updated last year
- Repo for our NeurIPS 2023 paper on: Divide, Evaluate, and Refine: Evaluating and Improving Text-to-Image Alignment with Iterative VQA Fee…☆26Updated last year
- Code for CVPR 2024 Oral "Neural Lineage"☆17Updated last year
- ☆53Updated 4 months ago
- [ECCV’24] Official repository for "BEAF: Observing Before-AFter Changes to Evaluate Hallucination in Vision-language Models"☆20Updated 4 months ago
- Augmenting with Language-guided Image Augmentation (ALIA)☆76Updated last year
- [CVPR 2023] Zero-shot Generative Model Adaptation via Image-specific Prompt Learning☆83Updated 2 years ago
- [ICLR 2025] - Cross the Gap: Exposing the Intra-modal Misalignment in CLIP via Modality Inversion☆50Updated 3 months ago
- Code for the paper "If at First You Don't Succeed, Try, Try Again: Faithful Diffusion-based Text-to-Image Generation by Selection"☆27Updated 2 years ago
- [WACV 2024] Official Implementation of TIAM - A Metric for Evaluating Alignment in Text-to-Image Generation☆19Updated 6 months ago
- VisualGPTScore for visio-linguistic reasoning☆27Updated last year
- Code for "CLIP Behaves like a Bag-of-Words Model Cross-modally but not Uni-modally"☆14Updated 5 months ago
- [ECCV 2024] Official PyTorch implementation of LUT "Learning with Unmasked Tokens Drives Stronger Vision Learners"☆12Updated 8 months ago
- Prompt Generation Networks for Input-Space Adaptation of Frozen Vision Transformers. Jochem Loedeman, Maarten C. Stol, Tengda Han, Yuki M…☆42Updated 11 months ago
- (CVPR 2024) "Unsegment Anything by Simulating Deformation"☆28Updated last year
- Create generated datasets and train robust classifiers☆36Updated last year
- Implementation and dataset for paper "Can MLLMs Perform Text-to-Image In-Context Learning?"☆40Updated 2 months ago
- [NeurIPS2023] Official implementation and model release of the paper "What Makes Good Examples for Visual In-Context Learning?"☆177Updated last year
- Compress conventional Vision-Language Pre-training data☆51Updated last year
- [TIP] Exploring Effective Factors for Improving Visual In-Context Learning☆19Updated last month
- Adapting LLaMA Decoder to Vision Transformer☆29Updated last year
- Code and data for the paper "Emergent Visual-Semantic Hierarchies in Image-Text Representations" (ECCV 2024)☆29Updated 11 months ago
- ☆46Updated last year
- Code and data setup for the paper "Are Diffusion Models Vision-and-language Reasoners?"☆32Updated last year
- [NeurIPS 2024] Official PyTorch implementation of "Improving Compositional Reasoning of CLIP via Synthetic Vision-Language Negatives"☆41Updated 8 months ago
- ☆10Updated last year
- Visual Programming for Text-to-Image Generation and Evaluation (NeurIPS 2023)☆56Updated 2 years ago
- Official code for CVPR 2024 paper: Discriminative Probing and Tuning for Text-to-Image Generation☆32Updated 4 months ago
- [EMNLP 2024] Preserving Multi-Modal Capabilities of Pre-trained VLMs for Improving Vision-Linguistic Compositionality☆17Updated 10 months ago
- [CVPR 2023 & IJCV 2025] Positive-Augmented Contrastive Learning for Image and Video Captioning Evaluation☆62Updated last week