kaist-ami / BEAFLinks

[ECCV’24] Official repository for "BEAF: Observing Before-AFter Changes to Evaluate Hallucination in Vision-language Models"

☆21

Alternatives and similar repositories for BEAF

Users that are interested in BEAF are comparing it to the libraries listed below

Sorting:

ytaek-oh / vl_compo
☆10Updated last year
kwonjunn01 / Hi-Mapper
☆15Updated 11 months ago
amitakamath / vl_text_encoders_are_bottlenecks
Code and datasets for "Text encoders are performance bottlenecks in contrastive vision-language models". Coming soon!
☆11Updated 2 years ago
ExplainableML / ImageSelect
Code for the paper "If at First You Don't Succeed, Try, Try Again: Faithful Diffusion-based Text-to-Image Generation by Selection"
☆27Updated 2 years ago
naver-ai / prolip
☆55Updated 3 months ago
chenshuang-zhang / imagenet_d
[CVPR 2024 Highlight] ImageNet-D
☆44Updated last year
tripletclip / TripletCLIP
[NeurIPS 2024] Official PyTorch implementation of "Improving Compositional Reasoning of CLIP via Synthetic Vision-Language Negatives"
☆45Updated 11 months ago
k1rezaei / Text-to-concept
☆35Updated last year
McGill-NLP / diffusion-itm
Code and data setup for the paper "Are Diffusion Models Vision-and-language Reasoners?"
☆33Updated last year
uvavision / SyViC
[ICCV 2023] Going Beyond Nouns With Vision & Language Models Using Synthetic Data
☆13Updated 2 years ago
TAU-VAILab / hierarcaps
Code and data for the paper "Emergent Visual-Semantic Hierarchies in Image-Text Representations" (ECCV 2024)
☆32Updated last year
alhojel / visual_task_vectors
☆39Updated last year
sterzhang / PVIT
Official Repository of Personalized Visual Instruct Tuning
☆32Updated 8 months ago
aszala / VPEval
VPEval Codebase from Visual Programming for Text-to-Image Generation and Evaluation (NeurIPS 2023)
☆44Updated last year
hjbahng / cyclereward
CycleReward is a reward model trained on cycle consistency preferences to measure image-text alignment.
☆52Updated 2 weeks ago
UW-Madison-Lee-Lab / CoBSAT
Implementation and dataset for paper "Can MLLMs Perform Text-to-Image In-Context Learning?"
☆41Updated 5 months ago
ExplainableML / DataDream
[ECCV 2024] Official repository for "DataDream: Few-shot Guided Dataset Generation"
☆46Updated last year
kdariina / CLIP-not-BoW-unimodally
Code for "CLIP Behaves like a Bag-of-Words Model Cross-modally but not Uni-modally"
☆16Updated 9 months ago
object-understanding / SLASH
☆23Updated 2 years ago
jaehong31 / SAFREE
[ICLR 2025] SAFREE: Training-Free and Adaptive Guard for Safe Text-to-Image and Video Generation
☆45Updated 10 months ago
alinlab / b2t
Bias-to-Text: Debiasing Unknown Visual Biases through Language Interpretation
☆31Updated 2 years ago
ys-zong / MIRB
Benchmarking Multi-Image Understanding in Vision and Language Models
☆12Updated last year
Luodian / GenBench
Benchmarking and Analyzing Generative Data for Visual Recognition
☆26Updated 2 years ago
adobe-research / llava-score
☆11Updated last year
eric-ai-lab / Discffusion
Official repo for the TMLR paper "Discffusion: Discriminative Diffusion Models as Few-shot Vision and Language Learners"
☆30Updated last year
orrzohar / LOVM
[NeurIPS 2023] Official Pytorch code for LOVM: Language-Only Vision Model Selection
☆21Updated last year
ethanlshen / HierNet
Code for "Are “Hierarchical” Visual Representations Hierarchical?" in NeurIPS Workshop for Symmetry and Geometry in Neural Representation…
☆21Updated 2 years ago
see-say-segment / sesame
🔥 [CVPR 2024] Official implementation of "See, Say, and Segment: Teaching LMMs to Overcome False Premises (SESAME)"
☆45Updated last year
Zi-hao-Wei / Efficient-Vision-Language-Pre-training-by-Cluster-Masking
[CVPR 2024] Improving language-visual pretraining efficiency by perform cluster-based masking on images.
☆29Updated last year
hammoudhasan / SynthCLIP
Code base of SynthCLIP: CLIP training with purely synthetic text-image pairs from LLMs and TTIs.
☆101Updated 7 months ago