Code, data, models for the Sherlock corpus
☆60Nov 11, 2022Updated 3 years ago
Alternatives and similar repositories for sherlock
Users that are interested in sherlock are comparing it to the libraries listed below
Sorting:
- [CVPR 2022] Visual Abductive Reasoning☆124Oct 22, 2024Updated last year
- Corpus to accompany: "Selective Vision is the Challenge for Visual Reasoning: A Benchmark for Visual Argument Understanding"☆11Apr 11, 2025Updated 10 months ago
- ☆75Apr 4, 2024Updated last year
- Sparkles: Unlocking Chats Across Multiple Images for Multimodal Instruction-Following Models☆45Jun 14, 2024Updated last year
- An official codebase for "NormLens: Reading Books is Great, But Not if You Are Driving! Visually Grounded Reasoning about Defeasible Comm…☆10May 9, 2024Updated last year
- This repository is associated with the research paper titled ImageChain: Advancing Sequential Image-to-Text Reasoning in Multimodal Large…☆15Jun 4, 2025Updated 9 months ago
- An experiment to see if chatgpt can improve the output of the stanford alpaca dataset☆12Mar 29, 2023Updated 2 years ago
- Official code for "Evaluations of Machine Learning Privacy Defenses are Misleading" (https://arxiv.org/abs/2404.17399)☆12Apr 29, 2024Updated last year
- Code and datasets for "Text encoders are performance bottlenecks in contrastive vision-language models". Coming soon!☆11May 24, 2023Updated 2 years ago
- Comparison-based Machine Learning in Python☆21Jun 16, 2024Updated last year
- The released data for paper "Measuring and Improving Chain-of-Thought Reasoning in Vision-Language Models".☆34Sep 16, 2023Updated 2 years ago
- ☆12Nov 30, 2023Updated 2 years ago
- ☆17Jun 12, 2024Updated last year
- An abductive reasoning engine written in C++.☆13Dec 28, 2018Updated 7 years ago
- Official Repo for the Paper "AI as Humanity's Salieri: Quantifying Linguistic Creativity of Language Models via Systematic Attribution o…☆23Jan 12, 2025Updated last year
- Social Chemistry 101: Learning to Reason about Social and Moral Norms☆34Mar 17, 2023Updated 2 years ago
- Code and datasets for "What’s “up” with vision-language models? Investigating their struggle with spatial reasoning".☆71Feb 28, 2024Updated 2 years ago
- ☆21Feb 8, 2025Updated last year
- ☆16Jan 3, 2023Updated 3 years ago
- Official Code Repo for the Paper: "How does This Interaction Affect Me? Interpretable Attribution for Feature Interactions", In NeurIPS 2…☆42Oct 31, 2022Updated 3 years ago
- [NeurIPS 2023] Official Pytorch code for LOVM: Language-Only Vision Model Selection☆21Feb 3, 2024Updated 2 years ago
- ☆21Oct 10, 2023Updated 2 years ago
- Touchstone: Evaluating Vision-Language Models by Language Models☆83Jan 18, 2024Updated 2 years ago
- Natural language understanding by probabilistic abduction of a symbolic theory from sentences and logical forms.☆17Jun 13, 2025Updated 8 months ago
- [ECCV 2024] BenchLMM: Benchmarking Cross-style Visual Capability of Large Multimodal Models☆86Aug 19, 2024Updated last year
- Code for paper "Point and Ask: Incorporating Pointing into Visual Question Answering"☆19Oct 4, 2022Updated 3 years ago
- Code release for "Understanding Bias in Large-Scale Visual Datasets"☆22Dec 4, 2024Updated last year
- Code for ICCV2021: Discovering Human Interactions with Large-Vocabulary Objects via Query and Multi-Scale Detection☆28Oct 12, 2021Updated 4 years ago
- ☆20Apr 14, 2023Updated 2 years ago
- Evaluating Vision & Language Pretraining Models with Objects, Attributes and Relations. [EMNLP 2022]☆137Sep 29, 2024Updated last year
- ☆27Oct 30, 2023Updated 2 years ago
- ☆26Nov 21, 2022Updated 3 years ago
- ☆27Jul 20, 2024Updated last year
- Dataset and code for EMNLP 2022 "Visual Named Entity Linking: A New Dataset and A Baseline"☆27Apr 16, 2023Updated 2 years ago
- Code repo for "Read Anywhere Pointed: Layout-aware GUI Screen Reading with Tree-of-Lens Grounding"☆29Jul 31, 2024Updated last year
- DeepPerception: Advancing R1-like Cognitive Visual Perception in MLLMs for Knowledge-Intensive Visual Grounding☆66Jun 10, 2025Updated 8 months ago
- Evaluation framework for paper "VisualWebBench: How Far Have Multimodal LLMs Evolved in Web Page Understanding and Grounding?"☆64Oct 19, 2024Updated last year
- ☆31Sep 7, 2023Updated 2 years ago
- Comparative evaluation of image-to-image translation methods for stain transfer in histopathology☆28Feb 14, 2024Updated 2 years ago