allenai / sherlockView external linksLinks
Code, data, models for the Sherlock corpus
☆59Nov 11, 2022Updated 3 years ago
Alternatives and similar repositories for sherlock
Users that are interested in sherlock are comparing it to the libraries listed below
Sorting:
- [CVPR 2022] Visual Abductive Reasoning☆124Oct 22, 2024Updated last year
- ☆75Apr 4, 2024Updated last year
- Sparkles: Unlocking Chats Across Multiple Images for Multimodal Instruction-Following Models☆45Jun 14, 2024Updated last year
- An official codebase for "NormLens: Reading Books is Great, But Not if You Are Driving! Visually Grounded Reasoning about Defeasible Comm…☆10May 9, 2024Updated last year
- incremental symbol learning for natural language understanding☆10Jun 12, 2023Updated 2 years ago
- This repository is associated with the research paper titled ImageChain: Advancing Sequential Image-to-Text Reasoning in Multimodal Large…☆14Jun 4, 2025Updated 8 months ago
- An experiment to see if chatgpt can improve the output of the stanford alpaca dataset☆12Mar 29, 2023Updated 2 years ago
- Official code for "Evaluations of Machine Learning Privacy Defenses are Misleading" (https://arxiv.org/abs/2404.17399)☆12Apr 29, 2024Updated last year
- Code and datasets for "Text encoders are performance bottlenecks in contrastive vision-language models". Coming soon!☆11May 24, 2023Updated 2 years ago
- The released data for paper "Measuring and Improving Chain-of-Thought Reasoning in Vision-Language Models".☆34Sep 16, 2023Updated 2 years ago
- ☆17Jun 12, 2024Updated last year
- ☆12Nov 30, 2023Updated 2 years ago
- An abductive reasoning engine written in C++.☆13Dec 28, 2018Updated 7 years ago
- A spoken version of the textual story cloze benchmark☆20Aug 6, 2023Updated 2 years ago
- ☆20Feb 8, 2025Updated last year
- ☆16Jan 3, 2023Updated 3 years ago
- Touchstone: Evaluating Vision-Language Models by Language Models☆83Jan 18, 2024Updated 2 years ago
- PyTorch code for Improving Commonsense in Vision-Language Models via Knowledge Graph Riddles (DANCE)☆23Nov 29, 2022Updated 3 years ago
- [ECCV 2024] BenchLMM: Benchmarking Cross-style Visual Capability of Large Multimodal Models☆86Aug 19, 2024Updated last year
- Code for ICCV2021: Discovering Human Interactions with Large-Vocabulary Objects via Query and Multi-Scale Detection☆28Oct 12, 2021Updated 4 years ago
- Evaluating Vision & Language Pretraining Models with Objects, Attributes and Relations. [EMNLP 2022]☆136Sep 29, 2024Updated last year
- VisualCOMET: Reasoning about the Dynamic Context of a Still Image☆88Jun 12, 2023Updated 2 years ago
- ☆27Oct 30, 2023Updated 2 years ago
- baseline mode for the ObjectNet competition☆18Jan 13, 2021Updated 5 years ago
- Code repo for "Read Anywhere Pointed: Layout-aware GUI Screen Reading with Tree-of-Lens Grounding"☆28Jul 31, 2024Updated last year
- Dataset and code for EMNLP 2022 "Visual Named Entity Linking: A New Dataset and A Baseline"☆27Apr 16, 2023Updated 2 years ago
- ☆27Jul 20, 2024Updated last year
- ☆27Mar 21, 2024Updated last year
- Comparative evaluation of image-to-image translation methods for stain transfer in histopathology☆28Feb 14, 2024Updated last year
- COLA: Evaluate how well your vision-language model can Compose Objects Localized with Attributes!☆25Nov 23, 2024Updated last year
- ☆30Jun 12, 2023Updated 2 years ago
- Official repository for the MMFM challenge☆25Jun 18, 2024Updated last year
- Code for ModularQA☆28Jun 8, 2021Updated 4 years ago
- The proposed simulated dataset consisting of 9,536 charts and associated data annotations in CSV format.☆26Feb 22, 2024Updated last year
- ☆27Aug 28, 2023Updated 2 years ago
- ☆32Feb 8, 2024Updated 2 years ago
- ☆58Dec 2, 2025Updated 2 months ago
- Dataset and starting code for visual entailment dataset☆118Apr 21, 2022Updated 3 years ago
- MultimodalC4 is a multimodal extension of c4 that interleaves millions of images with text.☆952Mar 19, 2025Updated 10 months ago