Code, data, models for the Sherlock corpus
☆62Nov 11, 2022Updated 3 years ago
Alternatives and similar repositories for sherlock
Users that are interested in sherlock are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- [CVPR 2022] Visual Abductive Reasoning☆124Oct 22, 2024Updated last year
- incremental symbol learning for natural language understanding☆10Jun 12, 2023Updated 3 years ago
- An official codebase for "NormLens: Reading Books is Great, But Not if You Are Driving! Visually Grounded Reasoning about Defeasible Comm…☆10May 9, 2024Updated 2 years ago
- Sparkles: Unlocking Chats Across Multiple Images for Multimodal Instruction-Following Models☆45Jun 14, 2024Updated 2 years ago
- The released data for paper "Measuring and Improving Chain-of-Thought Reasoning in Vision-Language Models".☆34Sep 16, 2023Updated 2 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- This repository is associated with the research paper titled ImageChain: Advancing Sequential Image-to-Text Reasoning in Multimodal Large…☆15Jun 4, 2025Updated last year
- Code and datasets for "Text encoders are performance bottlenecks in contrastive vision-language models". Coming soon!☆11May 24, 2023Updated 3 years ago
- Official Repo for the Paper "AI as Humanity's Salieri: Quantifying Linguistic Creativity of Language Models via Systematic Attribution o…☆26Jan 12, 2025Updated last year
- Social Chemistry 101: Learning to Reason about Social and Moral Norms☆35Mar 17, 2023Updated 3 years ago
- ☆12Nov 30, 2023Updated 2 years ago
- A collections of papers about VQA-CP datasets and their results☆42Mar 18, 2022Updated 4 years ago
- ☆23Feb 8, 2025Updated last year
- Official code for "Evaluations of Machine Learning Privacy Defenses are Misleading" (https://arxiv.org/abs/2404.17399)☆13Apr 29, 2024Updated 2 years ago
- A simple evaluation of generative language models and safety classifiers.☆100Apr 15, 2026Updated 2 months ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Code and datasets for "What’s “up” with vision-language models? Investigating their struggle with spatial reasoning".☆71Feb 28, 2024Updated 2 years ago
- ☆27Aug 28, 2023Updated 2 years ago
- Touchstone: Evaluating Vision-Language Models by Language Models☆83Jan 18, 2024Updated 2 years ago
- Code repo for "Read Anywhere Pointed: Layout-aware GUI Screen Reading with Tree-of-Lens Grounding"☆30May 12, 2026Updated last month
- LaTeX template files for dissertations and theses formatted according to UCLA graduate division's requirements☆15Jul 11, 2022Updated 3 years ago
- 用Paddle复现论文ChineseBERT: Chinese Pretraining Enhanced by Glyph and Pinyin Information(ACL2021)☆10Nov 15, 2021Updated 4 years ago
- Abstract Spatial-Temporal Reasoning via Probabilistic Abduction and Execution☆27Mar 18, 2021Updated 5 years ago
- KARL: Knowledge-Aware Reasoning and Reinforcement Learning for Knowledge-Intensive Visual Grounding☆68Apr 5, 2026Updated 2 months ago
- Code for ModularQA☆27Jun 8, 2021Updated 5 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- MultimodalC4 is a multimodal extension of c4 that interleaves millions of images with text.☆954Mar 19, 2025Updated last year
- ☆16Jan 3, 2023Updated 3 years ago
- FunQA benchmarks funny, creative, and magic videos for challenging tasks including timestamp localization, video description, reasoning, …☆104Dec 25, 2025Updated 5 months ago
- Comparative evaluation of image-to-image translation methods for stain transfer in histopathology☆29Feb 14, 2024Updated 2 years ago
- ☆20Apr 14, 2023Updated 3 years ago
- A multimodal context reasoning approach that introduce the multi-view semantic alignment information via prefix tuning.☆15Sep 14, 2023Updated 2 years ago
- Code for LaMPP: Language Models as Probabilistic Priors for Perception and Action☆37Apr 3, 2023Updated 3 years ago
- Evaluating Vision & Language Pretraining Models with Objects, Attributes and Relations. [EMNLP 2022]☆139Apr 10, 2026Updated 2 months ago
- Dataset and code for EMNLP 2022 "Visual Named Entity Linking: A New Dataset and A Baseline"☆28Apr 16, 2023Updated 3 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Code release for "Understanding Bias in Large-Scale Visual Datasets"☆23Dec 4, 2024Updated last year
- ☆10Jul 23, 2021Updated 4 years ago
- ☆27Oct 30, 2023Updated 2 years ago
- Code accompanying paper "Fine-Grained Visual Entailment" [ECCV 2022].☆11Oct 31, 2022Updated 3 years ago
- ☆21Oct 10, 2023Updated 2 years ago
- A spoken version of the textual story cloze benchmark☆22Aug 6, 2023Updated 2 years ago
- An official codebase for paper " CHAMPAGNE: Learning Real-world Conversation from Large-Scale Web Videos (ICCV 23)"☆52Aug 13, 2023Updated 2 years ago