🔥 [ICLR 2025] Official Benchmark Toolkits for "Visual Haystacks: A Vision-Centric Needle-In-A-Haystack Benchmark"
☆40 · Nov 21, 2025 · Updated 4 months ago
Alternatives and similar repositories for vhs_benchmark
Users that are interested in vhs_benchmark are comparing it to the libraries listed below.
- 🔥 [ICLR 2025] Official PyTorch Model for "Visual Haystacks: A Vision-Centric Needle-In-A-Haystack Benchmark" ☆26 · Feb 9, 2025 · Updated last year
- 🔥 [NeurIPS 2025] Official implementation of "Generate, but Verify: Reducing Visual Hallucination in Vision-Language Models with Retrospe…" ☆54 · Jan 22, 2026 · Updated 2 months ago
- Official PyTorch implementation of "D2ADA: Dynamic Density-aware Active Domain Adaptation for Semantic Segmentation. Wu et al. ECCV 20…" ☆25 · Feb 2, 2023 · Updated 3 years ago
- 🔥 [CVPR 2024] Official implementation of "See, Say, and Segment: Teaching LMMs to Overcome False Premises (SESAME)" ☆47 · Jun 16, 2024 · Updated last year
- To mitigate position bias in LLMs, especially in long-context scenarios, we scale only one dimension of LLMs, reducing position bias and … ☆11 · Jun 18, 2024 · Updated last year
- ☆13 · Jun 11, 2024 · Updated last year
- Ask&Confirm: Active Detail Enriching for Cross-Modal Retrieval with Partial Query (ICCV 2021) ☆20 · Dec 4, 2021 · Updated 4 years ago
- arXiv Notification Bot which sends you an email with the latest updates! ☆15 · Oct 20, 2023 · Updated 2 years ago
- ☆23 · Apr 24, 2025 · Updated 11 months ago
- ☆18 · Sep 15, 2025 · Updated 6 months ago
- VideoNIAH: A Flexible Synthetic Method for Benchmarking Video MLLMs ☆55 · Mar 9, 2025 · Updated last year
- CoV: Chain-of-View Prompting for Spatial Reasoning ☆52 · Jan 23, 2026 · Updated 2 months ago
- VHTest ☆16 · Oct 31, 2024 · Updated last year
- PyTorch Datasets for Easy-To-Hard ☆29 · Jan 9, 2025 · Updated last year
- Code & data for EMNLP 2020 paper "MOCHA: A Dataset for Training and Evaluating Reading Comprehension Metrics". ☆16 · May 3, 2022 · Updated 3 years ago
- ☆12 · Jul 31, 2023 · Updated 2 years ago
- CLAIR: A (surprisingly) simple semantic text metric with large language models. ☆22 · Jan 28, 2024 · Updated 2 years ago
- ☆10 · Sep 25, 2019 · Updated 6 years ago
- [ICLR 2026] Official code of "Search Arena: Analyzing Search-Augmented LLMs". ☆53 · Feb 23, 2026 · Updated last month
- Text Summarization on Spotify Podcast Transcripts for NLP class at @UNIBO ☆17 · Jul 2, 2022 · Updated 3 years ago
- Code and Data for ACL 2023 paper "I Spy a Metaphor: Large Language Models and Diffusion Models Co-Create Visual Metaphors" ☆16 · Jun 7, 2023 · Updated 2 years ago
- ☆29 · Sep 2, 2025 · Updated 6 months ago
- Official code repository for the main conference paper in EMNLP 2022: SubeventWriter: Iterative Sub-event Sequence Generation with Cohere… ☆11 · Oct 16, 2022 · Updated 3 years ago
- Holistic evaluation of multimodal foundation models ☆49 · Aug 11, 2024 · Updated last year
- ☆20 · Nov 13, 2023 · Updated 2 years ago
- Repository containing necessary files to run a server able to run Webots simulation ☆12 · Jan 3, 2025 · Updated last year
- Open sourced result for The Agent Company ☆21 · Nov 11, 2025 · Updated 4 months ago
- [EMNLP 2025 Demo] Extracting internal representations from vision-language models. Beta version. ☆117 · Mar 10, 2026 · Updated 2 weeks ago
- [CVPR23 Highlight] CREPE: Can Vision-Language Foundation Models Reason Compositionally? ☆35 · Apr 27, 2023 · Updated 2 years ago
- a set of tools for computer vision processing ☆18 · Jul 9, 2016 · Updated 9 years ago
- Official code repository for Findings of EMNLP 2022 paper: PseudoReasoner: Leveraging Pseudo Labels for Commonsense Knowledge Base Popula… ☆11 · Oct 18, 2022 · Updated 3 years ago
- Code release for "UnSAMv2: Self-Supervised Learning Enables Segment Anything at Any Granularity" ☆79 · Feb 1, 2026 · Updated last month
- In-Context Alignment: Chat with Vanilla Language Models Before Fine-Tuning ☆35 · Aug 9, 2023 · Updated 2 years ago
- Google Drive CLI Client ☆21 · Dec 20, 2022 · Updated 3 years ago
- Official code repository for the paper: AbsPyramid: Benchmarking the Abstraction Ability of Language Models with a Unified Entailment Grap… ☆13 · Oct 30, 2024 · Updated last year
- This is the code repo for Findings of EMNLP 2022 paper: MICO: a multi-alternative contrastive learning framework for commonsense knowledg… ☆10 · Nov 29, 2022 · Updated 3 years ago
- ☆13 · Oct 23, 2024 · Updated last year
- ☆13 · Apr 23, 2025 · Updated 11 months ago
- ☆15 · Mar 31, 2022 · Updated 3 years ago