🔥 [ICLR 2025] Official Benchmark Toolkits for "Visual Haystacks: A Vision-Centric Needle-In-A-Haystack Benchmark"
☆41Nov 21, 2025Updated 4 months ago
Alternatives and similar repositories for vhs_benchmark
Users that are interested in vhs_benchmark are comparing it to the libraries listed below.
- 🔥 [ICLR 2025] Official PyTorch Model "Visual Haystacks: A Vision-Centric Needle-In-A-Haystack Benchmark"☆26Feb 9, 2025Updated last year
- A tool for calling (and calling out to) large language models.☆16Aug 13, 2024Updated last year
- Official Repository of VisGym: Diverse, Customizable, Scalable Environments for Multimodal Agents☆105Mar 10, 2026Updated last month
- 🔥 [NeurIPS 2025] Official implementation of "Generate, but Verify: Reducing Visual Hallucination in Vision-Language Models with Retrospe…☆56Jan 22, 2026Updated 2 months ago
- 🍀 Official pytorch implementation of "D2ADA: Dynamic Density-aware Active Domain Adaptation for Semantic Segmentation. Wu et al. ECCV 20…☆25Feb 2, 2023Updated 3 years ago
- To mitigate position bias in LLMs, especially in long-context scenarios, we scale only one dimension of LLMs, reducing position bias and …☆11Jun 18, 2024Updated last year
- Ask&Confirm: Active Detail Enriching for Cross-Modal Retrieval with Partial Query (ICCV2021)☆20Dec 4, 2021Updated 4 years ago
- ArXiV Notification Bot which sends you an email with the latest updates!☆15Oct 20, 2023Updated 2 years ago
- ☆22Apr 24, 2025Updated 11 months ago
- ☆18Sep 15, 2025Updated 6 months ago
- VideoNIAH: A Flexible Synthetic Method for Benchmarking Video MLLMs☆55Mar 9, 2025Updated last year
- VHTest☆16Oct 31, 2024Updated last year
- [ACL 2026 Findings] CoV: Chain-of-View Prompting for Spatial Reasoning☆52Apr 7, 2026Updated last week
- COLA: Evaluate how well your vision-language model can Compose Objects Localized with Attributes!☆25Nov 23, 2024Updated last year
- ☆12Jul 31, 2023Updated 2 years ago
- CLAIR: A (surprisingly) simple semantic text metric with large language models.☆22Jan 28, 2024Updated 2 years ago
- ☆10Sep 25, 2019Updated 6 years ago
- Text Summarization on Spotify Podcast Transcripts for NLP class at @UNIBO☆17Jul 2, 2022Updated 3 years ago
- Code and Data for ACL 2023 paper I Spy a Metaphor: Large Language Models and Diffusion Models Co-Create Visual Metaphors☆16Jun 7, 2023Updated 2 years ago
- ☆29Sep 2, 2025Updated 7 months ago
- Holistic evaluation of multimodal foundation models☆49Aug 11, 2024Updated last year
- ☆20Jul 23, 2025Updated 8 months ago
- [EMNLP 2025 Demo] Extracting internal representations from vision-language models. Beta version.☆120Mar 10, 2026Updated last month
- Code for Slow Transition to Low-Dimensional Chaos in Heavy-Tailed Recurrent Neural Networks (NeurIPS 2025)☆20Mar 16, 2026Updated 3 weeks ago
- Code for "Chat-3D: Data-efficiently Tuning Large Language Model for Universal Dialogue of 3D Scenes"☆56Mar 28, 2024Updated 2 years ago
- slices in group meetings☆12Nov 29, 2020Updated 5 years ago
- ReKep implemented on a real Kinova Gen3 robot☆11Mar 18, 2025Updated last year
- ☆41Jul 24, 2024Updated last year
- Koishi's Day 2025 Paper (NeurIPS 2025): "Codifying Character Logic in Role-Playing"☆23Jan 15, 2026Updated 2 months ago
- [CVPR23 Highlight] CREPE: Can Vision-Language Foundation Models Reason Compositionally?☆35Apr 27, 2023Updated 2 years ago
- ReKep Experiment on UR5 based on kinova arm☆14Apr 25, 2025Updated 11 months ago
- Official repository for Decentralized Arena via Collective LLM Intelligence☆17May 19, 2025Updated 10 months ago
- In-Context Alignment: Chat with Vanilla Language Models Before Fine-Tuning☆34Aug 9, 2023Updated 2 years ago
- ☆24Aug 2, 2024Updated last year
- A Python tool to pull the complete edit history of a Wikipedia page☆21Jan 13, 2026Updated 3 months ago
- Code for EMNLP 2020 paper: Analogous Process Structure Induction for Sub-event Sequence Prediction☆11Oct 19, 2020Updated 5 years ago
- Official code repository for the paper: AbsPyramid: Benchmarking the Abstraction Ability of Language Models with a Unified Entailment Grap…☆13Oct 30, 2024Updated last year
- This is the code repo for Findings of EMNLP2022 paper: MICO: a multi-alternative contrastive learning framework for commonsense knowledg…☆10Nov 29, 2022Updated 3 years ago
- ☆13Oct 23, 2024Updated last year