🔥 [ICLR 2025] Official Benchmark Toolkits for "Visual Haystacks: A Vision-Centric Needle-In-A-Haystack Benchmark"
☆42 · Nov 21, 2025 · Updated 6 months ago
Alternatives and similar repositories for vhs_benchmark
Users that are interested in vhs_benchmark are comparing it to the libraries listed below.
- 🔥 [ICLR 2025] Official PyTorch Model "Visual Haystacks: A Vision-Centric Needle-In-A-Haystack Benchmark" ☆26 · Feb 9, 2025 · Updated last year
- ☆22 · Apr 30, 2026 · Updated last month
- 🔥 [NeurIPS 2025] Official implementation of "Generate, but Verify: Reducing Visual Hallucination in Vision-Language Models with Retrospe… ☆57 · Jan 22, 2026 · Updated 4 months ago
- Official Repository of VisGym: Diverse, Customizable, Scalable Environments for Multimodal Agents ☆111 · May 3, 2026 · Updated last month
- 🔥 [CVPR 2024] Official implementation of "See, Say, and Segment: Teaching LMMs to Overcome False Premises (SESAME)" ☆46 · Jun 16, 2024 · Updated last year
- Code for Dataset and Benchmarks Submission, Neurips 2022 ☆13 · Aug 16, 2022 · Updated 3 years ago
- ☆14 · Jun 11, 2024 · Updated 2 years ago
- Automatically Analyze your Model Traces ☆45 · Mar 16, 2026 · Updated 2 months ago
- Ask&Confirm: Active Detail Enriching for Cross-Modal Retrieval with Partial Query (ICCV2021) ☆20 · Dec 4, 2021 · Updated 4 years ago
- ArXiV Notification Bot which sends you an email with the latest updates! ☆17 · Oct 20, 2023 · Updated 2 years ago
- ☆22 · Apr 24, 2025 · Updated last year
- VideoNIAH: A Flexible Synthetic Method for Benchmarking Video MLLMs ☆56 · Mar 9, 2025 · Updated last year
- Tensorflow + Keras Algorithms repository for the CannyLab ☆10 · Feb 9, 2022 · Updated 4 years ago
- Work-in-progress unofficial asynchronous API wrapper for Whatnot API. ☆13 · Apr 18, 2024 · Updated 2 years ago
- VHTest ☆16 · Oct 31, 2024 · Updated last year
- Iterate on LLM-based structured generation forward and backward ☆23 · Mar 20, 2025 · Updated last year
- [ACL 2026 Findings] CoV: Chain-of-View Prompting for Spatial Reasoning ☆61 · Apr 7, 2026 · Updated 2 months ago
- Code & data for EMNLP 2020 paper "MOCHA: A Dataset for Training and Evaluating Reading Comprehension Metrics". ☆16 · May 3, 2022 · Updated 4 years ago
- COLA: Evaluate how well your vision-language model can Compose Objects Localized with Attributes! ☆25 · May 14, 2026 · Updated last month
- CLAIR: A (surprisingly) simple semantic text metric with large language models. ☆22 · Jan 28, 2024 · Updated 2 years ago
- [ICLR 2026] Official code of "Search Arena: Analyzing Search-Augmented LLMs". ☆59 · Feb 23, 2026 · Updated 3 months ago
- ☆10 · Sep 25, 2019 · Updated 6 years ago
- Code for "Recognizing Scenes from Novel Viewpoints" ☆29 · Sep 16, 2022 · Updated 3 years ago
- Code and Data for ACL 2023 paper "I Spy a Metaphor: Large Language Models and Diffusion Models Co-Create Visual Metaphors" ☆17 · Jun 7, 2023 · Updated 3 years ago
- ☆30 · Sep 2, 2025 · Updated 9 months ago
- Official code repository for the main conference paper in EMNLP 2022: SubeventWriter: Iterative Sub-event Sequence Generation with Cohere… ☆11 · Oct 16, 2022 · Updated 3 years ago
- Holistic evaluation of multimodal foundation models ☆48 · Aug 11, 2024 · Updated last year
- ☆21 · Nov 13, 2023 · Updated 2 years ago
- ☆22 · Jul 23, 2025 · Updated 10 months ago
- Code for Slow Transition to Low-Dimensional Chaos in Heavy-Tailed Recurrent Neural Networks (NeurIPS 2025) ☆20 · Mar 16, 2026 · Updated 2 months ago
- SLIP is a sandbox environment for engineering protein sequences with synthetic fitness functions. ☆21 · Jan 17, 2024 · Updated 2 years ago
- ☆26 · May 9, 2022 · Updated 4 years ago
- ☆44 · Jul 24, 2024 · Updated last year
- Koishi's Day 2025 Paper (NeurIPS 2025): "Codifying Character Logic in Role-Playing" ☆24 · Jan 15, 2026 · Updated 4 months ago
- This GenAI demo project, built with CrewAI and AutoGen, showcases potential security risks associated with AI agents. ☆17 · May 1, 2025 · Updated last year
- a set of tools for computer vision processing ☆18 · Jul 9, 2016 · Updated 9 years ago
- [CVPR23 Highlight] CREPE: Can Vision-Language Foundation Models Reason Compositionally? ☆35 · Apr 27, 2023 · Updated 3 years ago
- Official code repository for Findings of EMNLP 2022 paper: PseudoReasoner: Leveraging Pseudo Labels for Commonsense Knowledge Base Popula… ☆11 · Oct 18, 2022 · Updated 3 years ago
- Code release for "UnSAMv2: Self-Supervised Learning Enables Segment Anything at Any Granularity" ☆81 · Feb 1, 2026 · Updated 4 months ago