VLM2-Bench [ACL 2025 Main]: A Closer Look at How Well VLMs Implicitly Link Explicit Matching Visual Cues
☆45May 20, 2025Updated 10 months ago
Alternatives and similar repositories for VLM2-Bench
Users that are interested in VLM2-Bench are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- The official code repository for the paper "CostBench: Evaluating Multi-Turn Cost-Optimal Planning and Adaptation in Dynamic Environments…☆30Dec 10, 2025Updated 3 months ago
- RAG-RewardBench: Benchmarking Reward Models in Retrieval Augmented Generation for Preference Alignment☆16Dec 19, 2024Updated last year
- ☆13May 15, 2025Updated 10 months ago
- ☆23Aug 2, 2024Updated last year
- Public code repo for EMNLP 2024 Findings paper "MACAROON: Training Vision-Language Models To Be Your Engaged Partners"☆14Sep 28, 2024Updated last year
- Repository of <FormalMATH: Benchmarking Formal Mathematical Reasoning of Large Language Models>☆77Jan 8, 2026Updated 2 months ago
- AgentsCourt: Building Judicial Decision-Making Agents with Court Debate Simulation and Legal Knowledge Augmentation (EMNLP 2024 Findings)☆16Dec 30, 2024Updated last year
- ☆36Dec 19, 2025Updated 3 months ago
- Resources and paper list for 'Scaling Environments for Agents'. This repository accompanies our survey on how environments contribute to …☆64Jan 28, 2026Updated last month
- MAPO: MIXED ADVANTAGE POLICY OPTIMIZATION☆39Sep 24, 2025Updated 6 months ago
- Code and data for the paper: IntentionQA: A Benchmark for Evaluating Purchase Intention Comprehension Abilities of Large Language Models …☆11Apr 27, 2024Updated last year
- ☆14Jan 6, 2025Updated last year
- [EMNLP 2025] The official implementation of "Zero-shot Multimodal Document Retrieval via Cross-Modal Question Generation"☆15Aug 26, 2025Updated 6 months ago
- [ACL 2024] "Understanding and Patching Compositional Reasoning in LLMs"☆13Aug 28, 2024Updated last year
- ☆19Oct 14, 2024Updated last year
- Official Repository of Personalized Visual Instruct Tuning☆34Mar 6, 2025Updated last year
- [ICML 2024] Safety Fine-Tuning at (Almost) No Cost: A Baseline for Vision Large Language Models.☆86Jan 19, 2025Updated last year
- A Diagnostic Guardrail Framework for AI Agent Safety and Security☆409Updated this week
- Official Implementation for the paper "VisCodex: Unified Multimodal Code Generation via Merging Vision and Coding Models"☆22Aug 14, 2025Updated 7 months ago
- awesome SAE papers☆73May 24, 2025Updated 10 months ago
- Official Implementation of VoxTracer (MM' 23)☆11Oct 27, 2023Updated 2 years ago
- 🔥🔥🔥Latest Papers, Codes on Uncertainty-based RL☆59Aug 24, 2025Updated 7 months ago
- A Good Neighbor, A Found Treasure: Mining Treasured Neighbors for Knowledge Graph Entity Typing. EMNLP 2022☆11Feb 1, 2023Updated 3 years ago
- Official implement of ACL'25 Findings paper "MMUnlearner: Reformulating Multimodal Machine Unlearning in the Era of Multimodal Large Lang…☆21Jun 17, 2025Updated 9 months ago
- Train vector quantized CLIP models using pytorch lightning☆20Jul 14, 2024Updated last year
- ☆12Jun 12, 2024Updated last year
- ☆13Jan 16, 2025Updated last year
- Official implementation of CVPR 2024 paper "Prompt Learning via Meta-Regularization".☆32Mar 10, 2025Updated last year
- Vlaser: Vision-Language-Action Model with Synergistic Embodied Reasoning☆44Mar 18, 2026Updated last week
- [NeurIPS'25] Backdoor Cleaning without External Guidance in MLLM Fine-tuning☆18Oct 13, 2025Updated 5 months ago
- This is the code for "TARGET: Federated Class-Continual Learning via Exemplar-Free Distillation" (ICCV 2023)☆52Apr 30, 2024Updated last year
- A flexible & scalable MLLM-based AIGC detection pipeline☆31Oct 27, 2025Updated 4 months ago
- ☆45Jun 19, 2025Updated 9 months ago
- The re-implementation of <End-to-End Lane Marker Detection via Row-wise Classification>☆14Sep 21, 2020Updated 5 years ago
- [ACL'24 Findings] Official code for "TLCR: Token-Level Continuous Reward for Fine-grained Reinforcement Learning from Human Feedback"☆12Dec 6, 2024Updated last year
- Official Implementation of the Baby-AIGS system☆24Nov 25, 2024Updated last year
- 武汉大学 iCalendar 校历☆12Updated this week
- [WWW24-UrbanCLIP] A comprehensive toolkit designed to facilitate the collection, processing, and integration of satellite imagery and ass…☆17Oct 6, 2024Updated last year
- The code used to train and run inference with MMDocIR☆32May 29, 2025Updated 9 months ago