[NeurIPS'24] Official implementation of paper "Unveiling the Tapestry of Consistency in Large Vision-Language Models".
☆38Oct 23, 2024Updated last year
Alternatives and similar repositories for ConBench
Users that are interested in ConBench are comparing it to the libraries listed below
Sorting:
- [NeurIPS'24] Official PyTorch Implementation of Seeing the Image: Prioritizing Visual Correlation by Contrastive Alignment☆58Sep 26, 2024Updated last year
- [ICML'25] Official implementation of paper "SparseVLM: Visual Token Sparsification for Efficient Vision-Language Model Inference" and "Sp…☆241Dec 22, 2025Updated 2 months ago
- ☆17Oct 1, 2024Updated last year
- [ACL 2025 Findings] Official pytorch implementation of "Don't Miss the Forest for the Trees: Attentional Vision Calibration for Large Vis…☆24Jul 21, 2024Updated last year
- Implementation of "Single Shot Video Object Detector"☆23Mar 25, 2020Updated 5 years ago
- Look, Compare, Decide: Alleviating Hallucination in Large Vision-Language Models via Multi-View Multi-Path Reasoning☆24Sep 9, 2024Updated last year
- Official code repo of PIN: Positional Insert Unlocks Object Localisation Abilities in VLMs☆26Jan 14, 2025Updated last year
- [ACL 2024] Multi-modal preference alignment remedies regression of visual instruction tuning on language model☆47Nov 10, 2024Updated last year
- VisualGPTScore for visio-linguistic reasoning☆27Oct 7, 2023Updated 2 years ago
- HallE-Control: Controlling Object Hallucination in LMMs☆31Apr 10, 2024Updated last year
- The official implementation of 《MLLMs-Augmented Visual-Language Representation Learning》☆31Mar 12, 2024Updated last year
- Bias-to-Text: Debiasing Unknown Visual Biases through Language Interpretation☆32May 21, 2023Updated 2 years ago
- Code for "Exploring Grounding Potential of VQA-oriented GPT-4V for Zero-shot Anomaly Detection"☆31Nov 7, 2023Updated 2 years ago
- RareAct: A video dataset of unusual interactions☆33Aug 4, 2020Updated 5 years ago
- Teeth Segmentation☆11Apr 14, 2024Updated last year
- [NeurIPS'25] MLLM-CompBench evaluates the comparative reasoning of MLLMs with 40K image pairs and questions across 8 dimensions of relati…☆41Apr 21, 2025Updated 10 months ago
- [CVPR 2025] Official implementation of paper "MoVE-KD: Knowledge Distillation for VLMs with Mixture of Visual Encoders".☆49Jun 7, 2025Updated 8 months ago
- Implementation of the "Learn No to Say Yes Better" paper.☆39Oct 30, 2025Updated 4 months ago
- ☆111Jan 8, 2025Updated last year
- [CVPR2025] Official code for Lost in Translation Found in Context☆23Jan 14, 2026Updated last month
- Repository for the paper: Teaching Structured Vision & Language Concepts to Vision & Language Models☆48Sep 25, 2023Updated 2 years ago
- Optimized MDNet for fast object tracking☆10Apr 10, 2019Updated 6 years ago
- Code for Beyond Generic: Enhancing Image Captioning with Real-World Knowledge using Vision-Language Pre-Training Model☆13Feb 15, 2024Updated 2 years ago
- Improving Continuous Sign Language Recognition with Adapted Image Models☆14Nov 10, 2025Updated 3 months ago
- The code for SCG: Saliency and Contour Guided Salient Instance Segmentation☆10Feb 22, 2023Updated 3 years ago
- Code/Models for Defending Against Universal Attacks Through Selective Feature Regeneration, CVPR 2020☆10Jul 31, 2020Updated 5 years ago
- A Practical Zoom-in GUI Grounding and Behavior-Based Evaluation method.☆20Dec 8, 2025Updated 2 months ago
- ☆10Jul 16, 2023Updated 2 years ago
- [NeurIPS 2024] MoVA: Adapting Mixture of Vision Experts to Multimodal Context☆174Sep 25, 2024Updated last year
- Normalized Wasserstein for Mixture Distributions☆11Mar 24, 2023Updated 2 years ago
- This is the official code repository for the paper: Towards General Continuous Memory for Vision-Language Models.☆20Jul 3, 2025Updated 8 months ago
- ICCV 2021 papers and code focus on adversarial attacks and defense☆11Nov 5, 2021Updated 4 years ago
- A digital twin of the city of Chicago along with automated sensors☆12Nov 14, 2019Updated 6 years ago
- [AAAI 2024] SAAS - Official PyTorch Implementation☆11Mar 28, 2024Updated last year
- EMIT: Enhancing MLLMs for Industrial Anomaly Detection via Difficulty-Aware GRPO☆20Jan 24, 2026Updated last month
- A scalable benchmark for state representation learning in visual reinforcement learning.☆16Jun 23, 2025Updated 8 months ago
- Multi-Person Tracking in Tour Guide Robot☆10Aug 23, 2022Updated 3 years ago
- Unexplored Faces of Robustness and Out-of-Distribution: Covariate Shifts in Environment and Sensor Domains (CVPR 2024)☆10Jan 17, 2026Updated last month
- [Findings of ACL-2023] This is the official implementation of On the Difference of BERT-style and CLIP-style Text Encoders.☆14Jun 7, 2023Updated 2 years ago