[NeurIPS'25] ColorBench: Can VLMs See and Understand the Colorful World? A Comprehensive Benchmark for Color Perception, Reasoning, and Robustness
☆34Sep 27, 2025Updated 5 months ago
Alternatives and similar repositories for ColorBench
Users that are interested in ColorBench are comparing it to the libraries listed below
Sorting:
- [ICML 2025] Code for "R2-T2: Re-Routing in Test-Time for Multimodal Mixture-of-Experts"☆19Mar 10, 2025Updated last year
- [COLM 2025] "C3PO: Critical-Layer, Core-Expert, Collaborative Pathway Optimization for Test-Time Expert Re-Mixing"☆20Apr 9, 2025Updated 11 months ago
- [CVPR 2025] PyTorch implementation of T-CORE, introduced in "When the Future Becomes the Past: Taming Temporal Correspondence for Self-su…☆17Nov 4, 2025Updated 4 months ago
- A simple template for theoretical computer science assignments☆11Sep 6, 2023Updated 2 years ago
- Towards Scale-Aware Low-Light Enhancement via Structure-Guided Transformer Design☆26Apr 23, 2025Updated 10 months ago
- Official repository for "On the Multi-modal Vulnerability of Diffusion Models"☆16Jul 15, 2024Updated last year
- A unified robotic manipulation learning framework☆21Sep 4, 2025Updated 6 months ago
- Learning Safety Constraints for Large Language Models (ICML2025)☆32Aug 4, 2025Updated 7 months ago
- (NeurIPS 2025) Official implementation for "MJ-Bench: Is Your Multimodal Reward Model Really a Good Judge for Text-to-Image Generation?"☆47Jun 3, 2025Updated 9 months ago
- [CVPR 2025] PVC: Progressive Visual Token Compression for Unified Image and Video Processing in Large Vision-Language Models☆51Jun 12, 2025Updated 9 months ago
- ☆31Feb 26, 2026Updated 3 weeks ago
- A python package for DICOM to NifTi and NifTi to DICOM-SEG and GSPS conversion☆12Sep 25, 2023Updated 2 years ago
- [ICCV25] LD-RPS☆28Jul 17, 2025Updated 8 months ago
- Textual Inversion for DeepFloyd IF☆61Sep 19, 2023Updated 2 years ago
- LLM - Detect AI Generated Text || Identify which essay was written by a large language model☆17Jan 17, 2024Updated 2 years ago
- ☆19Jun 27, 2025Updated 8 months ago
- Code release for the paper "Progress-Aware Video Frame Captioning" (CVPR 2025)☆21Jul 16, 2025Updated 8 months ago
- [ICLR 2025 Oral] "Your Mixture-of-Experts LLM Is Secretly an Embedding Model For Free"☆90Oct 15, 2024Updated last year
- ☆12Apr 26, 2022Updated 3 years ago
- Combined InstantID🔥 and FouriScale to generate high resolution image!☆11Apr 3, 2024Updated last year
- A Framework for Decoupling and Assessing the Capabilities of VLMs☆43Jun 28, 2024Updated last year
- ☆22Apr 3, 2025Updated 11 months ago
- ☆28Nov 28, 2025Updated 3 months ago
- This is official repository for "LIR: Efficient Degradation Removal for Lightweight Image Restoration"☆14Jun 9, 2024Updated last year
- ☆17Nov 26, 2024Updated last year
- ☆14Dec 25, 2024Updated last year
- [ICLR 2025] Video-STaR: Self-Training Enables Video Instruction Tuning with Any Supervision☆72Jul 10, 2024Updated last year
- Code for the paper: "Modular Neural Image Signal Processing". A modular neural ISP with interpretable stages, multi-style rendering, cros…☆33Jan 19, 2026Updated 2 months ago
- Code for "SePPO: Semi-Policy Preference Optimization for Diffusion Alignment."☆18Oct 7, 2024Updated last year
- [NAACL 2025] Source code for MMEvalPro, a more trustworthy and efficient benchmark for evaluating LMMs☆25Sep 26, 2024Updated last year
- ☆15Nov 13, 2025Updated 4 months ago
- [ECCV'24] MaxFusion: Plug & Play multimodal generation in text to image diffusion models☆27Nov 2, 2024Updated last year
- [IJCV 2026] HiPrompt: Tuning-free Higher-Resolution Generation with Hierarchical MLLM Prompts☆26Feb 28, 2025Updated last year
- Official pytorch implementation of WACV 2023 Paper "Proactive Deepfake Defence via Identity Watermarking" for both training and evaluatio…☆24Feb 21, 2023Updated 3 years ago
- This repo contains evaluation code for the paper "AV-Odyssey: Can Your Multimodal LLMs Really Understand Audio-Visual Information?"☆31Dec 23, 2024Updated last year
- This repo contains code and data for ICLR 2025 paper MIA-Bench: Towards Better Instruction Following Evaluation of Multimodal LLMs☆37Mar 9, 2025Updated last year
- [ICCV 2025] Official Repository of VideoLLaMB: Long Video Understanding with Recurrent Memory Bridges☆84Feb 27, 2025Updated last year
- Comprehensive Assessment of Trustworthiness in Multimodal Foundation Models☆27Mar 15, 2025Updated last year
- [🏆 IJCV 2025 & ACCV 2024 Best Paper Honorable Mention] Official pytorch implementation of the paper "High-Quality Visually-Guided Sound …☆28Nov 1, 2025Updated 4 months ago