tianyi-lab / ColorBench
[NeurIPS'25] ColorBench: Can VLMs See and Understand the Colorful World? A Comprehensive Benchmark for Color Perception, Reasoning, and Robustness
☆29Updated 2 months ago
Alternatives and similar repositories for ColorBench
Users who are interested in ColorBench are comparing it to the libraries listed below.
- Official repo for the paper "EXIF as Language: Learning Cross-Modal Associations Between Images and Camera Metadata"☆49Updated 2 years ago
- Training Autoregressive Image Generation models via Reinforcement Learning☆48Updated 3 weeks ago
- Visual Instruction-guided Explainable Metric. Code for "Towards Explainable Metrics for Conditional Image Synthesis Evaluation" (ACL 2024…☆57Updated last year
- ☆93Updated 9 months ago
- Official Repository of Personalized Visual Instruct Tuning☆33Updated 9 months ago
- ☆46Updated last month
- ☆56Updated 7 months ago
- [ICLR 2024] Official code for the paper "LLM Blueprint: Enabling Text-to-Image Generation with Complex and Detailed Prompts"☆82Updated last year
- [CVPR 2024] Dynamic Prompt Optimizing for Text-to-Image Generation☆84Updated last year
- Code and data for the paper: Learning Action and Reasoning-Centric Image Editing from Videos and Simulation☆32Updated 5 months ago
- ☆23Updated 6 months ago
- [CVPR 2025 AI4CC Workshop] Official Implementation of HumanEdit: A High-Quality Human-Rewarded Dataset for Instruction-based Image Editin…☆35Updated 7 months ago
- [ECCV 2024] Parrot Captions Teach CLIP to Spot Text☆66Updated last year
- INF-LLaVA: Dual-perspective Perception for High-Resolution Multimodal Large Language Model☆42Updated last year
- We propose FlexEdit, an end-to-end image editing method that leverages both free-shape masks and language instructions for Flexible Editi…☆32Updated last year
- Concept Lancet: Image Editing with Compositional Representation Transplant (CVPR 2025)☆19Updated 8 months ago
- Task Preference Optimization: Improving Multimodal Large Language Models with Vision Task Alignment☆63Updated 4 months ago
- Replication in Visual Diffusion Models: A Survey and Outlook☆31Updated last year
- ☆14Updated last year
- ☆41Updated last year
- ☆18Updated last year
- [Preprint] GMem: A Modular Approach for Ultra-Efficient Generative Models☆40Updated 9 months ago
- Repo for "Q-Eval-100K: Evaluating Visual Quality and Alignment Level for Text-to-Vision Content"☆38Updated 6 months ago
- ☆26Updated 9 months ago
- Edit Away and My Face Will not Stay: Personal Biometric Defense against Malicious Generative Editing☆49Updated last year
- Official code for ICLR 2024 paper "Do Generated Data Always Help Contrastive Learning?"☆31Updated last year
- Fixed official code for the paper "A Closer Look at Parameter-Efficient Tuning in Diffusion Models".☆42Updated 2 years ago
- [CVPR 2025] PVC: Progressive Visual Token Compression for Unified Image and Video Processing in Large Vision-Language Models☆50Updated 6 months ago
- Official PyTorch codes for "Enhancing Diffusion Models with Text-Encoder Reinforcement Learning", ECCV2024☆57Updated last year
- [AAAI 2026] Official Code for VQAThinker: Exploring Generalizable and Explainable Video Quality Assessment via Reinforcement Learning☆17Updated 2 weeks ago