zijianchen98 / OBI-BenchLinks
[ICLR'25] The first benchmark aiming to evaluate whether LMMs can assist oracle bone inscription processing tasks
☆20Updated 8 months ago
Alternatives and similar repositories for OBI-Bench
Users that are interested in OBI-Bench are comparing it to the libraries listed below
Sorting:
- ☆13Updated last year
- Assessing Context-Aware Creative Intelligence in MLLMs☆23Updated 4 months ago
- [arXiv 25] Aesthetics is Cheap, Show me the Text: An Empirical Evaluation of State-of-the-Art Generative Models for OCR☆243Updated 3 months ago
- Official repository for LLaVA-Reward (ICCV 2025): Multimodal LLMs as Customized Reward Models for Text-to-Image Generation☆22Updated 4 months ago
- MMMG: A Massive, Multidisciplinary, Multi-Tier Generation Benchmark for Text-to-Image Reasoning [NeurIPS 2025 Poster]☆22Updated last week
- Oracle Bone Script data collected by VLRLab of HUST☆63Updated last year
- [CVPR 2024] Dynamic Prompt Optimizing for Text-to-Image Generation☆84Updated last year
- [NeurIPS 2025 Spotlight] Think or Not Think: A Study of Explicit Thinking in Rule-Based Visual Reinforcement Fine-Tuning☆77Updated 3 months ago
- [NeurIPS'24] I2EBench: A Comprehensive Benchmark for Instruction-based Image Editing☆27Updated last week
- 🔥🔥[NeurIPS2025]Exploring and mitigating semantic hallucinations in scene text perception and reasoning☆21Updated last week
- [NeurIPS'24] Official PyTorch Implementation of Seeing the Image: Prioritizing Visual Correlation by Contrastive Alignment☆58Updated last year
- [ICLR'25] Geometric Problem Solving Through Unified Formalized Vision-Language Pre-training☆46Updated 10 months ago
- The official repo for “TextCoT: Zoom In for Enhanced Multimodal Text-Rich Image Understanding”.☆44Updated last year
- Official implement of MIA-DPO☆67Updated 10 months ago
- Doodling our way to AGI ✏️ 🖼️ 🧠☆118Updated 6 months ago
- [IJCV 2025] Smaller But Better: Unifying Layout Generation with Smaller Large Language Models☆149Updated 4 months ago
- MultiMath: Bridging Visual and Mathematical Reasoning for Large Language Models☆31Updated 10 months ago
- A Large-scale Dataset for training and evaluating model's ability on Dense Text Image Generation☆85Updated 2 months ago
- Official repository for CoMM Dataset☆48Updated 11 months ago
- Text Image Inpainting via Global Structure-Guided Diffusion Models (Accepted by AAAI-24)☆74Updated 8 months ago
- [ICLR2025] MMIU: Multimodal Multi-image Understanding for Evaluating Large Vision-Language Models☆92Updated last year
- EditScore: Unlocking Online RL for Image Editing via High-Fidelity Reward Modeling☆180Updated last month
- Continuous diffusion for layout generation☆52Updated 10 months ago
- a unified reinforcement learning toolbox for joint RL on language models and diffusion models☆33Updated last month
- [ICML 2024] On Discrete Prompt Optimization for Diffusion Models - Google☆63Updated last year
- Unified layout planning and image generation, ICCV2025☆39Updated 8 months ago
- ☆53Updated 9 months ago
- ☆46Updated last month
- [AAAI 2025]Math-PUMA: Progressive Upward Multimodal Alignment to Enhance Mathematical Reasoning☆41Updated 8 months ago
- (ICLR 2025 Spotlight) DEEM: Official implementation of Diffusion models serve as the eyes of large language models for image perception.☆44Updated 5 months ago