zijianchen98 / OBI-BenchLinks
[ICLR'25] The first benchmark aiming to evaluate whether LMMs can assist oracle bone inscription processing tasks
☆20Updated 9 months ago
Alternatives and similar repositories for OBI-Bench
Users that are interested in OBI-Bench are comparing it to the libraries listed below
Sorting:
- [arXiv 25] Aesthetics is Cheap, Show me the Text: An Empirical Evaluation of State-of-the-Art Generative Models for OCR☆245Updated 4 months ago
- ☆13Updated last year
- [IJCV 2025] Smaller But Better: Unifying Layout Generation with Smaller Large Language Models☆149Updated 5 months ago
- [NeurIPS'24] I2EBench: A Comprehensive Benchmark for Instruction-based Image Editing☆29Updated last month
- [ICLR2025] MMIU: Multimodal Multi-image Understanding for Evaluating Large Vision-Language Models☆92Updated last year
- [CVPR 2024] Dynamic Prompt Optimizing for Text-to-Image Generation☆84Updated last year
- ☆53Updated 9 months ago
- Text Image Inpainting via Global Structure-Guided Diffusion Models (Accepted by AAAI-24)☆75Updated 9 months ago
- Official Implementation of OpenING: A Comprehensive Benchmark for Judging Open-ended Interleaved Image-Text Generation☆37Updated 6 months ago
- [ICML 2024] On Discrete Prompt Optimization for Diffusion Models - Google☆63Updated last year
- [NeurIPS 2025 Spotlight] Think or Not Think: A Study of Explicit Thinking in Rule-Based Visual Reinforcement Fine-Tuning☆78Updated 3 months ago
- Oracle Bone Script data collected by VLRLab of HUST☆65Updated last year
- EditScore: Unlocking Online RL for Image Editing via High-Fidelity Reward Modeling☆191Updated last month
- Compositional Inversion for Stable Diffusion Models (AAAI 2024)☆37Updated 10 months ago
- The official repo for “TextCoT: Zoom In for Enhanced Multimodal Text-Rich Image Understanding”.☆44Updated last year
- Doodling our way to AGI ✏️ 🖼️ 🧠☆120Updated 7 months ago
- [ICCV2025] A Token-level Text Image Foundation Model for Document Understanding☆129Updated 4 months ago
- Official PyTorch Implementation of MLLM Is a Strong Reranker: Advancing Multimodal Retrieval-augmented Generation via Knowledge-enhanced …☆90Updated last year
- Official code for CVPR 2024 paper: Discriminative Probing and Tuning for Text-to-Image Generation☆33Updated 9 months ago
- [NeurIPS'24] Official PyTorch Implementation of Seeing the Image: Prioritizing Visual Correlation by Contrastive Alignment☆59Updated last year
- Visual Instruction-guided Explainable Metric. Code for "Towards Explainable Metrics for Conditional Image Synthesis Evaluation" (ACL 2024…☆60Updated last year
- [IEEE TPAMI 2025] Privacy-Preserving Biometric Verification With Handwritten Random Digit String☆65Updated 5 months ago
- 🔥🔥[NeurIPS2025]Exploring and mitigating semantic hallucinations in scene text perception and reasoning☆23Updated last month
- Unified layout planning and image generation, ICCV2025☆40Updated 8 months ago
- ☆18Updated last year
- Official repository for LLaVA-Reward (ICCV 2025): Multimodal LLMs as Customized Reward Models for Text-to-Image Generation☆22Updated 5 months ago
- A Large-scale Dataset for training and evaluating model's ability on Dense Text Image Generation☆86Updated 3 months ago
- Training A Small Emotional Vision Language Model for Visual Art Comprehension☆15Updated last year
- [CVPR2025] Official implementation of High Fidelity Scene Text Synthesis.☆78Updated 9 months ago
- a collection of awesome autoregressive visual generation models☆79Updated 8 months ago