zijianchen98 / OBI-BenchLinks
[ICLR'25] The first benchmark aiming to evaluate whether LMMs can assist oracle bone inscription processing tasks
☆14Updated 2 months ago
Alternatives and similar repositories for OBI-Bench
Users that are interested in OBI-Bench are comparing it to the libraries listed below
Sorting:
- Evaluating GPT-4o's image generation and editing ability in OCR tasks.☆47Updated 2 months ago
- Text Image Inpainting via Global Structure-Guided Diffusion Models (Accepted by AAAI-24)☆69Updated 2 months ago
- Official code implementation of " TextDiff: Mask-Guided Residual Diffusion Models for Scene Text Image " in Pattern Recognition☆24Updated last year
- Oracle Bone Script data collected by VLRLab of HUST☆47Updated 9 months ago
- [NeurIPS2024 D&B Spotlight] GAIA: Rethinking Action Quality Assessment for AI-Generated Videos☆27Updated 2 months ago
- Update the latest text-related papers from top conferences☆25Updated 2 months ago
- AGIQA-1k-Database for AI Generated Content Image Quality Assessment☆27Updated 2 years ago
- The official project of paper "Visual Text Meets Low-level Vision: A Comprehensive Survey on Visual Text Processing"☆66Updated this week
- [ACL 2024 Best Paper] Deciphering Oracle Bone Language with Diffusion Models☆104Updated last month
- Reproducing the Past: A Dataset for Benchmarking Inscription Restoration (Accepted by ACM MM'24, Oral)☆12Updated last month
- ④[ECCV 2024 Oral, Comparison among Multiple Images!] A study on open-ended multi-image quality comparison: a dataset, a model and a bench…☆79Updated 8 months ago
- PEAN: A Diffusion-Based Prior-Enhanced Attention Network for Scene Text Image Super-Resolution (ACMMM 2024)☆41Updated 5 months ago
- [IEEE TCSVT2023] A Fine-grained Subjective Perception & Alignment Database for AI Generated Image Quality Assessment☆63Updated last year
- ☆15Updated 3 months ago
- [AAAI2024] Official PyTorch implementation of VQ-Font: Few-Shot Font Generation with Structure-Aware Enhancement and Quantization.☆48Updated last year
- ☆52Updated 10 months ago
- [ACMMM 2024] AesExpert: Towards Multi-modality Foundation Model for Image Aesthetics Perception☆83Updated 4 months ago
- A collection of AI-generated images papers and corresponding source code/demo program, including text-to-image, image translation (e.g., …☆12Updated last year
- Official released code for VQA² series models☆44Updated last month
- Official implementation of ViTEraser: Harnessing the Power of Vision Transformers for Scene Text Removal with SegMIM Pretraining (AAAI 20…☆52Updated 11 months ago
- NAF-DPM: A Nonlinear Activation-Free Diffusion Probabilistic Model for Document Enhancement☆45Updated 10 months ago
- Official code for CVPR 2024 paper: Discriminative Probing and Tuning for Text-to-Image Generation☆32Updated 2 months ago
- 【ICDAR 2024】Coarse-to-Fine Document Image Registration for Dewarping☆18Updated 10 months ago
- The public code for "PromptIQA: Boosting the Performance and Generalization for No-Reference Image Quality Assessment via Prompts"☆24Updated 3 months ago
- Project of AI3604 Computer Vision, 2023 Fall, SJTU☆20Updated 8 months ago
- The official repo for “TextCoT: Zoom In for Enhanced Multimodal Text-Rich Image Understanding”.☆40Updated 8 months ago
- Official codes for "Q-Ground: Image Quality Grounding with Large Multi-modality Models", ACM MM2024 (Oral)☆41Updated 7 months ago
- ☆23Updated 3 months ago
- ☆20Updated 10 months ago
- The official code for the CVPR 2024 paper: Multi-modal In-Context Learning Makes an Ego-evolving Scene Text Recognizer☆53Updated 11 months ago