naver-ai/cream

Visually-Situated Natural Language Understanding with Contrastive Reading Model and Frozen Large Language Models, EMNLP 2023

☆46

Alternatives and similar repositories for cream

Users that are interested in cream are comparing it to the libraries listed below.

clovaai / webvicob
Official Implementation of Web-based Visual Corpus Builder (Webvicob), ICDAR 2023
☆110 · Oct 24, 2023 · Updated 2 years ago
naver-ai / scob
Official Implementation of SCOB [ICCV 2023]
☆23 · Nov 16, 2023 · Updated 2 years ago
clovaai / units
☆78 · Aug 7, 2023 · Updated 2 years ago
naver-ai / tablevqabench
☆47 · May 21, 2024 · Updated 2 years ago
clovaai / bros
☆163 · Dec 27, 2022 · Updated 3 years ago
InsightsNet / texannotate
☆38 · Jan 26, 2026 · Updated 6 months ago
herobd / dessurt
Official implementation for Dessurt: Document end-to-end self-supervised understanding and recognition transformer
☆62 · Jan 11, 2023 · Updated 3 years ago
hoangtuanvu / conformer_ocr
Transformer OCR is an Optical Character Recognition toolkit built for researchers working on OCR for both Vietnamese and English. This…
☆10 · Dec 27, 2021 · Updated 4 years ago
WenjinW / LATIN-Prompt
☆52 · May 28, 2024 · Updated 2 years ago
google-research-datasets / hiertext
The HierText dataset contains ~12k images from the Open Images dataset v6 with a large amount of text entities. We provide word, line and p…
☆316 · Dec 2, 2024 · Updated last year
jfma-USTC / HRDoc
Dataset and scripts for HRDoc
☆42 · Jun 21, 2023 · Updated 3 years ago
IBM / KVP10k
Repository for the KVP10k dataset
☆23 · Sep 18, 2025 · Updated 10 months ago
kh-kim / nlp-express-practice
☆10 · Jan 20, 2024 · Updated 2 years ago
clovaai / synthtiger
Official Implementation of SynthTIGER (Synthetic Text Image Generator), ICDAR 2021
☆579 · Jun 14, 2024 · Updated 2 years ago
naver-ai / rdnet
[ECCV2024] Official implementation of paper, "DenseNets Reloaded: Paradigm Shift Beyond ResNets and ViTs".
☆155 · Aug 8, 2024 · Updated last year
nttmdlab-nlp / InstructDoc
InstructDoc: A Dataset for Zero-Shot Generalization of Visual Document Understanding with Instructions (AAAI2024)
☆162 · May 31, 2024 · Updated 2 years ago
johnning2333 / M2Doc
☆43 · Jun 15, 2024 · Updated 2 years ago
OSU-slatelab / MapQA
☆15 · Jan 9, 2026 · Updated 6 months ago
rubenpt91 / MP-DocVQA-Framework
☆72 · Jan 9, 2024 · Updated 2 years ago
HCIILAB / M6Doc
☆166 · May 8, 2025 · Updated last year
clovaai / CLEval
CLEval: Character-Level Evaluation for Text Detection and Recognition Tasks
☆187 · Oct 17, 2023 · Updated 2 years ago
navervision / KELIP
Official PyTorch implementation of "Large-scale Bilingual Language-Image Contrastive Learning" (ICLRW 2022)
☆96 · Apr 13, 2022 · Updated 4 years ago
ONground-Korea / 2023-AIKU_DeepLearning-Bootcamp
2023-1 Korea University AIKU Deep Learning Vacation Bootcamp: Deep into Deep
☆10 · Jul 10, 2023 · Updated 3 years ago
microsoft / CompHRDoc
Datasets and Evaluation Scripts for CompHRDoc
☆59 · Feb 25, 2025 · Updated last year
rosinality / tensorfn
Weakly opinionated library for implementing ML models. Less boilerplate, More rigor
☆21 · Jul 1, 2022 · Updated 4 years ago
rosinality / nerf-pytorch
☆21 · May 23, 2022 · Updated 4 years ago
google-research / pix2struct
☆686 · Jul 8, 2026 · Updated 3 weeks ago
sparkfish / augraphy
Augmentation pipeline for rendering synthetic paper printing, faxing, scanning and copy machine processes
☆560 · Jul 20, 2025 · Updated last year
furkanbiten / idl_data
OCR Annotations from Amazon Textract for Industry Documents Library
☆103 · Aug 20, 2022 · Updated 3 years ago
harrytea / TGDoc
"Towards Improving Document Understanding: An Exploration on Text-Grounding via MLLMs" 2023
☆16 · Nov 28, 2024 · Updated last year
Yuliang-Liu / AlphaOracle
[Innovation 2026] Oracle bone script decipherment via human-workflow-inspired deep learning
☆31 · Jul 22, 2026 · Updated last week
MAEHCM / ICL-D3IE
Code for ICCV 2023 Paper : “ICL-D3IE: In-Context Learning with Diverse Demonstrations Updating for Document Information Extraction”
☆54 · Aug 8, 2023 · Updated 2 years ago
GChrysostomou / ood_faith
☆13 · Jul 26, 2023 · Updated 3 years ago
Yuliang-Liu / VimTS
VimTS: A Unified Video and Image Text Spotter
☆79 · Nov 10, 2024 · Updated last year
HanSolo9682 / CounterCurate
This is the implementation of CounterCurate, the data curation pipeline of both physical and semantic counterfactual image-caption pairs.
☆19 · Jun 27, 2024 · Updated 2 years ago
xinke-wang / Awesome-Text-VQA
☆188 · May 8, 2024 · Updated 2 years ago
microsoft / UDOP
☆250 · Jan 22, 2023 · Updated 3 years ago
Yuliang-Liu / MultimodalOCR
On the Hidden Mystery of OCR in Large Multimodal Models (OCRBench)
☆873 · Jul 22, 2026 · Updated last week
xiangyue9607 / QVE
Code for the ACL2022 paper "Synthetic Question Value Estimation for Domain Adaptation of Question Answering"
☆18 · Mar 21, 2022 · Updated 4 years ago