IBM/KVP10k

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/IBM/KVP10k)

IBM / KVP10k

Repository for the KVP10k dataset

☆23

Alternatives and similar repositories for KVP10k

Users that are interested in KVP10k are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

TenMilesLotus / DTSM
View on GitHub
Code and data for the paper: DTSM: Toward Dense Table Structure Recognition with Text Query Encoder and Adjacent Feature Aggregator
☆13Apr 28, 2024Updated 2 years ago
SCUT-DLVCLab / RFUND
View on GitHub
[MM'2024] Official release of RFUND introduced in the MM'2024 paper "PEneo: Unifying Line Extraction, Line Grouping, and Entity Linking f…
☆21Dec 4, 2024Updated last year
lcy0604 / CTRNet-plus
View on GitHub
The official implement of CTRNet++.
☆15Dec 30, 2024Updated last year
clovaai / webvicob
View on GitHub
Official Implementation of Web-based Visual Corpus Builder (Webvicob), ICDAR 2023
☆110Oct 24, 2023Updated 2 years ago
whlscut / DocLayLLM
View on GitHub
[CVPR 2025] DocLayLLM: An Efficient Multi-modal Extension of Large Language Models for Text-rich Document Understanding
☆30Dec 18, 2025Updated 7 months ago
Serverless GPU API endpoints on Runpod - Get Bonus Credits • Ad
Skip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
uakarsh / TiLT-Implementation
View on GitHub
Implementation of the paper: Going Full-TILT Boogie on Document Understanding with Text-Image-Layout Transformer.
☆18Apr 23, 2023Updated 3 years ago
saifullah3396 / docxclassifier
View on GitHub
☆17Jul 11, 2024Updated 2 years ago
shengfly / writer-identification
View on GitHub
☆11Jun 3, 2025Updated last year
harrytea / UDoc-GAN
View on GitHub
Official PyTorch implementation for ACM MM22 "UDoc-GAN: Unpaired Document Illumination Correction with Background Light Prior"
☆25Aug 5, 2024Updated last year
valentyn1boreiko / SVCEs_code
View on GitHub
☆13Jun 23, 2022Updated 4 years ago
jfkuang / CFAM
View on GitHub
Contrast-guided Feature Adjustment Module for Visual Information Extraction
☆30May 23, 2023Updated 3 years ago
RichSu95 / Document_Binarization_Collection
View on GitHub
This repository is a concise collection of well known deep learning based document binarization models.
☆30Dec 24, 2022Updated 3 years ago
Sreyan88 / DALE
View on GitHub
Code for EMNLP 2023 paper: DALE: Generative Data Augmentation for Low-Resource Legal NLP
☆11Oct 27, 2023Updated 2 years ago
HCIILAB / LAST
View on GitHub
Read Ten Lines at One Glance: Line-Aware Semi-Autoregressive Transformer for Multi-Line Handwritten Mathematical Expression Recognition
☆28Aug 29, 2023Updated 2 years ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
A-Ijishakin / Contrast-DiffAE
View on GitHub
☆15Aug 8, 2023Updated 2 years ago
chongzhangFDU / Token-Path-Prediction-Datasets
View on GitHub
This is the official repository of the revised datasets FUNSD-r and CORD-r, introduced in EMNLP 2023 paper Reading Order Matters: Informa…
☆17Mar 20, 2024Updated 2 years ago
peterbhase / ExplanationSearch
View on GitHub
Code for paper "Search Methods for Sufficient, Socially-Aligned Feature Importance Explanations with In-Distribution Counterfactuals"
☆18Oct 17, 2022Updated 3 years ago
NExTplusplus / TAT-DQA
View on GitHub
TAT-DQA: Towards Complex Document Understanding By Discrete Reasoning
☆24Sep 17, 2024Updated last year
darrow-labs / LegalLens
View on GitHub
☆10Jul 15, 2024Updated 2 years ago
chenllliang / CTDNN
View on GitHub
MMM 2021: Crossed-Time Delay Neural Network for Speaker Recognition
☆11Dec 4, 2021Updated 4 years ago
baoy-nlp / DSS-VAE-pytorch
View on GitHub
Generating Sentences from Disentangled Syntactic and Semantic Spaces
☆11Jun 24, 2019Updated 7 years ago
yufanchen96 / RoDLA
View on GitHub
RoDLA: Benchmarking the Robustness of Document Layout Analysis Models
☆39Mar 26, 2025Updated last year
pkunlp-icler / SCL-RAI
View on GitHub
Code for "SCL-RAI: Span-based Contrastive Learning with Retrieval Augmented Inference for Unlabeled Entity Problem in NER" @COLING-2022
☆11Aug 20, 2022Updated 3 years ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
GeorgeLuImmortal / DocLLM_reimplementation
View on GitHub
☆22Mar 18, 2024Updated 2 years ago
clovaai / spade
View on GitHub
☆82Jun 12, 2023Updated 3 years ago
JiahaoHuang99 / DAGAN_PyTorch
View on GitHub
Re-implement DAGAN in the PyTorch
☆13Jan 29, 2022Updated 4 years ago
JierunChen / SFT-RL-SynergyDilemma
View on GitHub
☆15Jan 14, 2026Updated 6 months ago
ZZZHANG-jx / GCDRNet
View on GitHub
[TAI 2023] Appearance Enhancement for Camera-captured Document Images in the Wild
☆58Aug 28, 2025Updated 10 months ago
gabriel-salomon / ufpr-admr-v2-dataset
View on GitHub
UFPR-ADMR-v2: a dataset for image-based automatic meter reading containing 5,000 dial meter images obtained on-site by employees of the E…
☆11Oct 22, 2022Updated 3 years ago
xhli-git / DocSAM
View on GitHub
☆33Apr 8, 2025Updated last year
daviddongkc / DocOIE
View on GitHub
Released Code for ACL 21 paper: DocOIE A Document-level Context-Aware Dataset for OpenIE
☆15Nov 25, 2022Updated 3 years ago
liyingxuan1012 / zeroshot-speaker-prediction
View on GitHub
Official repository of "Zero-Shot Character Identification and Speaker Prediction in Comics via Iterative Multimodal Fusion" (ACMMM 2024)
☆16Oct 31, 2024Updated last year
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
microsoft / UDOP
View on GitHub
☆250Jan 22, 2023Updated 3 years ago
jonathan-roberts1 / SciFIBench
View on GitHub
NeurIPS 2024: SciFIBench: Benchmarking Large Multimodal Models for Scientific Figure Interpretation
☆13May 24, 2025Updated last year
SivilTaram / FollowUp
View on GitHub
public dataset for followup-query analysis, accepted by AAAI2019
☆15Aug 22, 2019Updated 6 years ago
ZZZHANG-jx / WMeter-Reader
View on GitHub
[TIM 2025] Towards Accurate Readings of Water Meters by Eliminating Transition Error: New Dataset and Effective Solution
☆19Mar 5, 2025Updated last year
lcy0604 / QT-TextSR
View on GitHub
This repository is the implementation of "QT-TextSR: Enhancing scene text image super-resolution via efficient interaction with text reco…
☆20Jul 9, 2025Updated last year
nttmdlab-nlp / InstructDoc
View on GitHub
InstructDoc: A Dataset for Zero-Shot Generalization of Visual Document Understanding with Instructions (AAAI2024)
☆162May 31, 2024Updated 2 years ago
izhx / uni-rep
View on GitHub
Code for embedding and retrieval research.
☆16Oct 24, 2023Updated 2 years ago