The WordScape repository contains the code for the WordScape pipeline, which creates datasets for training document understanding models.
☆41 · Dec 7, 2023 · Updated 2 years ago
Alternatives and similar repositories for WordScape
Users that are interested in WordScape are comparing it to the libraries listed below.
- ☆70 · Jan 9, 2024 · Updated 2 years ago
- Index of URLs to pdf files all over the internet and scripts · ☆24 · May 2, 2023 · Updated 3 years ago
- Binarizing Documents by Leveraging both Space and Frequency. (ICDAR 2024) · ☆18 · May 15, 2025 · Updated last year
- InstructDoc: A Dataset for Zero-Shot Generalization of Visual Document Understanding with Instructions (AAAI2024) · ☆162 · May 31, 2024 · Updated 2 years ago
- Dataset and scripts for HRDoc · ☆41 · Jun 21, 2023 · Updated 3 years ago
- Create TensorRT-runtime for vietocr · ☆12 · Jun 8, 2021 · Updated 5 years ago
- ☆37 · Jan 26, 2026 · Updated 5 months ago
- "Towards Improving Document Understanding: An Exploration on Text-Grounding via MLLMs" 2023 · ☆16 · Nov 28, 2024 · Updated last year
- HTML in Python · ☆13 · Jul 19, 2024 · Updated last year
- JSON Schema format for storing datasets details, documents processed contents, and documents annotations in the document understanding do… · ☆14 · Nov 5, 2024 · Updated last year
- ☆15 · Apr 12, 2023 · Updated 3 years ago
- Read Ten Lines at One Glance: Line-Aware Semi-Autoregressive Transformer for Multi-Line Handwritten Mathematical Expression Recognition · ☆28 · Aug 29, 2023 · Updated 2 years ago
- Official Implementation of SynthTIGER (Synthetic Text Image Generator), ICDAR 2021 · ☆578 · Jun 14, 2024 · Updated 2 years ago
- Official implementation of Generative Colorization of Structured Mobile Web Pages, WACV 2023. · ☆22 · Dec 7, 2023 · Updated 2 years ago
- Reproduction paper --- PDFTriage : Question Answering over Long, Structured Documents · ☆42 · Jan 16, 2024 · Updated 2 years ago
- ☆19 · Jul 7, 2025 · Updated 11 months ago
- ☆61 · Aug 18, 2021 · Updated 4 years ago
- ☆27 · Feb 20, 2024 · Updated 2 years ago
- ☆83 · Apr 12, 2022 · Updated 4 years ago
- Repository for the KVP10k dataset · ☆23 · Sep 18, 2025 · Updated 9 months ago
- The most comprehensive Chinese Telegraph Code table · ☆13 · Jul 5, 2015 · Updated 10 years ago
- v1: Learning to Point Visual Tokens for Multimodal Grounded Reasoning · ☆20 · Oct 6, 2025 · Updated 8 months ago
- PyTorch implementation of BMVC2022 paper Masked Vision-Language Transformers for Scene Text Recognition · ☆29 · Nov 11, 2022 · Updated 3 years ago
- It's the code for the paper Pushing the Performance Limit of Scene Text Recognizer without Human Annotation, CVPR 2022. · ☆28 · Jul 6, 2022 · Updated 3 years ago
- ☆42 · Sep 2, 2023 · Updated 2 years ago
- This is the official repository of the revised datasets FUNSD-r and CORD-r, introduced in EMNLP 2023 paper Reading Order Matters: Informa… · ☆17 · Mar 20, 2024 · Updated 2 years ago
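- weixin125: Design and implementation of a personal health data management system (WeChat mini program + SSM backend), graduation project source code and case design · ☆11 · Feb 28, 2024 · Updated 2 years ago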
- A python implementation of PROCLUS: PROjected CLUStering algorithm. · ☆10 · Jan 12, 2015 · Updated 11 years ago
- HOCR Specification Python Parser · ☆12 · Sep 23, 2015 · Updated 10 years ago
- ☆10 · Aug 5, 2019 · Updated 6 years ago
- ☆19 · Feb 5, 2026 · Updated 4 months ago
- Official implementation of the ANLS* metric · ☆25 · Jun 22, 2026 · Updated last week
- The official code for “DeepEraser: Deep Iterative Context Mining for Generic Text Eraser”, TMM, 2024. · ☆52 · Aug 26, 2024 · Updated last year
- OCR-VQGAN, a discrete image encoder (tokenizer and detokenizer) for figure images in Paper2Fig100k dataset. Implementation of OCR Percept… · ☆84 · Jan 30, 2023 · Updated 3 years ago
- Synthetic samples for bill and receipt recognition · ☆12 · Apr 23, 2021 · Updated 5 years ago
- Online visual analytics tool designed to investigate how attention maps in transformer models behave, and build hypotheses on those mode… · ☆10 · Nov 10, 2021 · Updated 4 years ago
- Self Evolving Large Multimodal Models with Continuous Rewards · ☆24 · Jun 9, 2026 · Updated 3 weeks ago
- DocGenome: An Open Large-scale Scientific Document Benchmark for Training and Testing Multi-modal Large Models · ☆155 · Jan 13, 2025 · Updated last year
- NNVisBuilder and some cases including KD-t · ☆12 · Nov 18, 2023 · Updated 2 years ago