The WordScape repository contains code for the WordScape pipeline to create datasets to train document understanding models.
☆41 · Dec 7, 2023 · Updated 2 years ago
Alternatives and similar repositories for WordScape
Users that are interested in WordScape are comparing it to the libraries listed below.
- ☆70 · Jan 9, 2024 · Updated 2 years ago
- Index of URLs to pdf files all over the internet and scripts ☆25 · May 2, 2023 · Updated 2 years ago
- InstructDoc: A Dataset for Zero-Shot Generalization of Visual Document Understanding with Instructions (AAAI2024) ☆163 · May 31, 2024 · Updated last year
- Dataset and scripts for HRDoc ☆41 · Jun 21, 2023 · Updated 2 years ago
- Create TensorRT-runtime for vietocr ☆12 · Jun 8, 2021 · Updated 4 years ago
- ☆37 · Jan 26, 2026 · Updated 3 months ago
- ☆40 · Aug 18, 2021 · Updated 4 years ago
- HTML in Python ☆12 · Jul 19, 2024 · Updated last year
- JSON Schema format for storing datasets details, documents processed contents, and documents annotations in the document understanding do… ☆14 · Nov 5, 2024 · Updated last year
- Ongoing research project for code&math LLMs ☆31 · Jul 4, 2025 · Updated 9 months ago
- Read Ten Lines at One Glance: Line-Aware Semi-Autoregressive Transformer for Multi-Line Handwritten Mathematical Expression Recognition ☆28 · Aug 29, 2023 · Updated 2 years ago
- Official Implementation of SynthTIGER (Synthetic Text Image Generator), ICDAR 2021 ☆576 · Jun 14, 2024 · Updated last year
- Official implementation of Generative Colorization of Structured Mobile Web Pages, WACV 2023. ☆22 · Dec 7, 2023 · Updated 2 years ago
- VisualMRC: Machine Reading Comprehension on Document Images (AAAI2021) ☆57 · Mar 31, 2025 · Updated last year
- ☆60 · Aug 18, 2021 · Updated 4 years ago
- ☆27 · Feb 20, 2024 · Updated 2 years ago
- ☆82 · Apr 12, 2022 · Updated 4 years ago
- Repository for the KVP10k dataset ☆22 · Sep 18, 2025 · Updated 7 months ago
- The most comprehensive Chinese Telegraph Code table ☆12 · Jul 5, 2015 · Updated 10 years ago
- v1: Learning to Point Visual Tokens for Multimodal Grounded Reasoning ☆19 · Oct 6, 2025 · Updated 6 months ago
- PyTorch implementation of BMVC2022 paper Masked Vision-Language Transformers for Scene Text Recognition ☆29 · Nov 11, 2022 · Updated 3 years ago
- It's the code for the paper Pushing the Performance Limit of Scene Text Recognizer without Human Annotation, CVPR 2022. ☆28 · Jul 6, 2022 · Updated 3 years ago
- ☆42 · Sep 2, 2023 · Updated 2 years ago
- [EMNLP2020] End-to-End Emotion-Cause Pair Extraction based on SlidingWindow Multi-Label Learning ☆20 · Oct 13, 2020 · Updated 5 years ago
- This is the official repository of the revised datasets FUNSD-r and CORD-r, introduced in EMNLP 2023 paper Reading Order Matters: Informa… ☆17 · Mar 20, 2024 · Updated 2 years ago
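- weixin125: Design and implementation of a personal health data management system (WeChat mini program + SSM backend), graduation project source code example ☆11 · Feb 28, 2024 · Updated 2 years ago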
- Repo for the paper: Towards Few-shot Entity Recognition in Document Images: A Label-aware Sequence-to-Sequence Framework ☆14 · May 31, 2023 · Updated 2 years ago
- A python implementation of PROCLUS: PROjected CLUStering algorithm. ☆10 · Jan 12, 2015 · Updated 11 years ago
- ☆19 · Feb 5, 2026 · Updated 2 months ago
- Code for paper 'Data-Efficient FineTuning' ☆28 · May 24, 2023 · Updated 2 years ago
- ACM Multimedia 2023: DocDiff: Document Enhancement via Residual Diffusion Models. Also contains 1597 red seals in Chinese scenes, along w… ☆346 · Aug 22, 2024 · Updated last year
- Official implementation of the ANLS* metric ☆22 · Updated this week
- ☆14 · Jan 11, 2022 · Updated 4 years ago
- Source for Action Schema Networks paper (AAAI'18) ☆32 · Apr 6, 2023 · Updated 3 years ago
- ☆14 · Jan 21, 2019 · Updated 7 years ago
- A Collection of Pydantic Models to Abstract IRL ☆39 · Dec 10, 2025 · Updated 4 months ago
- Online visual analytics tool designed to investigate how attention maps in transformer models behaves, and build hypothesis on those mode… ☆10 · Nov 10, 2021 · Updated 4 years ago
- Official implementation for "SPIRIT: Style-guided Patch Interaction for Fashion Image Retrieval with Text Feedback" ☆16 · Oct 27, 2025 · Updated 6 months ago
- [CVPR 2026 (Findings) 🔥🔥] Self Evolving Large Multimodal Models with Continuous Rewards ☆23 · Mar 5, 2026 · Updated last month