The WordScape repository contains the code for the WordScape pipeline, which creates datasets for training document understanding models.
☆41Dec 7, 2023Updated 2 years ago
Alternatives and similar repositories for WordScape
Users who are interested in WordScape are comparing it to the libraries listed below.
- ☆14Sep 6, 2024Updated last year
- ☆70Jan 9, 2024Updated 2 years ago
- Index of URLs to pdf files all over the internet and scripts☆24May 2, 2023Updated 3 years ago
- InstructDoc: A Dataset for Zero-Shot Generalization of Visual Document Understanding with Instructions (AAAI2024)☆162May 31, 2024Updated 2 years ago
- Dataset and scripts for HRDoc☆41Jun 21, 2023Updated 2 years ago
- ☆37Jan 26, 2026Updated 4 months ago
- "Towards Improving Document Understanding: An Exploration on Text-Grounding via MLLMs" 2023☆16Nov 28, 2024Updated last year
- ☆40Aug 18, 2021Updated 4 years ago
- JSON Schema format for storing datasets details, documents processed contents, and documents annotations in the document understanding do…☆14Nov 5, 2024Updated last year
- Ongoing research project for code&math LLMs☆31Jul 4, 2025Updated 11 months ago
- ☆15Apr 12, 2023Updated 3 years ago
- Read Ten Lines at One Glance: Line-Aware Semi-Autoregressive Transformer for Multi-Line Handwritten Mathematical Expression Recognition☆28Aug 29, 2023Updated 2 years ago
- Official Implementation of SynthTIGER (Synthetic Text Image Generator), ICDAR 2021☆576Jun 14, 2024Updated last year
- ☆19Jul 7, 2025Updated 11 months ago
- ☆61Aug 18, 2021Updated 4 years ago
- ☆27Feb 20, 2024Updated 2 years ago
- The most comprehensive Chinese Telegraph Code table☆13Jul 5, 2015Updated 10 years ago
- v1: Learning to Point Visual Tokens for Multimodal Grounded Reasoning☆20Oct 6, 2025Updated 8 months ago
- PyTorch implementation of BMVC2022 paper Masked Vision-Language Transformers for Scene Text Recognition☆29Nov 11, 2022Updated 3 years ago
- Code for the CVPR 2022 paper Pushing the Performance Limit of Scene Text Recognizer without Human Annotation.☆28Jul 6, 2022Updated 3 years ago
- ☆42Sep 2, 2023Updated 2 years ago
- This is the official repository of the revised datasets FUNSD-r and CORD-r, introduced in EMNLP 2023 paper Reading Order Matters: Informa…☆17Mar 20, 2024Updated 2 years ago
- weixin125: Design and implementation of a personal health data management system (WeChat mini program + SSM backend), graduation project source code case study☆11Feb 28, 2024Updated 2 years ago
- Repo for the paper: Towards Few-shot Entity Recognition in Document Images:A Label-aware Sequence-to-Sequence Framework☆14May 31, 2023Updated 3 years ago
- A python implementation of PROCLUS: PROjected CLUStering algorithm.☆10Jan 12, 2015Updated 11 years ago
- TrOCR but 2 to 3 times faster☆11Oct 22, 2022Updated 3 years ago
- HOCR Specification Python Parser☆12Sep 23, 2015Updated 10 years ago
- A curated list of resources about long-context in large-language models and video understanding.☆32Aug 8, 2023Updated 2 years ago
- ☆10Aug 5, 2019Updated 6 years ago
- ☆19Feb 5, 2026Updated 4 months ago
- Using TensorRT to accelerate Segformer.☆11Oct 6, 2023Updated 2 years ago
- Code for paper 'Data-Efficient FineTuning'☆28May 24, 2023Updated 3 years ago
- ACM Multimedia 2023: DocDiff: Document Enhancement via Residual Diffusion Models. Also contains 1597 red seals in Chinese scenes, along w…☆348Aug 22, 2024Updated last year
- An unofficial Pytorch implementation of ERNIE-Layout which is originally released through PaddleNLP.☆107Nov 15, 2023Updated 2 years ago
- ☆11Jul 31, 2022Updated 3 years ago
- Official implementation of the ANLS* metric☆24Updated this week
- ☆14Jan 11, 2022Updated 4 years ago
- OCR-VQGAN, a discrete image encoder (tokenizer and detokenizer) for figure images in Paper2Fig100k dataset. Implementation of OCR Percept…☆84Jan 30, 2023Updated 3 years ago
- The official code for “DeepEraser: Deep Iterative Context Mining for Generic Text Eraser”, TMM, 2024.☆52Aug 26, 2024Updated last year