The WordScape repository contains code for the WordScape pipeline to create datasets to train document understanding models.
☆39Dec 7, 2023Updated 2 years ago
Alternatives and similar repositories for WordScape
Users that are interested in WordScape are comparing it to the libraries listed below
Sorting:
- ☆14Sep 6, 2024Updated last year
- ☆37Jan 26, 2026Updated last month
- ☆69Jan 9, 2024Updated 2 years ago
- InstructDoc: A Dataset for Zero-Shot Generalization of Visual Document Understanding with Instructions (AAAI2024)☆162May 31, 2024Updated last year
- Export Donut model to onnx and run it with onnxruntime☆23Nov 21, 2023Updated 2 years ago
- ☆27Feb 20, 2024Updated 2 years ago
- Various experimental NLP tasks for Khmer language☆34Sep 27, 2020Updated 5 years ago
- Read Ten Lines at One Glance: Line-Aware Semi-Autoregressive Transformer for Multi-Line Handwritten Mathematical Expression Recognition☆28Aug 29, 2023Updated 2 years ago
- Official Implementation of SynthTIGER (Synthetic Text Image Generator), ICDAR 2021☆569Jun 14, 2024Updated last year
- It's the code for the paper Pushing the Performance Limit of Scene Text Recognizer without Human Annotation, CVPR 2022.☆28Jul 6, 2022Updated 3 years ago
- OCR-VQGAN, a discrete image encoder (tokenizer and detokenizer) for figure images in Paper2Fig100k dataset. Implementation of OCR Percept…☆83Jan 30, 2023Updated 3 years ago
- A comprehensive paper list of Table-based Question Answering.☆37Sep 1, 2023Updated 2 years ago
- [CVPR2025] VDocRAG: Retirval-Augmented Generation over Visually-Rich Documents☆59May 26, 2025Updated 9 months ago
- A curated list of resources about long-context in large-language models and video understanding.☆32Aug 8, 2023Updated 2 years ago
- NNVisBuilder and some cases including KD-t☆12Nov 18, 2023Updated 2 years ago
- MRZ recognition from visa and passport documents.☆22Jan 13, 2026Updated last month
- Gurobi,GA,SA,PSO☆19Nov 18, 2019Updated 6 years ago
- ☆42Sep 2, 2023Updated 2 years ago
- ☆44Jul 9, 2024Updated last year
- A small project that uses Discrete Denoising Diffusion Probabilistic Models (D3PMs), a generative model for discrete data that builds upo…☆14Aug 10, 2024Updated last year
- RoDLA: Benchmarking the Robustness of Document Layout Analysis Models☆39Mar 26, 2025Updated 11 months ago
- ☆11Aug 3, 2023Updated 2 years ago
- PyTorch Implementation of the paper "Defining and Quantifying the Emergence of Sparse Concepts in DNNs" (CVPR 2023)☆12Dec 24, 2023Updated 2 years ago
- 🥑 Intellij plugin to optimization Vector Drawable 🥑☆11Apr 7, 2019Updated 6 years ago
- A python implementation of PROCLUS: PROjected CLUStering algorithm.☆10Jan 12, 2015Updated 11 years ago
- Genalog is an open source, cross-platform python package allowing generation of synthetic document images with custom degradations and te…☆44Jan 18, 2024Updated 2 years ago
- The official code for “DeepEraser: Deep Iterative Context Mining for Generic Text Eraser”, TMM, 2024.☆48Aug 26, 2024Updated last year
- A simple result type for TypeScript☆10Oct 1, 2025Updated 4 months ago
- Online visual analytics tool designed to investigate how attention maps in transformer models behaves, and build hypothesis on those mode…☆10Nov 10, 2021Updated 4 years ago
- Tissue-specific variant annotation☆10Nov 19, 2018Updated 7 years ago
- CliniDeID automatically de-identifies clinical text notes according to the HIPAA Safe Harbor method. It accurately finds identifiers and …☆10Aug 13, 2023Updated 2 years ago
- ☆10Aug 1, 2018Updated 7 years ago
- ☆11May 5, 2022Updated 3 years ago
- ☆100Jan 3, 2024Updated 2 years ago
- ☆12Jul 25, 2023Updated 2 years ago
- Simple Drag and Drop component of multiple UITableView written in Swift☆13Jan 6, 2023Updated 3 years ago
- ☆28Sep 10, 2025Updated 5 months ago
- Official implementation of INTERSPECCH 2022 Radio2Speech: High Quality Speech Recovery from Radio Frequency Signals☆16Sep 19, 2025Updated 5 months ago
- Android Studio Video Player☆15Apr 8, 2018Updated 7 years ago