kartikgill / taco-box
An implementation of Tiling and Corruption (TACo) Augmentations for OCR/HTR
☆15 Updated 3 years ago
Alternatives and similar repositories for taco-box:
Users that are interested in taco-box are comparing it to the libraries listed below
- ☆15 Updated 2 years ago
- OCR & Ground Truth Resources ☆74 Updated 2 years ago
- Official repository accompanying the ICDAR 2023 paper ☆11 Updated last year
- Handwritten Text Recognition in Few-shot Scenario ☆20 Updated last year
- Attention-based sequence-to-sequence model for handwritten word recognition ☆56 Updated 5 months ago
- PyTorch implementation of STR models for transfer learning in Indic Languages ☆16 Updated 3 years ago
- Create handwritten word embeddings from a text recognition Seq2Seq system. ☆10 Updated 2 years ago
- ☆72 Updated 2 years ago
- ☆22 Updated 2 years ago
- ☆80 Updated last year
- time-series row column classification ☆13 Updated 3 years ago
- CTE: Contextualized Table Extraction Dataset ☆17 Updated 2 years ago
- Code for AAAI 2023 Paper : “Alignment-Enriched Tuning for Patch-Level Pre-trained Document Image Models” ☆17 Updated 2 years ago
- AI_DocumentLayoutAnalysis ☆38 Updated 4 years ago
- ☆19 Updated 3 years ago
- ☆18 Updated last year
- ☆21 Updated 2 years ago
- ICDAR 2019 Robust Reading Challenge on Scanned Receipts OCR and Information Extraction ☆28 Updated 5 years ago
- DocReal: Robust Document Dewarping of Real-Life Images via Attention-Enhanced Control Point Prediction ☆19 Updated last year
- This repository contains the source code for SoftCTC. The original paper can be found here: https://arxiv.org/abs/2212.02135 ☆19 Updated 2 years ago
- ☆18 Updated last year
- DocBankLoader is a dataset loader for DocBank, and can convert DocBank to the Object Detection models' format. ☆23 Updated 3 years ago
- Wrapper around pixel classifier ☆9 Updated 2 years ago
- Geometric Augmentation for Text Image ☆9 Updated 4 years ago
- A dataset of region-annotated scientific articles. ☆21 Updated 5 years ago
- code for participation in ICDAR2021 Table Recognition track (Team Name: LTIAYN = Kaen Context) ☆21 Updated 3 years ago
- Code for the ICDAR2021 paper "Visual FUDGE: Form Understanding via Dynamic Graph Editing" ☆33 Updated 3 years ago
- DFKI Layout Detection for OCR-D ☆47 Updated 4 months ago
- DL models that take a document image file as input, locate the position of paragraphs, lines, images, etc. with their labels and confiden… ☆26 Updated 4 years ago
- ☆25 Updated 4 years ago