kartikgill / taco-boxLinks
An implementation of Tiling and Corruption (TACo) Augmentations for OCR/HTR
☆15Updated 3 years ago
Alternatives and similar repositories for taco-box
Users that are interested in taco-box are comparing it to the libraries listed below
Sorting:
- ☆15Updated 2 years ago
- PyTorch implementation of STR models for transfer learning in Indic Languages☆16Updated 3 years ago
- ☆18Updated 2 years ago
- OCR & Ground Truth Resources☆76Updated 3 years ago
- Code for the ICDAR2021 paper "Visual FUDGE: Form Understanding via Dynamic Graph Editing"☆33Updated 3 years ago
- Official repository accompaying the ICDAR 2023 paper☆12Updated last year
- Create handwritten word embeddings from a text recognition Seq2Seq system.☆10Updated 2 years ago
- ☆18Updated 2 years ago
- DFKI Layout Detection for OCR-D☆47Updated last month
- ICDAR 2019 Robust Reading Challenge on Scanned Receipts OCR and Information Extraction☆27Updated 6 years ago
- ☆3Updated 2 weeks ago
- ☆25Updated 5 years ago
- DocBankLoader is a dataset loader for DocBank, and can convert DocBank to the Object Detection models' format.☆24Updated 4 years ago
- This repository contains source codes for SoftCTC. Original paper can be found here: https://arxiv.org/abs/2212.02135☆19Updated 2 years ago
- Attention-based sequence-to-sequence model for handwritten word recognition☆59Updated 9 months ago
- Code for AAAI 2023 Paper : “Alignment-Enriched Tuning for Patch-Level Pre-trained Document Image Models”☆18Updated 2 years ago
- ☆1Updated last week
- ☆23Updated 2 years ago
- A dataset of region-annotated scientific articles.☆21Updated 5 years ago
- [ICDAR 2023] (Oral) An End-to-End Unified Domain Adaptive Transformer for Document Instance Segmentation☆73Updated 9 months ago
- Easter2.0: IMPROVING CONVOLUTIONAL MODELS FOR HANDWRITTEN TEXT RECOGNITION☆80Updated 2 years ago
- ☆3Updated 2 weeks ago
- CTE: Contextualized Table Extraction Dataset☆17Updated 2 years ago
- Source code for the paper "Post-OCR Document Correction with Large Ensembles of Character Sequence-to-Sequence Models"☆37Updated last year
- Post-processing OCR errors with seq2seq models☆28Updated 4 years ago
- [ICDAR 2023] SelfDocSeg: A self-supervised vision-based approach towards Document Segmentation (Oral)☆42Updated last year
- Dense Article Dataset (DAD): A Benchmark Dataset for Document Layout Analysis☆16Updated 3 years ago
- Geometric Augmentation for Text Image☆9Updated 5 years ago
- code for participation in ICDAR2021 Table Recognition track (Team Name: LTIAYN = Kaen Context)☆21Updated 4 years ago
- ☆20Updated 3 years ago