kartikgill/taco-box

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/kartikgill/taco-box)

kartikgill / taco-box

An implementation of Tiling and Corruption (TACo) Augmentations for OCR/HTR

☆15

Alternatives and similar repositories for taco-box

Users that are interested in taco-box are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

kartikgill / Autoencoders
View on GitHub
☆12Jan 25, 2021Updated 5 years ago
kartikgill / The-GAN-Book
View on GitHub
The GAN Book: Train stable Generative Adversarial Networks using TensorFlow2, Keras and Python.
☆21Apr 12, 2024Updated 2 years ago
DCGM / SoftCTC
View on GitHub
This repository contains source codes for SoftCTC. Original paper can be found here: https://arxiv.org/abs/2212.02135
☆19Mar 7, 2023Updated 3 years ago
PacktPublishing / The-Definitive-Guide-to-Google-Vertex-AI
View on GitHub
☆32Mar 2, 2026Updated 4 months ago
georgeretsi / Seq2Emb
View on GitHub
Create handwritten word embeddings from a text recognition Seq2Seq system.
☆11Dec 1, 2022Updated 3 years ago
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
kartikgill / Easter2
View on GitHub
Easter2.0: IMPROVING CONVOLUTIONAL MODELS FOR HANDWRITTEN TEXT RECOGNITION
☆79Apr 25, 2023Updated 3 years ago
AprilYapingZhang / awesome-ocr
View on GitHub
☆18Apr 11, 2023Updated 3 years ago
kartikgill / TF2-Keras-GAN-Notebooks
View on GitHub
Generative Adversarial Networks with TensorFlow2, Keras and Python (Jupyter Notebooks Implementations)
☆37Dec 19, 2021Updated 4 years ago
anabild / mlops
View on GitHub
MLOPS examples
☆12Mar 22, 2023Updated 3 years ago
LARS-research / TREFE
View on GitHub
Searching a High Performance Feature Extractor for Text Recognition Network. TPAMI 2022
☆13Nov 25, 2022Updated 3 years ago
ihdia / seamformer
View on GitHub
Official repository accompaying the ICDAR 2023 paper
☆14Oct 3, 2023Updated 2 years ago
FactoDeepLearning / LinePytorchOCR
View on GitHub
☆17Feb 16, 2023Updated 3 years ago
jarobyte91 / post_ocr_correction
View on GitHub
Source code for the paper "Post-OCR Document Correction with Large Ensembles of Character Sequence-to-Sequence Models"
☆39Dec 2, 2023Updated 2 years ago
HCIILAB / LAST
View on GitHub
Read Ten Lines at One Glance: Line-Aware Semi-Autoregressive Transformer for Multi-Line Handwritten Mathematical Expression Recognition
☆28Aug 29, 2023Updated 2 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
onealwj / MVLT
View on GitHub
PyTorch implementation of BMVC2022 paper Masked Vision-Language Transformers for Scene Text Recognition
☆28Nov 11, 2022Updated 3 years ago
callsys / FlowText
View on GitHub
[ICME 2023] FlowText: Synthesizing Realistic Scene Text Video with Optical Flow Estimation
☆13May 13, 2023Updated 3 years ago
Pay20Y / PIMNet
View on GitHub
☆16Jan 30, 2022Updated 4 years ago
cessen / subs_extract
View on GitHub
Extracts per-sentence subtitles + audio from a subtitle file + video file.
☆12Oct 1, 2019Updated 6 years ago
thanhnghiadk / syntactic_HME_generation
View on GitHub
This project aims to generate syntactichandwritten mathematical expression. The dataset is generated from the CROHME 2014 training set.
☆14Feb 24, 2022Updated 4 years ago
bingo-todd / WaveLoc
View on GitHub
End-to-End binaural sound localization
☆17Feb 27, 2020Updated 6 years ago
samanthadoran / effective-guacamole
View on GitHub
☆10Jul 4, 2022Updated 4 years ago
Xiaomeng-Yang / STR_benchmark_cleansed
View on GitHub
☆14May 26, 2023Updated 3 years ago
SII-sc22mc / DocFusion
View on GitHub
A Unified Framework for Document Parsing Tasks (Including Document Layout Analysis, OCR, Formula Recognition, and Table Recognition)
☆15Jul 1, 2025Updated last year
Bare Metal GPUs on DigitalOcean Gradient AI • Ad
Purpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
EDM-Research / VATr-pp
View on GitHub
☆18Jul 9, 2024Updated 2 years ago
qurator-spk / sbb_ocr_postcorrection
View on GitHub
Two-Step Approach to OCR Post-Correction
☆14May 24, 2024Updated 2 years ago
ThunderVVV / RCLSTR
View on GitHub
Official PyTorch implementation of `[ACMMM 2023]Relational Contrastive Learning for Scene Text Recognition`
☆17Sep 22, 2023Updated 2 years ago
dshea89 / tesseract-retraining-pipeline
View on GitHub
Intuitive interface for fine-tuning and retraining a Tesseract OCR language model
☆10Jul 4, 2025Updated last year
CyrilSterling / LPV
View on GitHub
The official code of Linguistic More: Taking a Further Step toward Efficient and Accurate Scene Text Recognition (IJCAI2023)
☆26Sep 3, 2023Updated 2 years ago
Alpha-Innovator / DocParser
View on GitHub
☆18Jan 13, 2025Updated last year
amazon-science / semimtr-text-recognition
View on GitHub
Multimodal Semi-Supervised Learning for Text Recognition (SemiMTR)
☆83Sep 12, 2023Updated 2 years ago
ShengKuangCN / BAST
View on GitHub
☆18May 28, 2025Updated last year
han-saram / HRTF-HATS-KAIST
View on GitHub
HRTF database of HATS from KAIST
☆19Feb 28, 2023Updated 3 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
yqingli123 / TDv2
View on GitHub
The source codes of TDv2 in paper: TDv2: A Novel Tree-Structured Decoder for Offline Mathematical Expression Recognition.
☆12Jul 28, 2022Updated 4 years ago
tropy / tropy-plugin-omeka
View on GitHub
Tropy plugin for exporting items into Omeka
☆11Apr 20, 2023Updated 3 years ago
amazon-science / textadain-robust-recognition
View on GitHub
TextAdaIN: Paying Attention to Shortcut Learning in Text Recognizers
☆21Jul 26, 2022Updated 4 years ago
Qiangest / DeepEar
View on GitHub
DeepEar: Sound Localization with Binaural Microphones
☆16Nov 20, 2025Updated 8 months ago
ugent-library / mmmonk-annotation-demo
View on GitHub
☆14Oct 21, 2022Updated 3 years ago
dmitrijsk / AttentionHTR
View on GitHub
Attention-based sequence-to-sequence model for handwritten word recognition
☆65Sep 22, 2024Updated last year
IntuitionMachines / OrigamiNet
View on GitHub
Public implementation of our CVPR Paper "OrigamiNet: Weakly-Supervised, Segmentation-Free, One-Step, Full Page TextRecognition by learnin…
☆147Oct 12, 2021Updated 4 years ago