An implementation of Tiling and Corruption (TACo) Augmentations for OCR/HTR
☆15Dec 4, 2021Updated 4 years ago
Alternatives and similar repositories for taco-box
Users that are interested in taco-box are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- This repository contains source codes for SoftCTC. Original paper can be found here: https://arxiv.org/abs/2212.02135☆19Mar 7, 2023Updated 3 years ago
- Create handwritten word embeddings from a text recognition Seq2Seq system.☆11Dec 1, 2022Updated 3 years ago
- ☆18Apr 11, 2023Updated 3 years ago
- ☆18May 28, 2025Updated 10 months ago
- Implementation of the paper "Binaural Sound Source Distance Estimation and Localization for a Moving Listener"☆17Mar 2, 2025Updated last year
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Searching a High Performance Feature Extractor for Text Recognition Network. TPAMI 2022☆13Nov 25, 2022Updated 3 years ago
- Official repository accompaying the ICDAR 2023 paper☆13Oct 3, 2023Updated 2 years ago
- HRTF database of HATS from KAIST☆19Feb 28, 2023Updated 3 years ago
- ☆16Feb 16, 2023Updated 3 years ago
- Source code for the paper "Post-OCR Document Correction with Large Ensembles of Character Sequence-to-Sequence Models"☆39Dec 2, 2023Updated 2 years ago
- Read Ten Lines at One Glance: Line-Aware Semi-Autoregressive Transformer for Multi-Line Handwritten Mathematical Expression Recognition☆28Aug 29, 2023Updated 2 years ago
- PyTorch implementation of BMVC2022 paper Masked Vision-Language Transformers for Scene Text Recognition☆29Nov 11, 2022Updated 3 years ago
- [ICME 2023] FlowText: Synthesizing Realistic Scene Text Video with Optical Flow Estimation☆13May 13, 2023Updated 2 years ago
- Cross-lingual learning in scene text recognition (ICASSP2024)☆18Sep 29, 2024Updated last year
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Extracts per-sentence subtitles + audio from a subtitle file + video file.☆12Oct 1, 2019Updated 6 years ago
- This project aims to generate syntactichandwritten mathematical expression. The dataset is generated from the CROHME 2014 training set.☆14Feb 24, 2022Updated 4 years ago
- Two-Step Approach to OCR Post-Correction☆14May 24, 2024Updated last year
- ☆18Jul 9, 2024Updated last year
- Intuitive interface for fine-tuning and retraining a Tesseract OCR language model☆10Jul 4, 2025Updated 9 months ago
- Official PyTorch implementation of `[ACMMM 2023]Relational Contrastive Learning for Scene Text Recognition`☆17Sep 22, 2023Updated 2 years ago
- The official code of Linguistic More: Taking a Further Step toward Efficient and Accurate Scene Text Recognition (IJCAI2023)☆27Sep 3, 2023Updated 2 years ago
- Multimodal Semi-Supervised Learning for Text Recognition (SemiMTR)☆83Sep 12, 2023Updated 2 years ago
- Cornell INFO 3350: Text mining for history and literature, Fall 2020☆10Jan 14, 2021Updated 5 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Tropy plugin for exporting items into Omeka☆11Apr 20, 2023Updated 2 years ago
- The source codes of TDv2 in paper: TDv2: A Novel Tree-Structured Decoder for Offline Mathematical Expression Recognition.☆12Jul 28, 2022Updated 3 years ago
- TextAdaIN: Paying Attention to Shortcut Learning in Text Recognizers☆21Jul 26, 2022Updated 3 years ago
- ☆14Oct 21, 2022Updated 3 years ago
- ☆44Feb 7, 2023Updated 3 years ago
- Public implementation of our CVPR Paper "OrigamiNet: Weakly-Supervised, Segmentation-Free, One-Step, Full Page TextRecognition by learnin…☆147Oct 12, 2021Updated 4 years ago
- Attention-based sequence-to-sequence model for handwritten word recognition☆65Sep 22, 2024Updated last year
- Tools for normalizing the use of some characters and checking file consistencies☆11Jan 12, 2026Updated 3 months ago
- Image readout, processing and SLAM library☆11Jun 3, 2022Updated 3 years ago
- Serverless GPU API endpoints on Runpod - Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- ☆21Jul 24, 2019Updated 6 years ago
- Enable RNNLM lattice rescoring with Pytorch [kaldi]☆12Jun 5, 2020Updated 5 years ago
- Conversions between various OCR formats☆84Feb 13, 2026Updated 2 months ago
- BibTeX files with information about all publications from the annual International Conference on New Interfaces for Musical Expression …☆33Dec 20, 2025Updated 3 months ago
- VIAF via Python☆13Updated this week
- ☆19Mar 28, 2022Updated 4 years ago
- A scikit-learn compliant implementation of Monroe et al.'s Fightin' Words analysis method.☆11Mar 10, 2019Updated 7 years ago