Genalog is an open source, cross-platform python package allowing generation of synthetic document images with custom degradations and text alignment capabilities.
☆357Jan 18, 2024Updated 2 years ago
Alternatives and similar repositories for genalog
Users that are interested in genalog are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Official Implementation of SynthTIGER (Synthetic Text Image Generator), ICDAR 2021☆578Jun 14, 2024Updated 2 years ago
- DIAR software for synthetic document image and groundtruth generation, with various degradation models for data augmentation☆138Nov 29, 2023Updated 2 years ago
- Official PyTorch Implementation of DocSynth: A Layout Guided Approach for Controllable Document Image Synthesis - ICDAR 2021☆93Jul 16, 2021Updated 4 years ago
- ☆43Aug 2, 2021Updated 4 years ago
- This project shows how to build a simple handwriting recognizer in Keras with the IAM dataset.☆13Aug 15, 2021Updated 4 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- A Unified Toolkit for Deep Learning Based Document Image Analysis☆5,750Aug 15, 2024Updated last year
- Active Learning for Text Classification in Python☆645May 24, 2026Updated last month
- ☆163Dec 27, 2022Updated 3 years ago
- Official PyTorch implementation of LiLT: A Simple yet Effective Language-Independent Layout Transformer for Structured Document Understan…☆366Oct 31, 2022Updated 3 years ago
- Code for the paper: Saying No is An Art: Contextualized Fallback Responses for Unanswerable Dialogue Queries☆19Nov 29, 2021Updated 4 years ago
- A synthetic data generator for text recognition☆3,682Jul 18, 2024Updated last year
- Implementation of DocFormer: End-to-End Transformer for Document Understanding, a multi-modal transformer based architecture for the task…☆290Feb 13, 2023Updated 3 years ago
- CORD: A Consolidated Receipt Dataset for Post-OCR Parsing☆482Jul 20, 2022Updated 3 years ago
- Document Layout Analysis☆406Jun 12, 2026Updated 2 weeks ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- This is an official implementation for the WTW Dataset in "Parsing Table Structures in the Wild " on table detection and table structure …☆183Sep 15, 2021Updated 4 years ago
- Synthetic Scene Text from 3D Engines☆251Aug 6, 2020Updated 5 years ago
- A data augmentations library for audio, image, text, and video.☆5,087Jun 1, 2026Updated last month
- Augmentation pipeline for rendering synthetic paper printing, faxing, scanning and copy machine processes☆551Jul 20, 2025Updated 11 months ago
- docTR (Document Text Recognition) - a seamless, high-performing & accessible library for OCR-related tasks powered by Deep Learning.☆6,156Jun 25, 2026Updated last week
- Algorithms, papers, datasets, performance comparisons for Document AI.☆209Mar 1, 2025Updated last year
- Code for the paper "PICK: Processing Key Information Extraction from Documents using Improved Graph Learning-Convolutional Networks" (ICP…☆568Jul 25, 2024Updated last year
- The official code for “DocTr: Document Image Transformer for Geometric Unwarping and Illumination Correction”, ACM MM, Oral Paper, 2021.☆434Jun 18, 2025Updated last year
- A collection of original, innovative ideas and algorithms towards Advanced Literate Machinery. This project is maintained by the OCR Team…☆1,831Mar 17, 2026Updated 3 months ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Used in M4C feature extraction script: https://github.com/facebookresearch/mmf/blob/project/m4c/projects/M4C/scripts/extract_ocr_frcn_fea…☆13Jan 30, 2020Updated 6 years ago
- ☆10May 25, 2022Updated 4 years ago
- OpenMMLab Text Detection, Recognition and Understanding Toolbox☆4,743Nov 27, 2024Updated last year
- ☆82Jun 12, 2023Updated 3 years ago
- A curated list of resources for Document Understanding (DU) topic☆1,524Jun 2, 2023Updated 3 years ago
- Dataset of PNG images from synthetically generated table layouts with annotations in JSONL files☆154Sep 17, 2025Updated 9 months ago
- ☆10Jul 27, 2018Updated 7 years ago
- XtremeDistil framework for distilling/compressing massive multilingual neural network models to tiny and efficient models for AI at scale☆157Dec 20, 2023Updated 2 years ago
- DocILE: Document Information Localization and Extraction Benchmark☆148Jun 17, 2026Updated 2 weeks ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- CodeQUEST is a generalizable framework which leverages LLMs to iteratively evaluate and enhance code quality across multiple dimensions f…☆17Feb 11, 2026Updated 4 months ago
- ICIP 2022: Adaptive Radial Projection on Fourier Magnitude Spectrum for Document Image Skew Estimation☆167May 5, 2026Updated last month
- Code and Data for ManyModalQA: Modality Disambiguation and QA over Diverse Inputs☆18Mar 2, 2020Updated 6 years ago
- Presents an optimized Apache Beam pipeline for generating sentence embeddings (runnable on Cloud Dataflow).☆20Mar 7, 2022Updated 4 years ago
- Code for the Shortformer model, from the ACL 2021 paper by Ofir Press, Noah A. Smith and Mike Lewis.☆147Jul 26, 2021Updated 4 years ago
- Alternate Implementation for Zero Shot Text Classification: Instead of reframing NLI/XNLI, this reframes the text backbone of CLIP models…☆37Apr 5, 2022Updated 4 years ago
- DocEnTr: An end-to-end document image enhancement transformer - ICPR 2022☆190Jan 17, 2025Updated last year