Genalog is an open source, cross-platform python package allowing generation of synthetic document images with custom degradations and text alignment capabilities.
☆355Jan 18, 2024Updated 2 years ago
Alternatives and similar repositories for genalog
Users that are interested in genalog are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Official Implementation of SynthTIGER (Synthetic Text Image Generator), ICDAR 2021☆574Jun 14, 2024Updated last year
- DIAR software for synthetic document image and groundtruth generation, with various degradation models for data augmentation☆137Nov 29, 2023Updated 2 years ago
- Official PyTorch Implementation of DocSynth: A Layout Guided Approach for Controllable Document Image Synthesis - ICDAR 2021☆93Jul 16, 2021Updated 4 years ago
- ☆44Aug 2, 2021Updated 4 years ago
- This project shows how to build a simple handwriting recognizer in Keras with the IAM dataset.☆13Aug 15, 2021Updated 4 years ago
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- Companion Repo for the Vision Language Modelling YouTube series - https://bit.ly/3PsbsC2 - by Prithivi Da. Open to PRs and collaborations☆14Aug 16, 2022Updated 3 years ago
- A Unified Toolkit for Deep Learning Based Document Image Analysis☆5,731Aug 15, 2024Updated last year
- Active Learning for Text Classification in Python☆643Updated this week
- ☆162Dec 27, 2022Updated 3 years ago
- Official PyTorch implementation of LiLT: A Simple yet Effective Language-Independent Layout Transformer for Structured Document Understan…☆364Oct 31, 2022Updated 3 years ago
- Code for the paper: Saying No is An Art: Contextualized Fallback Responses for Unanswerable Dialogue Queries☆19Nov 29, 2021Updated 4 years ago
- A synthetic data generator for text recognition☆3,672Jul 18, 2024Updated last year
- Implementation of DocFormer: End-to-End Transformer for Document Understanding, a multi-modal transformer based architecture for the task…☆288Feb 13, 2023Updated 3 years ago
- CORD: A Consolidated Receipt Dataset for Post-OCR Parsing☆478Jul 20, 2022Updated 3 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Document Layout Analysis☆402May 11, 2026Updated last week
- This is an official implementation for the WTW Dataset in "Parsing Table Structures in the Wild " on table detection and table structure …☆183Sep 15, 2021Updated 4 years ago
- Synthetic Scene Text from 3D Engines☆251Aug 6, 2020Updated 5 years ago
- A data augmentations library for audio, image, text, and video.☆5,083May 15, 2026Updated last week
- Augmentation pipeline for rendering synthetic paper printing, faxing, scanning and copy machine processes☆536Jul 20, 2025Updated 10 months ago
- docTR (Document Text Recognition) - a seamless, high-performing & accessible library for OCR-related tasks powered by Deep Learning.☆6,094May 12, 2026Updated last week
- Algorithms, papers, datasets, performance comparisons for Document AI.☆208Mar 1, 2025Updated last year
- Code for the paper "PICK: Processing Key Information Extraction from Documents using Improved Graph Learning-Convolutional Networks" (ICP…☆569Jul 25, 2024Updated last year
- The official code for “DocTr: Document Image Transformer for Geometric Unwarping and Illumination Correction”, ACM MM, Oral Paper, 2021.☆428Jun 18, 2025Updated 11 months ago
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- A collection of original, innovative ideas and algorithms towards Advanced Literate Machinery. This project is maintained by the OCR Team…☆1,828Mar 17, 2026Updated 2 months ago
- Used in M4C feature extraction script: https://github.com/facebookresearch/mmf/blob/project/m4c/projects/M4C/scripts/extract_ocr_frcn_fea…☆13Jan 30, 2020Updated 6 years ago
- ☆10May 25, 2022Updated 3 years ago
- OpenMMLab Text Detection, Recognition and Understanding Toolbox☆4,733Nov 27, 2024Updated last year
- ☆81Jun 12, 2023Updated 2 years ago
- A curated list of resources for Document Understanding (DU) topic☆1,514Jun 2, 2023Updated 2 years ago
- Dataset of PNG images from synthetically generated table layouts with annotations in JSONL files☆153Sep 17, 2025Updated 8 months ago
- ☆10Jul 27, 2018Updated 7 years ago
- XtremeDistil framework for distilling/compressing massive multilingual neural network models to tiny and efficient models for AI at scale☆157Dec 20, 2023Updated 2 years ago
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- DocILE: Document Information Localization and Extraction Benchmark☆147May 15, 2024Updated 2 years ago
- CodeQUEST is a generalizable framework which leverages LLMs to iteratively evaluate and enhance code quality across multiple dimensions f…☆17Feb 11, 2026Updated 3 months ago
- ICIP 2022: Adaptive Radial Projection on Fourier Magnitude Spectrum for Document Image Skew Estimation☆167May 5, 2026Updated 2 weeks ago
- Code and Data for ManyModalQA: Modality Disambiguation and QA over Diverse Inputs☆18Mar 2, 2020Updated 6 years ago
- Presents an optimized Apache Beam pipeline for generating sentence embeddings (runnable on Cloud Dataflow).☆20Mar 7, 2022Updated 4 years ago
- Code for the Shortformer model, from the ACL 2021 paper by Ofir Press, Noah A. Smith and Mike Lewis.☆147Jul 26, 2021Updated 4 years ago
- Alternate Implementation for Zero Shot Text Classification: Instead of reframing NLI/XNLI, this reframes the text backbone of CLIP models…☆37Apr 5, 2022Updated 4 years ago