microsoft/genalog

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/microsoft/genalog)

microsoft / genalog

Genalog is an open source, cross-platform python package allowing generation of synthetic document images with custom degradations and text alignment capabilities.

☆358

Alternatives and similar repositories for genalog

Users that are interested in genalog are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

clovaai / synthtiger
View on GitHub
Official Implementation of SynthTIGER (Synthetic Text Image Generator), ICDAR 2021
☆579Jun 14, 2024Updated 2 years ago
DocCreator / DocCreator
View on GitHub
DIAR software for synthetic document image and groundtruth generation, with various degradation models for data augmentation
☆138Nov 29, 2023Updated 2 years ago
biswassanket / synth_doc_generation
View on GitHub
Official PyTorch Implementation of DocSynth: A Layout Guided Approach for Controllable Document Image Synthesis - ICDAR 2021
☆93Jul 16, 2021Updated 5 years ago
gchhablani / multilingual-image-captioning
View on GitHub
☆43Aug 2, 2021Updated 4 years ago
sayakpaul / Handwriting-Recognizer-in-Keras
View on GitHub
This project shows how to build a simple handwriting recognizer in Keras with the IAM dataset.
☆13Aug 15, 2021Updated 4 years ago
Deploy open-source AI quickly and easily - Special Bonus Offer • Ad
Runpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
PrithivirajDamodaran / vision-language-modelling-series
View on GitHub
Companion Repo for the Vision Language Modelling YouTube series - https://bit.ly/3PsbsC2 - by Prithivi Da. Open to PRs and collaborations
☆14Aug 16, 2022Updated 3 years ago
webis-de / small-text
View on GitHub
Active Learning for Text Classification in Python
☆646May 24, 2026Updated 2 months ago
Layout-Parser / layout-parser
View on GitHub
A Unified Toolkit for Deep Learning Based Document Image Analysis
☆5,765Aug 15, 2024Updated last year
jpWang / LiLT
View on GitHub
Official PyTorch implementation of LiLT: A Simple yet Effective Language-Independent Layout Transformer for Structured Document Understan…
☆366Oct 31, 2022Updated 3 years ago
clovaai / bros
View on GitHub
☆163Dec 27, 2022Updated 3 years ago
kaustubhdhole / natural-dont-know
View on GitHub
Code for the paper: Saying No is An Art: Contextualized Fallback Responses for Unanswerable Dialogue Queries
☆19Nov 29, 2021Updated 4 years ago
Belval / TextRecognitionDataGenerator
View on GitHub
A synthetic data generator for text recognition
☆3,681Jul 18, 2024Updated 2 years ago
shabie / docformer
View on GitHub
Implementation of DocFormer: End-to-End Transformer for Document Understanding, a multi-modal transformer based architecture for the task…
☆290Feb 13, 2023Updated 3 years ago
clovaai / cord
View on GitHub
CORD: A Consolidated Receipt Dataset for Post-OCR Parsing
☆485Jul 20, 2022Updated 4 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
qurator-spk / eynollah
View on GitHub
Document Layout Analysis
☆408Updated this week
longshangbang / UnrealText
View on GitHub
Synthetic Scene Text from 3D Engines
☆251Aug 6, 2020Updated 5 years ago
wangwen-whu / WTW-Dataset
View on GitHub
This is an official implementation for the WTW Dataset in "Parsing Table Structures in the Wild " on table detection and table structure …
☆184Sep 15, 2021Updated 4 years ago
sparkfish / augraphy
View on GitHub
Augmentation pipeline for rendering synthetic paper printing, faxing, scanning and copy machine processes
☆561Jul 20, 2025Updated last year
SCUT-DLVCLab / Document-AI-Recommendations
View on GitHub
Algorithms, papers, datasets, performance comparisons for Document AI.
☆209Mar 1, 2025Updated last year
facebookresearch / AugLy
View on GitHub
A data augmentations library for audio, image, text, and video.
☆5,087Jul 16, 2026Updated last week
mindee / doctr
View on GitHub
docTR (Document Text Recognition) - a seamless, high-performing & accessible library for OCR-related tasks powered by Deep Learning. Ongo…
☆6,190Updated this week
ronghanghu / vqa-maskrcnn-benchmark-m4c
View on GitHub
Used in M4C feature extraction script: https://github.com/facebookresearch/mmf/blob/project/m4c/projects/M4C/scripts/extract_ocr_frcn_fea…
☆13Jan 30, 2020Updated 6 years ago
wenwenyu / PICK-pytorch
View on GitHub
Code for the paper "PICK: Processing Key Information Extraction from Documents using Improved Graph Learning-Convolutional Networks" (ICP…
☆568Jul 25, 2024Updated last year
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
AlibabaResearch / AdvancedLiterateMachinery
View on GitHub
A collection of original, innovative ideas and algorithms towards Advanced Literate Machinery. This project is maintained by the OCR Team…
☆1,833Mar 17, 2026Updated 4 months ago
fh2019ustc / DocTr
View on GitHub
The official code for “DocTr: Document Image Transformer for Geometric Unwarping and Illumination Correction”, ACM MM, Oral Paper, 2021.
☆436Jul 10, 2026Updated 2 weeks ago
clovaai / spade
View on GitHub
☆82Jun 12, 2023Updated 3 years ago
open-mmlab / mmocr
View on GitHub
OpenMMLab Text Detection, Recognition and Understanding Toolbox
☆4,747Nov 27, 2024Updated last year
tstanislawek / awesome-document-understanding
View on GitHub
A curated list of resources for Document Understanding (DU) topic
☆1,525Jun 2, 2023Updated 3 years ago
AkshitaJha / NLP_CSS_2017
View on GitHub
☆10Jul 27, 2018Updated 7 years ago
microsoft / xtreme-distil-transformers
View on GitHub
XtremeDistil framework for distilling/compressing massive multilingual neural network models to tiny and efficient models for AI at scale
☆157Dec 20, 2023Updated 2 years ago
MetricsDI / DIMetrics
View on GitHub
☆10May 25, 2022Updated 4 years ago
IBM / SynthTabNet
View on GitHub
Dataset of PNG images from synthetically generated table layouts with annotations in JSONL files
☆154Sep 17, 2025Updated 10 months ago
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
phamquiluan / jdeskew
View on GitHub
ICIP 2022: Adaptive Radial Projection on Fourier Magnitude Spectrum for Document Image Skew Estimation
☆168May 5, 2026Updated 2 months ago
rossumai / docile
View on GitHub
DocILE: Document Information Localization and Extraction Benchmark
☆149Jun 17, 2026Updated last month
carted / processing-text-data
View on GitHub
Presents an optimized Apache Beam pipeline for generating sentence embeddings (runnable on Cloud Dataflow).
☆20Mar 7, 2022Updated 4 years ago
PrithivirajDamodaran / Alt-ZSC
View on GitHub
Alternate Implementation for Zero Shot Text Classification: Instead of reframing NLI/XNLI, this reframes the text backbone of CLIP models…
☆37Apr 5, 2022Updated 4 years ago
ofirpress / shortformer
View on GitHub
Code for the Shortformer model, from the ACL 2021 paper by Ofir Press, Noah A. Smith and Mike Lewis.
☆147Jul 26, 2021Updated 4 years ago
N-Almarwani / DCT_Sentence_Embedding
View on GitHub
Efficient-Sentence-Embedding-using-Discrete-Cosine-Transform
☆17Jul 2, 2020Updated 6 years ago
doc-analysis / DocBank
View on GitHub
DocBank: A Benchmark Dataset for Document Layout Analysis
☆653Aug 12, 2024Updated last year