Genalog is an open source, cross-platform python package allowing generation of synthetic document images with custom degradations and text alignment capabilities.
☆356Jan 18, 2024Updated 2 years ago
Alternatives and similar repositories for genalog
Users that are interested in genalog are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Official Implementation of SynthTIGER (Synthetic Text Image Generator), ICDAR 2021☆576Jun 14, 2024Updated last year
- DIAR software for synthetic document image and groundtruth generation, with various degradation models for data augmentation☆137Nov 29, 2023Updated 2 years ago
- Official PyTorch Implementation of DocSynth: A Layout Guided Approach for Controllable Document Image Synthesis - ICDAR 2021☆93Jul 16, 2021Updated 4 years ago
- ☆44Aug 2, 2021Updated 4 years ago
- This project shows how to build a simple handwriting recognizer in Keras with the IAM dataset.☆13Aug 15, 2021Updated 4 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Companion Repo for the Vision Language Modelling YouTube series - https://bit.ly/3PsbsC2 - by Prithivi Da. Open to PRs and collaborations☆14Aug 16, 2022Updated 3 years ago
- A Unified Toolkit for Deep Learning Based Document Image Analysis☆5,738Aug 15, 2024Updated last year
- Active Learning for Text Classification in Python☆644May 24, 2026Updated 2 weeks ago
- ☆163Dec 27, 2022Updated 3 years ago
- Official PyTorch implementation of LiLT: A Simple yet Effective Language-Independent Layout Transformer for Structured Document Understan…☆366Oct 31, 2022Updated 3 years ago
- A synthetic data generator for text recognition☆3,676Jul 18, 2024Updated last year
- Implementation of DocFormer: End-to-End Transformer for Document Understanding, a multi-modal transformer based architecture for the task…☆289Feb 13, 2023Updated 3 years ago
- CORD: A Consolidated Receipt Dataset for Post-OCR Parsing☆481Jul 20, 2022Updated 3 years ago
- Document Layout Analysis☆405May 28, 2026Updated 2 weeks ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- This is an official implementation for the WTW Dataset in "Parsing Table Structures in the Wild " on table detection and table structure …☆183Sep 15, 2021Updated 4 years ago
- Synthetic Scene Text from 3D Engines☆251Aug 6, 2020Updated 5 years ago
- A data augmentations library for audio, image, text, and video.☆5,084Jun 1, 2026Updated last week
- Augmentation pipeline for rendering synthetic paper printing, faxing, scanning and copy machine processes☆545Jul 20, 2025Updated 10 months ago
- docTR (Document Text Recognition) - a seamless, high-performing & accessible library for OCR-related tasks powered by Deep Learning.☆6,117Updated this week
- Algorithms, papers, datasets, performance comparisons for Document AI.☆209Mar 1, 2025Updated last year
- Code for the paper "PICK: Processing Key Information Extraction from Documents using Improved Graph Learning-Convolutional Networks" (ICP…☆569Jul 25, 2024Updated last year
- The official code for “DocTr: Document Image Transformer for Geometric Unwarping and Illumination Correction”, ACM MM, Oral Paper, 2021.☆433Jun 18, 2025Updated 11 months ago
- A collection of original, innovative ideas and algorithms towards Advanced Literate Machinery. This project is maintained by the OCR Team…☆1,829Mar 17, 2026Updated 2 months ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Used in M4C feature extraction script: https://github.com/facebookresearch/mmf/blob/project/m4c/projects/M4C/scripts/extract_ocr_frcn_fea…☆13Jan 30, 2020Updated 6 years ago
- ☆10May 25, 2022Updated 4 years ago
- OpenMMLab Text Detection, Recognition and Understanding Toolbox☆4,737Nov 27, 2024Updated last year
- ☆81Jun 12, 2023Updated 2 years ago
- A curated list of resources for Document Understanding (DU) topic☆1,519Jun 2, 2023Updated 3 years ago
- Dataset of PNG images from synthetically generated table layouts with annotations in JSONL files☆154Sep 17, 2025Updated 8 months ago
- ☆10Jul 27, 2018Updated 7 years ago
- XtremeDistil framework for distilling/compressing massive multilingual neural network models to tiny and efficient models for AI at scale☆157Dec 20, 2023Updated 2 years ago
- DocILE: Document Information Localization and Extraction Benchmark☆146May 15, 2024Updated 2 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- ICIP 2022: Adaptive Radial Projection on Fourier Magnitude Spectrum for Document Image Skew Estimation☆167May 5, 2026Updated last month
- Code and Data for ManyModalQA: Modality Disambiguation and QA over Diverse Inputs☆18Mar 2, 2020Updated 6 years ago
- Presents an optimized Apache Beam pipeline for generating sentence embeddings (runnable on Cloud Dataflow).☆20Mar 7, 2022Updated 4 years ago
- Code for the Shortformer model, from the ACL 2021 paper by Ofir Press, Noah A. Smith and Mike Lewis.☆147Jul 26, 2021Updated 4 years ago
- Alternate Implementation for Zero Shot Text Classification: Instead of reframing NLI/XNLI, this reframes the text backbone of CLIP models…☆37Apr 5, 2022Updated 4 years ago
- DocEnTr: An end-to-end document image enhancement transformer - ICPR 2022☆188Jan 17, 2025Updated last year
- DocBank: A Benchmark Dataset for Document Layout Analysis☆646Aug 12, 2024Updated last year