Genalog is an open source, cross-platform python package allowing generation of synthetic document images with custom degradations and text alignment capabilities.
☆354Jan 18, 2024Updated 2 years ago
Alternatives and similar repositories for genalog
Users that are interested in genalog are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Official Implementation of SynthTIGER (Synthetic Text Image Generator), ICDAR 2021☆575Jun 14, 2024Updated last year
- DIAR software for synthetic document image and groundtruth generation, with various degradation models for data augmentation☆136Nov 29, 2023Updated 2 years ago
- Official PyTorch Implementation of DocSynth: A Layout Guided Approach for Controllable Document Image Synthesis - ICDAR 2021☆92Jul 16, 2021Updated 4 years ago
- ☆44Aug 2, 2021Updated 4 years ago
- This project shows how to build a simple handwriting recognizer in Keras with the IAM dataset.☆13Aug 15, 2021Updated 4 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Companion Repo for the Vision Language Modelling YouTube series - https://bit.ly/3PsbsC2 - by Prithivi Da. Open to PRs and collaborations☆14Aug 16, 2022Updated 3 years ago
- A Unified Toolkit for Deep Learning Based Document Image Analysis☆5,711Aug 15, 2024Updated last year
- Active Learning for Text Classification in Python☆638Apr 1, 2026Updated 2 weeks ago
- ☆161Dec 27, 2022Updated 3 years ago
- Official PyTorch implementation of LiLT: A Simple yet Effective Language-Independent Layout Transformer for Structured Document Understan…☆363Oct 31, 2022Updated 3 years ago
- Code for the paper: Saying No is An Art: Contextualized Fallback Responses for Unanswerable Dialogue Queries☆19Nov 29, 2021Updated 4 years ago
- A synthetic data generator for text recognition☆3,660Jul 18, 2024Updated last year
- Implementation of DocFormer: End-to-End Transformer for Document Understanding, a multi-modal transformer based architecture for the task…☆287Feb 13, 2023Updated 3 years ago
- CORD: A Consolidated Receipt Dataset for Post-OCR Parsing☆472Jul 20, 2022Updated 3 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Document Layout Analysis☆403Mar 27, 2026Updated 3 weeks ago
- This is an official implementation for the WTW Dataset in "Parsing Table Structures in the Wild " on table detection and table structure …☆183Sep 15, 2021Updated 4 years ago
- Synthetic Scene Text from 3D Engines☆251Aug 6, 2020Updated 5 years ago
- A data augmentations library for audio, image, text, and video.☆5,078Updated this week
- Augmentation pipeline for rendering synthetic paper printing, faxing, scanning and copy machine processes☆523Jul 20, 2025Updated 8 months ago
- docTR (Document Text Recognition) - a seamless, high-performing & accessible library for OCR-related tasks powered by Deep Learning.☆6,006Mar 29, 2026Updated 2 weeks ago
- Algorithms, papers, datasets, performance comparisons for Document AI.☆206Mar 1, 2025Updated last year
- Code for the paper "PICK: Processing Key Information Extraction from Documents using Improved Graph Learning-Convolutional Networks" (ICP…☆569Jul 25, 2024Updated last year
- The official code for “DocTr: Document Image Transformer for Geometric Unwarping and Illumination Correction”, ACM MM, Oral Paper, 2021.☆425Jun 18, 2025Updated 9 months ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- A collection of original, innovative ideas and algorithms towards Advanced Literate Machinery. This project is maintained by the OCR Team…☆1,823Mar 17, 2026Updated last month
- Used in M4C feature extraction script: https://github.com/facebookresearch/mmf/blob/project/m4c/projects/M4C/scripts/extract_ocr_frcn_fea…☆13Jan 30, 2020Updated 6 years ago
- OpenMMLab Text Detection, Recognition and Understanding Toolbox☆4,728Nov 27, 2024Updated last year
- ☆82Jun 12, 2023Updated 2 years ago
- A curated list of resources for Document Understanding (DU) topic☆1,509Jun 2, 2023Updated 2 years ago
- Dataset of PNG images from synthetically generated table layouts with annotations in JSONL files☆152Sep 17, 2025Updated 7 months ago
- ☆10Jul 27, 2018Updated 7 years ago
- XtremeDistil framework for distilling/compressing massive multilingual neural network models to tiny and efficient models for AI at scale☆157Dec 20, 2023Updated 2 years ago
- ICIP 2022: Adaptive Radial Projection on Fourier Magnitude Spectrum for Document Image Skew Estimation☆155May 14, 2025Updated 11 months ago
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- DocILE: Document Information Localization and Extraction Benchmark☆145May 15, 2024Updated last year
- CodeQUEST is a generalizable framework which leverages LLMs to iteratively evaluate and enhance code quality across multiple dimensions f…☆17Feb 11, 2026Updated 2 months ago
- Code and Data for ManyModalQA: Modality Disambiguation and QA over Diverse Inputs☆17Mar 2, 2020Updated 6 years ago
- Presents an optimized Apache Beam pipeline for generating sentence embeddings (runnable on Cloud Dataflow).☆20Mar 7, 2022Updated 4 years ago
- Code for the Shortformer model, from the ACL 2021 paper by Ofir Press, Noah A. Smith and Mike Lewis.☆147Jul 26, 2021Updated 4 years ago
- Alternate Implementation for Zero Shot Text Classification: Instead of reframing NLI/XNLI, this reframes the text backbone of CLIP models…☆37Apr 5, 2022Updated 4 years ago
- DocEnTr: An end-to-end document image enhancement transformer - ICPR 2022☆186Jan 17, 2025Updated last year