Code for AAAI 2023 Paper: “Alignment-Enriched Tuning for Patch-Level Pre-trained Document Image Models”
☆18Dec 6, 2022Updated 3 years ago
Alternatives and similar repositories for AET
Users that are interested in AET are comparing it to the libraries listed below
- Contrast-guided Feature Adjustment Module for Visual Information Extraction☆30May 23, 2023Updated 2 years ago
- My Implementation for the paper EDA: Easy Data Augmentation Techniques for Boosting Performance on Text Classification Tasks using Tensor…☆12Mar 18, 2022Updated 3 years ago
- ☆16Jan 30, 2022Updated 4 years ago
- ☆14May 26, 2023Updated 2 years ago
- An implementation of Tiling and Corruption (TACo) Augmentations for OCR/HTR☆15Dec 4, 2021Updated 4 years ago
- ☆42Feb 7, 2023Updated 3 years ago
- Library for converting from RGB / GrayScale image to base64 and back.☆19Sep 19, 2022Updated 3 years ago
- ☆24Sep 2, 2022Updated 3 years ago
- ☆18Jun 7, 2023Updated 2 years ago
- This repository contains source codes for SoftCTC. Original paper can be found here: https://arxiv.org/abs/2212.02135☆19Mar 7, 2023Updated 2 years ago
- Implementation of the ByteDance MagicMix paper☆19Nov 4, 2022Updated 3 years ago
- Overview of corpora/datasets for Germanic low-resource languages and dialects. Accompanies "A Survey of Corpora for Germanic Low-Resource…☆26Feb 16, 2026Updated 2 weeks ago
- ABINet++: Autonomous, Bidirectional and Iterative Language Modeling for Scene Text Spotting☆86Feb 11, 2023Updated 3 years ago
- Training and evaluation code for the paper "Headless Language Models: Learning without Predicting with Contrastive Weight Tying" (https:/…☆28Apr 17, 2024Updated last year
- ☆19Oct 1, 2021Updated 4 years ago
- Code for ICCV 2023 Paper: “ICL-D3IE: In-Context Learning with Diverse Demonstrations Updating for Document Information Extraction”☆54Aug 8, 2023Updated 2 years ago
- It's a cooler way to store simple linear models.☆27Jul 15, 2024Updated last year
- This repository is a concise collection of well known deep learning based document binarization models.☆27Dec 24, 2022Updated 3 years ago
- ☆161Dec 27, 2022Updated 3 years ago
- Synthesize image datasets of documents in natural scenes with Python+Blender3D☆60Aug 7, 2022Updated 3 years ago
- Official implementation for Dessurt: Document end-to-end self-supervised understanding and recognition transformer☆62Jan 11, 2023Updated 3 years ago
- PyTorch implementation of BMVC2022 paper Masked Vision-Language Transformers for Scene Text Recognition☆29Nov 11, 2022Updated 3 years ago
- [NeurIPS 2025] MergeBench: A Benchmark for Merging Domain-Specialized LLMs☆43Feb 11, 2026Updated 2 weeks ago
- Read Ten Lines at One Glance: Line-Aware Semi-Autoregressive Transformer for Multi-Line Handwritten Mathematical Expression Recognition☆28Aug 29, 2023Updated 2 years ago
- Unofficial implementation of Towards Accurate Scene Text Recognition with Semantic Reasoning Networks☆28Sep 24, 2021Updated 4 years ago
- 🚀🤗 A collection of templates for Hugging Face Spaces☆35Oct 9, 2023Updated 2 years ago
- ☆36Oct 7, 2023Updated 2 years ago
- Structured TRIZ prompt engineering for LLMs in an open, portable XML format – MIT licensed.☆14Nov 11, 2025Updated 3 months ago
- Seamless interface for using PyTorch distributed with Jupyter notebooks☆57Sep 15, 2025Updated 5 months ago
- Remote sensing elevation (3D) change detection with deep learning☆32Jan 18, 2023Updated 3 years ago
- [NeurIPS 2022] Your Transformer May Not be as Powerful as You Expect (official implementation)☆34Aug 6, 2023Updated 2 years ago
- Multimodal Semi-Supervised Learning for Text Recognition (SemiMTR)☆83Sep 12, 2023Updated 2 years ago
- ☆82Oct 7, 2023Updated 2 years ago
- An Educational Framework Based on PyTorch for Deep Learning Education and Exploration☆10Dec 24, 2023Updated 2 years ago
- Mainly used to store some object tracking code☆11Sep 1, 2021Updated 4 years ago
- SciCount is a tool focused on counting and classifying objects in image-like data and scientific images, with training and example datas…☆11Oct 24, 2023Updated 2 years ago
- On the Effectiveness of Parameter-Efficient Fine-Tuning☆38Nov 4, 2023Updated 2 years ago
- Vision-Language Models Toolbox: Your all-in-one solution for multimodal research and experimentation☆12Feb 16, 2025Updated last year
- A high-throughput and memory-efficient inference and serving engine for LLMs☆12Nov 14, 2025Updated 3 months ago