Code for AAAI 2023 Paper : “Alignment-Enriched Tuning for Patch-Level Pre-trained Document Image Models”
☆18Dec 6, 2022Updated 3 years ago
Alternatives and similar repositories for AET
Users that are interested in AET are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Contrast-guided Feature Adjustment Module for Visual Information Extraction☆30May 23, 2023Updated 2 years ago
- My Implementation for the paper EDA: Easy Data Augmentation Techniques for Boosting Performance on Text Classification Tasks using Tensor…☆12Mar 18, 2022Updated 4 years ago
- Implementation of the ByteDance MagicMix paper☆19Nov 4, 2022Updated 3 years ago
- Main use to store some object trackiing code☆11Sep 1, 2021Updated 4 years ago
- ☆14Jan 15, 2026Updated 2 months ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- Synthesize image datasets of documents in natural scenes with Python+Blender3D☆60Aug 7, 2022Updated 3 years ago
- ☆161Dec 27, 2022Updated 3 years ago
- ABINet++: Autonomous, Bidirectional and Iterative Language Modeling for Scene Text Spotting☆90Feb 11, 2023Updated 3 years ago
- ☆18Jun 7, 2023Updated 2 years ago
- ☆16Jan 30, 2022Updated 4 years ago
- This repository is a concise collection of well known deep learning based document binarization models.☆29Dec 24, 2022Updated 3 years ago
- Image captioning with weight pruning in PyTorch☆22Jan 14, 2022Updated 4 years ago
- Graph Key Information Extraction: GKIE☆11Sep 15, 2022Updated 3 years ago
- Training and evaluation code for the paper "Headless Language Models: Learning without Predicting with Contrastive Weight Tying" (https:/…☆29Apr 17, 2024Updated last year
- NordVPN Special Discount Offer • AdSave on top-rated NordVPN 1 or 2-year plans with secure browsing, privacy protection, and support for for all major platforms.
- ☆14May 26, 2023Updated 2 years ago
- An implementation of Tiling and Corruption (TACo) Augmentations for OCR/HTR☆15Dec 4, 2021Updated 4 years ago
- Structured TRIZ prompt engineering for LLMs in an open, portable XML format – MIT licensed.☆17Nov 11, 2025Updated 5 months ago
- This repository contains source codes for SoftCTC. Original paper can be found here: https://arxiv.org/abs/2212.02135☆19Mar 7, 2023Updated 3 years ago
- Overview of corpora/datasets for Germanic low-resource languages and dialects. Accompanies "A Survey of Corpora for Germanic Low-Resource…☆27Feb 16, 2026Updated last month
- 🚀🤗 A collection of templates for Hugging Face Spaces☆35Oct 9, 2023Updated 2 years ago
- Pytorch implementation of deep fill v2 (original by Jiayu et al.)☆10Jun 26, 2019Updated 6 years ago
- Exploring Hierarchical Graph Representation for Large-Scale Zero-Shot Image Classification. ECCV 2022.☆18Jul 12, 2022Updated 3 years ago
- ☆21Dec 18, 2025Updated 3 months ago
- NordVPN Special Discount Offer • AdSave on top-rated NordVPN 1 or 2-year plans with secure browsing, privacy protection, and support for for all major platforms.
- ☆18May 30, 2023Updated 2 years ago
- Library for converting from RGB / GrayScale image to base64 and back.☆19Sep 19, 2022Updated 3 years ago
- Official implementation for Dessurt: Document end-to-end self-supervised understanding and recognition transformer☆62Jan 11, 2023Updated 3 years ago
- A hobby project that dewarps book pages in images☆19Jan 5, 2023Updated 3 years ago
- ☆24Sep 2, 2022Updated 3 years ago
- HyPe: Better Pre-trained Language Model Fine-tuning with Hidden Representation Perturbation [ACL 2023]☆14Jul 11, 2023Updated 2 years ago
- ☆82Jun 12, 2023Updated 2 years ago
- ☆10Aug 17, 2021Updated 4 years ago
- IMPACT: A Large-scale Integrated Multimodal Patent Analysis and Creation Dataset for Design Patents (NeurIPS 2024)☆15Jul 14, 2025Updated 8 months ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Towards Video Text Visual Question Answering: Benchmark and Baseline☆40Feb 26, 2024Updated 2 years ago
- 针对离线中文手写数据集的学习☆18Jan 30, 2020Updated 6 years ago
- Codes and files for the paper Are Emergent Abilities in Large Language Models just In-Context Learning☆33Jan 9, 2025Updated last year
- ☆15May 10, 2021Updated 4 years ago
- ☆10Dec 17, 2020Updated 5 years ago
- Official Implementation of Web-based Visual Corpus Builder (Webvicob), ICDAR 2023☆110Oct 24, 2023Updated 2 years ago
- ☆10Oct 2, 2024Updated last year