Form2Seq-Data / Dataset
Dataset corresponding to the paper: "Form2Seq : A Framework for Higher-Order Form Structure Extraction"
☆10Updated 4 years ago
Alternatives and similar repositories for Dataset:
Users that are interested in Dataset are comparing it to the libraries listed below
- Unconstrained Text Detection with Box Supervisionand Dynamic Self-Training☆33Updated 2 years ago
- Code for AAAI 2023 Paper : “Alignment-Enriched Tuning for Patch-Level Pre-trained Document Image Models”☆17Updated 2 years ago
- ☆39Updated 5 years ago
- Project code for ACM MM2020 paper: "TextRay: Contour-based Geometric Modeling for Arbitrary-shaped Scene Text Detection"☆47Updated last year
- Implementation of the Remixer Block from the Remixer paper, in Pytorch☆35Updated 3 years ago
- ☆18Updated 3 years ago
- This repository contains the code for the paper in Findings of EMNLP 2021: "EfficientBERT: Progressively Searching Multilayer Perceptron …☆32Updated last year
- ☆18Updated last year
- This repository contains source codes for SoftCTC. Original paper can be found here: https://arxiv.org/abs/2212.02135☆19Updated last year
- State-of-the-art data augmentation search algorithms in PyTorch☆47Updated last year
- ☆8Updated 3 years ago
- Focal CTC for End-To-End OMR task with Class Imbalance, SangCTC (Part I)☆22Updated 4 years ago
- Geometry Normalization Networks for Accurate Scene Text Detection (iccv 2019)☆21Updated 4 years ago
- ☆26Updated 2 years ago
- PyTorch implementation of BMVC2022 paper Masked Vision-Language Transformers for Scene Text Recognition☆29Updated 2 years ago
- Searching a High Performance Feature Extractor for Text Recognition Network. TPAMI 2022☆13Updated 2 years ago
- TextAdaIN: Paying Attention to Shortcut Learning in Text Recognizers☆21Updated 2 years ago
- Scene text rectification using glyph and character alignment properties☆20Updated 7 years ago
- ☆16Updated 3 years ago
- An open source implementation of CLIP.☆32Updated 2 years ago
- Code for the ICDAR2021 paper "Visual FUDGE: Form Understanding via Dynamic Graph Editing"☆33Updated 2 years ago
- [FGVC9-CVPR 2022] The second place solution for 2nd eBay eProduct Visual Search Challenge.☆26Updated 2 years ago
- Vision-Language Pre-Training for Boosting Scene Text Detectors (CVPR2022)☆12Updated 2 years ago
- ☆25Updated 4 years ago
- The code is based on resnet50. At present, the data set hmean in icdar2015 is about 80. A rough version will be sorted out and optimized …☆8Updated 5 years ago
- ☆24Updated 3 years ago
- Packaged TResNet based on Official PyTorch Implementation☆15Updated 4 years ago
- Implementation of a Transformer using ReLA (Rectified Linear Attention) from https://arxiv.org/abs/2104.07012☆49Updated 2 years ago
- ☆41Updated 4 years ago
- An end to end ASR Transformer model training repo☆13Updated 3 years ago