herobd / layoutlmv2
running LayoutLMv2
☆11 Updated 2 years ago
Alternatives and similar repositories for layoutlmv2:
Users who are interested in layoutlmv2 are comparing it to the repositories listed below.
- Release for CHART annotation tools used for ICDAR CHART 2019 competition ☆27 Updated last year
- baselines for DocVQA dataset ☆21 Updated 3 years ago
- ☆19 Updated 9 months ago
- Official Implementation of SCOB [ICCV 2023] ☆22 Updated last year
- Code for AAAI 2023 Paper: “Alignment-Enriched Tuning for Patch-Level Pre-trained Document Image Models” ☆17 Updated 2 years ago
- ☆44 Updated 3 years ago
- source code and pre-trained/fine-tuned checkpoint for NAACL 2021 paper LightningDOT ☆73 Updated 2 years ago
- VisualMRC: Machine Reading Comprehension on Document Images (AAAI2021) ☆55 Updated 3 years ago
- ☆37 Updated 9 months ago
- Textual Visual Semantic Dataset for Text Spotting. CVPRW 2020 ☆11 Updated 2 years ago
- 👾 A library of state-of-the-art pretrained models for Natural Language Processing (NLP) ☆9 Updated 5 years ago
- CTE: Contextualized Table Extraction Dataset ☆17 Updated 2 years ago
- PyTorch implementation of BMVC2022 paper Masked Vision-Language Transformers for Scene Text Recognition ☆29 Updated 2 years ago
- Implementation of "MULE: Multimodal Universal Language Embedding" ☆16 Updated 5 years ago
- [EMNLP 2021] Code and data for our paper "Vision-and-Language or Vision-for-Language? On Cross-Modal Influence in Multimodal Transformers… ☆20 Updated 3 years ago
- A dataset of crowdsourced ratings for machine-generated image captions ☆35 Updated 5 years ago
- ☆24 Updated 3 years ago
- Searching a High Performance Feature Extractor for Text Recognition Network. TPAMI 2022 ☆13 Updated 2 years ago
- ☆10 Updated last year
- Unconstrained Text Detection with Box Supervision and Dynamic Self-Training ☆33 Updated 2 years ago
- The code related to the baselines from NeurIPS 2021 paper "DUE: End-to-End Document Understanding Benchmark." ☆36 Updated 2 years ago
- Source code repo for paper "TLDR: Token Loss Dynamic Reweighting for Reducing Repetitive Utterance Generation" ☆10 Updated last year
- Code for ICCV 2023 Paper: “ICL-D3IE: In-Context Learning with Diverse Demonstrations Updating for Document Information Extraction” ☆52 Updated last year
- The official implementation of InterBERT ☆11 Updated 2 years ago
- The implementation of multi-branch attentive Transformer (MAT). ☆33 Updated 4 years ago
- ☆15 Updated 2 years ago
- Adversarial Sequence-to-sequence Domain Adaptation Network dubbed ASSDA for robust text image recognition ☆48 Updated 3 years ago
- Curriculum Learning related papers and materials ☆54 Updated 4 years ago
- Implementation of the retriever distillation procedure as outlined in the paper "Distilling Knowledge from Reader to Retriever" ☆32 Updated 4 years ago
- ☆23 Updated 3 years ago