herobd / layoutlmv2Links
running LayoutLMv2
☆11Updated 3 years ago
Alternatives and similar repositories for layoutlmv2
Users that are interested in layoutlmv2 are comparing it to the libraries listed below
Sorting:
- source code and pre-trained/fine-tuned checkpoint for NAACL 2021 paper LightningDOT☆72Updated 2 years ago
- ☆24Updated 4 years ago
- Textual Visual Semantic Dataset for Text Spotting. CVPRW 2020☆11Updated 3 years ago
- ☆13Updated 5 years ago
- baselines for DocVQA dataset☆21Updated 4 years ago
- The official implementation of InterBERT☆11Updated 2 years ago
- Code for AAAI 2023 Paper : “Alignment-Enriched Tuning for Patch-Level Pre-trained Document Image Models”☆18Updated 2 years ago
- Data of ACL 2019 Paper "Expressing Visual Relationships via Language".☆62Updated 4 years ago
- Zero-Shot Knowledge Distillation in Deep Networks in ICML2019☆49Updated 6 years ago
- FlatNCE: A Novel Contrastive Representation Learning Objective☆90Updated 3 years ago
- ☆131Updated 2 years ago
- Implementing SYNTHESIZER: Rethinking Self-Attention in Transformer Models using Pytorch☆70Updated 5 years ago
- nocaps: novel object captioning at scale☆10Updated 6 years ago
- ☆43Updated last year
- The implementation of multi-branch attentive Transformer (MAT).☆33Updated 4 years ago
- Source code repo for paper "TLDR: Token Loss Dynamic Reweighting for Reducing Repetitive Utterance Generation"☆10Updated last year
- VisualMRC: Machine Reading Comprehension on Document Images (AAAI2021)☆56Updated 4 months ago
- Code for EMNLP 2020 paper CoDIR☆41Updated 2 years ago
- Official MXNet implementation of "Embedding Expansion: Augmentation in Embedding Space for Deep Metric Learning" (CVPR 2020)☆79Updated 2 years ago
- ☆20Updated 3 years ago
- [EMNLP 2021] Code and data for our paper "Vision-and-Language or Vision-for-Language? On Cross-Modal Influence in Multimodal Transformers…☆20Updated 3 years ago
- ☆44Updated 4 years ago
- Curriculum Learning related papers and materials☆54Updated 4 years ago
- Implementation of the retriever distillation procedure as outlined in the paper "Distilling Knowledge from Reader to Retriever"☆32Updated 4 years ago
- ☆16Updated 4 years ago
- Implementation of our PR 2020 paper:Unsupervised Text-to-Image Synthesis☆13Updated 5 years ago
- Official code for Group-Transformer (Scale down Transformer by Grouping Features for a Lightweight Character-level Language Model, COLING…☆25Updated 4 years ago
- This dataset contains about 110k images annotated with the depth and occlusion relationships between arbitrary objects. It enables resear…☆16Updated 4 years ago
- Implementation of Soft-Label Chain Conditional Random Field for Phrase Grounding in PyTorch☆16Updated 2 years ago
- Code for the ACL2020 paper Character-Level Translation with Self-Attention☆31Updated 4 years ago