PyTorch implementation of Value Retrieval with Arbitrary Queries for Form-like Documents.
☆16May 1, 2025Updated last year
Alternatives and similar repositories for QVR-SimpleDLM
Users who are interested in QVR-SimpleDLM are comparing it to the libraries listed below.
- ☆22Dec 8, 2022Updated 3 years ago
- A PyTorch implementation of Open Vocabulary Object Detection with Pseudo Bounding-Box Labels☆65Jun 2, 2026Updated last week
- The code related to the baselines from NeurIPS 2021 paper "DUE: End-to-End Document Understanding Benchmark."☆36Mar 2, 2023Updated 3 years ago
- [Paper] Code for the EMNLP2023 (Findings) paper "Global Structure Knowledge-Guided Relation Extraction Method for Visually-Rich Document"☆17Dec 1, 2023Updated 2 years ago
- ☆81Jun 12, 2023Updated 2 years ago
- An instruction data generation system for multimodal language models.☆37Jan 31, 2025Updated last year
- Resources about Design + AI (papers, datasets, events, companies, etc.)☆64Mar 17, 2021Updated 5 years ago
- ☆45Jul 18, 2022Updated 3 years ago
- ICCV 2021: Deep Co-Training with Task Decomposition for Semi-supervised Domain Adaptation☆16Dec 8, 2022Updated 3 years ago
- An unofficial implementation of augment-XY-CUT in XYLayoutLM☆30Jul 12, 2022Updated 3 years ago
- This is an unofficial implementation of the EMNLP 2023 paper: Reading Order Matters: Information Extraction from Visually-rich Documents …☆16May 29, 2024Updated 2 years ago
- This is the official repository of the EMNLP 2023 paper Reading Order Matters: Information Extraction from Visually-rich Documents by Tok…☆18Mar 15, 2024Updated 2 years ago
- This repo provides Geometric LayoutLM for Vietnamese document and code for export to ONNX☆14Mar 3, 2024Updated 2 years ago
- Used in M4C feature extraction script: https://github.com/facebookresearch/mmf/blob/project/m4c/projects/M4C/scripts/extract_ocr_frcn_fea…☆13Jan 30, 2020Updated 6 years ago
- JSON Schema format for storing datasets details, documents processed contents, and documents annotations in the document understanding do…☆14Nov 5, 2024Updated last year
- ☆10Oct 1, 2020Updated 5 years ago
- Multi-span Style Extraction for Generative Reading Comprehension☆10Apr 2, 2021Updated 5 years ago
- Official repository of the paper MPMQA: Multimodal Question Answering on Product Manuals (AAAI 2023)☆19Nov 28, 2022Updated 3 years ago
- Code and Data for ManyModalQA: Modality Disambiguation and QA over Diverse Inputs☆18Mar 2, 2020Updated 6 years ago
- Document Visual Question Answering☆131Jul 30, 2020Updated 5 years ago
- https://www.nlp.ecei.tohoku.ac.jp/projects/aio/☆16Aug 4, 2022Updated 3 years ago
- ☆52May 28, 2024Updated 2 years ago
- A large-scale infographics dataset from Visual.ly with metadata and additional crowdsourced annotations☆16Oct 8, 2018Updated 7 years ago
- Doc2Graph transforms documents into graphs and exploits a GNN to solve several tasks.☆136Oct 18, 2025Updated 7 months ago
- The repo of the Doc2SoarGraph framework☆10Sep 17, 2024Updated last year
- Implementation of the paper: "Turning Tables: Generating Examples from Semi-structured Tables for Endowing Language Models with Reasoning…☆22Nov 2, 2021Updated 4 years ago
- pre-trained vision and language model summary☆12Apr 20, 2021Updated 5 years ago
- Archives for Triton Inference Server Practices☆15Feb 28, 2022Updated 4 years ago
- ☆24Mar 7, 2023Updated 3 years ago
- ☆13Jun 20, 2023Updated 2 years ago
- Official implementation for Dessurt: Document end-to-end self-supervised understanding and recognition transformer☆62Jan 11, 2023Updated 3 years ago
- [ACM TOMM] Official implementation of "TextCoT: Zoom-In for Enhanced Multimodal Text-Rich Image Understanding"☆45Feb 27, 2026Updated 3 months ago
- [ICCV 2019] Balanced Datasets Are Not Enough: Estimating and Mitigating Gender Bias in Deep Image Representations☆31Aug 6, 2021Updated 4 years ago
- Official PyTorch implementation of LiLT: A Simple yet Effective Language-Independent Layout Transformer for Structured Document Understan…☆366Oct 31, 2022Updated 3 years ago
- ConvBench: A Multi-Turn Conversation Evaluation Benchmark with Hierarchical Ablation Capability for Large Vision-Language Models☆16Sep 27, 2024Updated last year
- Minimal user-friendly demo of OpenAI's CLIP for semantic image search☆19Sep 28, 2024Updated last year
- [AAAI 2021] Confidence-aware Non-repetitive Multimodal Transformers for TextCaps☆24Mar 29, 2023Updated 3 years ago
- [MM'2024] PEneo, an effective algorithm for key-value pair extraction from form-like documents, designed for real-world applications.☆41Apr 7, 2025Updated last year
- ☆12Sep 2, 2021Updated 4 years ago