salesforce/QVR-SimpleDLM

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/salesforce/QVR-SimpleDLM)

salesforce / QVR-SimpleDLM

Pytorch Implementation of Value Retrieval with Arbitrary Queries for Form-like Documents.

☆16

Alternatives and similar repositories for QVR-SimpleDLM

Users that are interested in QVR-SimpleDLM are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

salesforce / burn-after-reading
View on GitHub
☆14May 15, 2025Updated last year
HenryJunW / TAG
View on GitHub
☆22Dec 8, 2022Updated 3 years ago
salesforce / PB-OVD
View on GitHub
A pytorch Implementation of Open Vocabulary Object Detection with Pseudo Bounding-Box Labels
☆65Jun 25, 2026Updated 3 weeks ago
chenxn2020 / GOSE
View on GitHub
[Paper] Code for the EMNLP2023 (Findings) paper "Global Structure Knowledge-Guided Relation Extraction Method for Visually-Rich Document"
☆17Dec 1, 2023Updated 2 years ago
due-benchmark / baselines
View on GitHub
The code related to the baselines from NeurIPS 2021 paper "DUE: End-to-End Document Understanding Benchmark."
☆36Mar 2, 2023Updated 3 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
clovaai / spade
View on GitHub
☆82Jun 12, 2023Updated 3 years ago
Line-Kite / GraphLayoutLM
View on GitHub
☆14Sep 6, 2024Updated last year
JieyuZ2 / ProVision
View on GitHub
A instruction data generation system for multimodal language models.
☆37Jan 31, 2025Updated last year
ZZR8066 / GraphDoc
View on GitHub
☆45Jul 18, 2022Updated 4 years ago
LoyoYang / DeCoTa
View on GitHub
ICCV 2021: Deep Co-Training with Task Decomposition for Semi-supervised Domain Adaptation
☆16Dec 8, 2022Updated 3 years ago
gayecolakoglu / LayIE-LLM
View on GitHub
☆15Jan 15, 2026Updated 6 months ago
littletomatodonkey / Augment-XY-CUT
View on GitHub
an unofficial code for augment-XY-CUT in XYLayoutLM
☆30Jul 12, 2022Updated 4 years ago
hmvu-nv / vie_geo_llm
View on GitHub
This repo provides Geometric LayoutLM for Vietnamese document and code for export to ONNX
☆14Mar 3, 2024Updated 2 years ago
chongzhangFDU / TPP
View on GitHub
This is the official repository of the EMNLP 2023 paper Reading Order Matters: Information Extraction from Visually-rich Documents by Tok…
☆18Mar 15, 2024Updated 2 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
due-benchmark / du-schema
View on GitHub
JSON Schema format for storing datasets details, documents processed contents, and documents annotations in the document understanding do…
☆14Nov 5, 2024Updated last year
WinterShiver / Token-Path-Prediction
View on GitHub
This is an unofficial implementation to the EMNLP 2023 paper: Reading Order Matters: Information Extraction from Visually-rich Documents …
☆16May 29, 2024Updated 2 years ago
AIM3-RUC / MPMQA
View on GitHub
Official repository of the paper MPMQA: Multimodal Question Answering on Product Manuals (AAAI 2023)
☆21Nov 28, 2022Updated 3 years ago
mnamysl / nat-acl2020
View on GitHub
☆15May 26, 2021Updated 5 years ago
psunlpgroup / MultiHiertt
View on GitHub
Data and code for ACL 2022 paper "MultiHiertt: Numerical Reasoning over Multi Hierarchical Tabular and Textual Data"
☆54Oct 22, 2024Updated last year
anisha2102 / docvqa
View on GitHub
Document Visual Question Answering
☆130Jul 30, 2020Updated 5 years ago
cl-tohoku / AIO2_DPR_baseline
View on GitHub
https://www.nlp.ecei.tohoku.ac.jp/projects/aio/
☆16Aug 4, 2022Updated 3 years ago
oriyor / turning_tables
View on GitHub
Implementation of the paper: "Turning Tables: Generating Examples from Semi-structured Tables for Endowing Language Models with Reasoning…
☆22Nov 2, 2021Updated 4 years ago
jaeyun95 / pre-trained-vlk-model
View on GitHub
pre-trained vision and language model summary
☆12Apr 20, 2021Updated 5 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
Curt-Park / triton-inference-server-practice
View on GitHub
Archives for Triton Inference Server Practices
☆15Feb 28, 2022Updated 4 years ago
herobd / dessurt
View on GitHub
Official implementation for Dessurt: Document end-to-end self-supervised understanding and recognition transformer
☆62Jan 11, 2023Updated 3 years ago
whlscut / DocLayLLM
View on GitHub
[CVPR 2025] DocLayLLM: An Efficient Multi-modal Extension of Large Language Models for Text-rich Document Understanding
☆30Dec 18, 2025Updated 7 months ago
jpWang / LiLT
View on GitHub
Official PyTorch implementation of LiLT: A Simple yet Effective Language-Independent Layout Transformer for Structured Document Understan…
☆366Oct 31, 2022Updated 3 years ago
bzluan / TextCoT
View on GitHub
[ACM TOMM] Official implementation of "TextCoT: Zoom-In for Enhanced Multimodal Text-Rich Image Understanding"
☆45Feb 27, 2026Updated 4 months ago
zlwang-cs / LASER-release
View on GitHub
Repo for the paper: Towards Few-shot Entity Recognition in Document Images:A Label-aware Sequence-to-Sequence Framework
☆14May 31, 2023Updated 3 years ago
hannandarryl / ManyModalQA
View on GitHub
Code and Data for ManyModalQA: Modality Disambiguation and QA over Diverse Inputs
☆19Mar 2, 2020Updated 6 years ago
shirlyliu64 / ConvBench
View on GitHub
ConvBench: A Multi-Turn Conversation Evaluation Benchmark with Hierarchical Ablation Capability for Large Vision-Language Models
☆16Sep 27, 2024Updated last year
kamalkraj / TAPAS-TF2
View on GitHub
End-to-end neural table-text understanding models.
☆10Nov 11, 2020Updated 5 years ago
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
thanhhau097 / chargrid2d
View on GitHub
☆16Aug 12, 2021Updated 4 years ago
wzk1015 / CNMT
View on GitHub
[AAAI 2021] Confidence-aware Non-repetitive Multimodal Transformers for TextCaps
☆24Mar 29, 2023Updated 3 years ago
vivien000 / clip-demo
View on GitHub
Minimal user-friendly demo of OpenAI's CLIP for semantic image search
☆20Sep 28, 2024Updated last year
maljovec / nglpy
View on GitHub
A Python wrapped version of the Neighborhood Graph Library (NGL) developed by Carlos Correa and Peter Lindstrom with additional parameter…
☆22Oct 21, 2024Updated last year
ZeningLin / PEneo
View on GitHub
[MM'2024] PEneo, an effective algorithm for key-value pair extraction from form-like documents, designed for real-world applications.
☆41Apr 7, 2025Updated last year
cxsoto / article-regions
View on GitHub
A dataset of region-annotated scientific articles.
☆21Jan 24, 2020Updated 6 years ago
mineshmathew / DocVQA
View on GitHub
baselines for DocVQA dataset
☆21Apr 11, 2021Updated 5 years ago