DocBench: A Benchmark for Evaluating LLM-based Document Reading Systems
☆74Sep 29, 2024Updated last year
Alternatives and similar repositories for DocBench
Users that are interested in DocBench are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Task Compass: Scaling Multi-task Pre-training with Task Prefix (EMNLP 2022: Findings) (stay tuned & more will be updated)☆22Oct 17, 2022Updated 3 years ago
- ☆18Jun 12, 2024Updated last year
- TAT-DQA: Towards Complex Document Understanding By Discrete Reasoning☆23Sep 17, 2024Updated last year
- Code for paper "Unraveling Cross-Modality Knowledge Conflicts in Large Vision-Language Models."☆53Oct 19, 2024Updated last year
- [COLM'24] "Deductive Beam Search: Decoding Deducible Rationale for Chain-of-Thought Reasoning"☆21Jun 14, 2024Updated last year
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- ☆19Feb 14, 2023Updated 3 years ago
- SlideVQA: A Dataset for Document Visual Question Answering on Multiple Images (AAAI2023)☆106Mar 31, 2025Updated last year
- "Towards Improving Document Understanding: An Exploration on Text-Grounding via MLLMs" 2023☆16Nov 28, 2024Updated last year
- Official repository for EMNLP'22 paper: Grape: Knowledge Graph Enhanced Passage Reader for Open-domain Question Answering☆23Oct 20, 2022Updated 3 years ago
- Resources for Retrieval Augmentation for Commonsense Reasoning: A Unified Approach. EMNLP 2022.☆24Nov 23, 2022Updated 3 years ago
- Implementation for the EMNLP 2021 paper "Interactive Machine Comprehension with Dynamic Knowledge Graphs".☆21Aug 31, 2021Updated 4 years ago
- Code repo for EMNLP 2023 paper "Auto-Instruct: Automatic Instruction Generation and Ranking for Black-Box Language Models"☆23Nov 13, 2023Updated 2 years ago
- Dive-into-LLMs Tutorial for Beginners☆23May 14, 2024Updated 2 years ago
- [EMNLP 2021] Dataset and PyTorch Code for ExplaGraphs: An Explanation Graph Generation Task for Structured Commonsense Reasoning☆14Nov 5, 2022Updated 3 years ago
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- ☆36Feb 21, 2025Updated last year
- Official repository accompaying the ICDAR 2023 paper☆13Oct 3, 2023Updated 2 years ago
- DatasetImgLabeler is a image annotation tool for researchers to prepare datasets in ICDAR2015 format☆12Dec 7, 2019Updated 6 years ago
- ☆59May 19, 2025Updated last year
- ☆70Jan 9, 2024Updated 2 years ago
- [NAACL'25] "Revealing the Barriers of Language Agents in Planning"☆13Jun 22, 2025Updated 11 months ago
- code for paper Hierarchical Retrieval-Augmented Generation Model with Rethink for Multi-hop Question Answering☆15Aug 13, 2024Updated last year
- [ICME 2023] FlowText: Synthesizing Realistic Scene Text Video with Optical Flow Estimation☆13May 13, 2023Updated 3 years ago
- ☆14Jun 10, 2025Updated 11 months ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Code of "Improving Machine Translation with Human Feedback: An Exploration of Quality Estimation as a Reward Model"☆22Jun 28, 2024Updated last year
- Code Repo for "Differentiable Open-Ended Commonsense Reasoning" (NAACL 2021)☆32Jun 30, 2023Updated 2 years ago
- ☆88Jun 1, 2023Updated 2 years ago
- An implementation of Tiling and Corruption (TACo) Augmentations for OCR/HTR☆15Dec 4, 2021Updated 4 years ago
- ☆23Oct 11, 2024Updated last year
- Official code repository for the main conference paper in EMNLP 2022: SubeventWriter: Iterative Sub-event Sequence Generation with Cohere…☆11Oct 16, 2022Updated 3 years ago
- ☆10Dec 12, 2022Updated 3 years ago
- This project aims to generate syntactichandwritten mathematical expression. The dataset is generated from the CROHME 2014 training set.☆14Feb 24, 2022Updated 4 years ago
- An official implementation of "Catastrophic Failure of LLM Unlearning via Quantization" (ICLR 2025)☆38Feb 22, 2025Updated last year
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- [NeurIPS 2024] Code and Data Repo for Paper "Embedding Trajectory for Out-of-Distribution Detection in Mathematical Reasoning"☆28May 28, 2024Updated last year
- This repository contains source codes for SoftCTC. Original paper can be found here: https://arxiv.org/abs/2212.02135☆19Mar 7, 2023Updated 3 years ago
- This is the official implementation of ICML 2025 paper MindLLM: A Subject-Agnostic and Versatile Model for fMRI-to-Text Decoding.☆23Jul 22, 2025Updated 10 months ago
- ☆12Oct 17, 2024Updated last year
- Pretraining with Natural and Synthetic Data for Few-shot Table-based Question Answering☆31Dec 2, 2022Updated 3 years ago
- [WMT 2022] Implementation of TAL-SJTU's system for WMT22 English-Livonian☆23May 4, 2023Updated 3 years ago
- Create handwritten word embeddings from a text recognition Seq2Seq system.☆11Dec 1, 2022Updated 3 years ago