This is the official repository of the revised datasets FUNSD-r and CORD-r, introduced in the EMNLP 2023 paper Reading Order Matters: Information Extraction from Visually-rich Documents by Token Path Prediction.
☆17Mar 20, 2024Updated 2 years ago
Alternatives and similar repositories for Token-Path-Prediction-Datasets
Users that are interested in Token-Path-Prediction-Datasets are comparing it to the libraries listed below.
- This is the official repository of the EMNLP 2023 paper Reading Order Matters: Information Extraction from Visually-rich Documents by Tok…☆18Mar 15, 2024Updated 2 years ago
- This is an unofficial implementation of the EMNLP 2023 paper: Reading Order Matters: Information Extraction from Visually-rich Documents …☆16May 29, 2024Updated last year
- Dataset for EMNLP'23 Paper "DocTrack: A Visually-Rich Document Dataset Really Aligned with Human Eye Movement for Machine Reading"☆11Oct 25, 2023Updated 2 years ago
- Dataset and scripts for HRDoc☆41Jun 21, 2023Updated 2 years ago
- This is the code repo for the paper <UTC-IE: A Unified Token-pair Classification Architecture for Information Extraction>☆15Aug 10, 2023Updated 2 years ago
- Datasets and Evaluation Scripts for CompHRDoc☆57Feb 25, 2025Updated last year
- ReadingBank: A Benchmark Dataset for Reading Order Detection☆117Aug 26, 2024Updated last year
- This is the official implementation of the EMNLP 2024 paper: Modeling Layout Reading Order as Ordering Relations for Visually-rich Docume…☆31Jan 19, 2026Updated 2 months ago
- ☆13Jun 20, 2022Updated 3 years ago
- Curated list of awesome datasets for various table understanding tasks☆18Sep 5, 2025Updated 6 months ago
- This project has included related source codes and datasets of our EMNLP2021 paper☆10May 28, 2022Updated 3 years ago
- a simple neural network☆11Dec 20, 2018Updated 7 years ago
- Repository for the KVP10k dataset☆22Sep 18, 2025Updated 6 months ago
- This repo consists of my implementation of DocFormerV2☆11Mar 31, 2024Updated last year
- ☆19Mar 10, 2023Updated 3 years ago
- Accelerating GOT-OCRv2 with VLLM☆11Nov 15, 2024Updated last year
- Implementation of the paper: Going Full-TILT Boogie on Document Understanding with Text-Image-Layout Transformer.☆18Apr 23, 2023Updated 2 years ago
- ☆17Nov 1, 2024Updated last year
- InstructDoc: A Dataset for Zero-Shot Generalization of Visual Document Understanding with Instructions (AAAI2024)☆162May 31, 2024Updated last year
- Code for EMNLP'20 paper "When Hearst Is not Enough: Improving Hypernymy Detection from Corpus with Distributional Models"☆11Nov 10, 2020Updated 5 years ago
- Starting with Bi-Directional LSTMS☆18Mar 21, 2018Updated 8 years ago
- ☆14Sep 30, 2021Updated 4 years ago
- A repository dedicated to learning about ChatGPT training techniques and related knowledge. Contains study notes, code snippets, and reso…☆13Dec 14, 2024Updated last year
- The Science knowledge graph ontologies, a.k.a. SKGO, is a suite of OWL ontology models to capture the knowledge of scientific research da…☆16Jul 3, 2025Updated 8 months ago
- A reproduction of the paper - Aligner: Achieving Efficient Alignment through Weak-to-Strong Correction☆22May 29, 2024Updated last year
- Parse eslint output into git-blame to unveil the offenders☆14Jan 26, 2017Updated 9 years ago
- ☆16Jun 18, 2022Updated 3 years ago
- Dynamic proxy for Android that can proxy all classes (including final classes)☆22Feb 8, 2021Updated 5 years ago
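- Android Bluetooth 4.0 development, compatible with Bluetooth 2.0; for Arduino enthusiasts, this code provides connection support for HC-06 and HC-08 Bluetooth modules☆16Dec 8, 2017Updated 8 years ago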
- Knowledge extraction from semi-structured web.☆13Mar 25, 2024Updated 2 years ago
- Unofficial implementation of WebFormer: The Web-page Transformer for Structure Information Extraction☆13Apr 20, 2023Updated 2 years ago
- pyProCT is an open source cluster analysis software especially adapted for jobs related with structural proteomics. Its approach allows u…☆10Aug 24, 2017Updated 8 years ago
- Awesome LLM for NLG Evaluation Papers☆25Jan 23, 2024Updated 2 years ago
- ☆14Apr 18, 2020Updated 5 years ago
- Citation Extraction and Classifier☆16Mar 16, 2026Updated last week
- Learning Ontologies Via Embeddings☆12Jul 6, 2023Updated 2 years ago
- ☆19Jul 16, 2020Updated 5 years ago
- Dataset and Code for ACL 2023 paper: "IM-TQA: A Chinese Table Question Answering Dataset with Implicit and Multi-type Table Structures". …☆27Aug 6, 2024Updated last year
- Series of deep reinforcement learning algorithms 🤖☆29Jun 19, 2021Updated 4 years ago