This is the official repository of the revised datasets FUNSD-r and CORD-r, introduced in EMNLP 2023 paper Reading Order Matters: Information Extraction from Visually-rich Documents by Token Path Prediction.
☆17Mar 20, 2024Updated 2 years ago
Alternatives and similar repositories for Token-Path-Prediction-Datasets
Users that are interested in Token-Path-Prediction-Datasets are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- This is the official repository of the EMNLP 2023 paper Reading Order Matters: Information Extraction from Visually-rich Documents by Tok…☆18Mar 15, 2024Updated 2 years ago
- Dataset for EMNLP'23 Paper "DocTrack: A Visually-Rich Document Dataset Really Aligned with Human Eye Movement for Machine Reading"☆11Oct 25, 2023Updated 2 years ago
- Dataset and scripts for HRDoc☆41Jun 21, 2023Updated 2 years ago
- This is the code repo for the paper <UTC-IE: A Unified Token-pair Classification Architecture for Information Extraction>☆15Aug 10, 2023Updated 2 years ago
- Datasets and Evaluation Scripts for CompHRDoc☆59Feb 25, 2025Updated last year
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- ReadingBank: A Benchmark Dataset for Reading Order Detection☆117Aug 26, 2024Updated last year
- ☆13Jun 20, 2022Updated 3 years ago
- Curated list of awesome datasets for various table understanding tasks☆18Sep 5, 2025Updated 8 months ago
- This project has included related source codes and datasets of our EMNLP2021 paper☆10May 28, 2022Updated 3 years ago
- Repository for the KVP10k dataset☆23Sep 18, 2025Updated 8 months ago
- ☆11Jan 23, 2019Updated 7 years ago
- The official code for "OG-HFYOLO :Orientation Gradient Guidance and Heterogeneous Feature Fusion For Deformation Table Cell Instance Segm…☆13Jul 28, 2025Updated 9 months ago
- ☆19Mar 10, 2023Updated 3 years ago
- Implementation of the paper: Going Full-TILT Boogie on Document Understanding with Text-Image-Layout Transformer.☆18Apr 23, 2023Updated 3 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- An official implementation of paper "Paragraph2Graph: A Language-independent GNN-based framework for layout analysis"☆81Oct 14, 2023Updated 2 years ago
- InstructDoc: A Dataset for Zero-Shot Generalization of Visual Document Understanding with Instructions (AAAI2024)☆163May 31, 2024Updated last year
- [EMNLP2020] When Hearst Is not Enough: Improving Hypernymy Detection from Corpus with Distributional Models☆11Nov 10, 2020Updated 5 years ago
- ☆14Sep 30, 2021Updated 4 years ago
- A repository dedicated to learning about ChatGPT training techniques and related knowledge. Contains study notes, code snippets, and reso…☆13Dec 14, 2024Updated last year
- The Science knowledge graph ontologies, a.k.a. SKGO, is a suite of OWL ontology models to capture the knowledge of scientific research da…☆16Jul 3, 2025Updated 10 months ago
- The reproduct of the paper - Aligner: Achieving Efficient Alignment through Weak-to-Strong Correction☆21May 29, 2024Updated last year
- Official implementation of the ANLS* metric☆22May 15, 2026Updated last week
- Parse eslint output into git-blame to unveil the offenders☆13Jan 26, 2017Updated 9 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- 安卓上的动态代理,可代理所有类(包括final类)☆22Feb 8, 2021Updated 5 years ago
- Knowledge extraction from semi-structured web.☆13Mar 25, 2024Updated 2 years ago
- unofficial impelement of the webformer: The Web-page Transformer for Structure Information Extraction☆13Apr 20, 2023Updated 3 years ago
- android4.0蓝牙开发,兼容2.0蓝牙的使用,喜欢arduino开发的,本代码提供hc-06,08蓝牙模块的连接使用☆16Dec 8, 2017Updated 8 years ago
- Use strategy in stock transaction for high revenue.☆10Dec 24, 2015Updated 10 years ago
- ☆14Apr 18, 2020Updated 6 years ago
- Citation Extraction and Classifier☆16Apr 18, 2026Updated last month
- Learning Ontologies Via Embeddings☆12Jul 6, 2023Updated 2 years ago
- ☆19Jul 16, 2020Updated 5 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- ☆20Aug 26, 2024Updated last year
- Dataset and Code for ACL 2023 paper: "IM-TQA: A Chinese Table Question Answering Dataset with Implicit and Multi-type Table Structures". …☆27Aug 6, 2024Updated last year
- CypherBench: Towards Precise Retrieval over Full-scale Modern Knowledge Graphs in the LLM Era☆36Jun 18, 2025Updated 11 months ago
- ☆14May 23, 2024Updated 2 years ago
- A large Minecraft datapack☆19Sep 5, 2023Updated 2 years ago
- All in one PDF Parser Toolkit☆17Sep 15, 2023Updated 2 years ago
- Data and Code for EMNLP 2023 paper "QTSumm: Query-Focused Summarization over Tabular Data"☆23Mar 29, 2024Updated 2 years ago