This is the official repository of the revised datasets FUNSD-r and CORD-r, introduced in EMNLP 2023 paper Reading Order Matters: Information Extraction from Visually-rich Documents by Token Path Prediction.
☆17Mar 20, 2024Updated 2 years ago
Alternatives and similar repositories for Token-Path-Prediction-Datasets
Users that are interested in Token-Path-Prediction-Datasets are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- This is the official repository of the EMNLP 2023 paper Reading Order Matters: Information Extraction from Visually-rich Documents by Tok…☆18Mar 15, 2024Updated 2 years ago
- This is an unofficial implementation to the EMNLP 2023 paper: Reading Order Matters: Information Extraction from Visually-rich Documents …☆16May 29, 2024Updated last year
- Dataset and scripts for HRDoc☆41Jun 21, 2023Updated 2 years ago
- This is the code repo for the paper <UTC-IE: A Unified Token-pair Classification Architecture for Information Extraction>☆15Aug 10, 2023Updated 2 years ago
- Datasets and Evaluation Scripts for CompHRDoc☆57Feb 25, 2025Updated last year
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- ReadingBank: A Benchmark Dataset for Reading Order Detection☆116Aug 26, 2024Updated last year
- This is the official implementation to the EMNLP 2024 paper: Modeling Layout Reading Order as Ordering Relations for Visually-rich Docume…☆31Jan 19, 2026Updated 2 months ago
- ☆13Jun 20, 2022Updated 3 years ago
- Curated list of awesome datasets for various table understanding tasks☆18Sep 5, 2025Updated 7 months ago
- simd enabled column imprints☆11Feb 12, 2018Updated 8 years ago
- a simple neural network☆11Dec 20, 2018Updated 7 years ago
- Repository for the KVP10k dataset☆22Sep 18, 2025Updated 6 months ago
- The official code for "OG-HFYOLO :Orientation Gradient Guidance and Heterogeneous Feature Fusion For Deformation Table Cell Instance Segm…☆13Jul 28, 2025Updated 8 months ago
- ☆19Mar 10, 2023Updated 3 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Accelerating GOT-OCRv2 with VLLM☆10Nov 15, 2024Updated last year
- Implementation of the paper: Going Full-TILT Boogie on Document Understanding with Text-Image-Layout Transformer.☆18Apr 23, 2023Updated 2 years ago
- ☆17Nov 1, 2024Updated last year
- Ray Framework (https://github.com/ray-project/ray) on Kubernetes☆13Oct 12, 2018Updated 7 years ago
- InstructDoc: A Dataset for Zero-Shot Generalization of Visual Document Understanding with Instructions (AAAI2024)☆163May 31, 2024Updated last year
- ☆34Dec 19, 2025Updated 3 months ago
- Code for EMNLP'20 paper "When Hearst Is not Enough: Improving Hypernymy Detection from Corpus with Distributional Models"☆11Nov 10, 2020Updated 5 years ago
- Starting with Bi-Directional LSTMS☆18Mar 21, 2018Updated 8 years ago
- ☆14Sep 30, 2021Updated 4 years ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- A repository dedicated to learning about ChatGPT training techniques and related knowledge. Contains study notes, code snippets, and reso…☆13Dec 14, 2024Updated last year
- Official implementation of the ANLS* metric☆22Apr 7, 2026Updated last week
- The reproduct of the paper - Aligner: Achieving Efficient Alignment through Weak-to-Strong Correction☆22May 29, 2024Updated last year
- Sentence-BERT (SBERT),is a modification of the pretrained BERT network that use siamese and triplet network structures to derive semantic…☆15Jan 22, 2022Updated 4 years ago
- The code implementation for the article "Towards Patronizing and Condescending Language in Chinese Videos: A Multimodal Dataset and Fram…☆16Apr 3, 2025Updated last year
- android4.0蓝牙开发,兼容2.0蓝牙的使用,喜欢arduino开发的,本代码提供hc-06,08蓝牙模块的连接使用☆16Dec 8, 2017Updated 8 years ago
- Knowledge extraction from semi-structured web.☆13Mar 25, 2024Updated 2 years ago
- unofficial impelement of the webformer: The Web-page Transformer for Structure Information Extraction☆13Apr 20, 2023Updated 2 years ago
- pyProCT is an open source cluster analysis software especially adapted for jobs related with structural proteomics. Its approach allows u…☆10Aug 24, 2017Updated 8 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Use strategy in stock transaction for high revenue.☆11Dec 24, 2015Updated 10 years ago
- ☆14Apr 18, 2020Updated 5 years ago
- Citation Extraction and Classifier☆16Mar 16, 2026Updated 3 weeks ago
- ☆19Jul 16, 2020Updated 5 years ago
- Dataset and Code for ACL 2023 paper: "IM-TQA: A Chinese Table Question Answering Dataset with Implicit and Multi-type Table Structures". …☆27Aug 6, 2024Updated last year
- ☆20Aug 26, 2024Updated last year
- A large Minecraft datapack☆19Sep 5, 2023Updated 2 years ago