This is the official repository of the revised datasets FUNSD-r and CORD-r, introduced in EMNLP 2023 paper Reading Order Matters: Information Extraction from Visually-rich Documents by Token Path Prediction.
☆17Mar 20, 2024Updated last year
Alternatives and similar repositories for Token-Path-Prediction-Datasets
Users that are interested in Token-Path-Prediction-Datasets are comparing it to the libraries listed below
Sorting:
- This is the official repository of the EMNLP 2023 paper Reading Order Matters: Information Extraction from Visually-rich Documents by Tok…☆18Mar 15, 2024Updated last year
- Dataset for EMNLP'23 Paper "DocTrack: A Visually-Rich Document Dataset Really Aligned with Human Eye Movement for Machine Reading"☆11Oct 25, 2023Updated 2 years ago
- Dataset and scripts for HRDoc☆41Jun 21, 2023Updated 2 years ago
- Datasets and Evaluation Scripts for CompHRDoc☆56Feb 25, 2025Updated last year
- ReadingBank: A Benchmark Dataset for Reading Order Detection☆117Aug 26, 2024Updated last year
- ☆11Jan 23, 2019Updated 7 years ago
- MQTT Client implemented by C# and Paho project☆11Jun 22, 2015Updated 10 years ago
- 腾讯云移动直播(MLVB) Cordova 插件☆10Sep 5, 2017Updated 8 years ago
- Image dataset augmentation for machine learning☆14Jun 8, 2023Updated 2 years ago
- This repo consists of my implementation of DocFormerV2☆11Mar 31, 2024Updated last year
- This project has included related source codes and datasets of our EMNLP2021 paper☆10May 28, 2022Updated 3 years ago
- 【CVPR 2025】SemiETS: Integrating Spatial and Content Consistencies for Semi-Supervised End-to-end Text Spotting☆16Jul 1, 2025Updated 8 months ago
- Accelerating GOT-OCRv2 with VLLM☆11Nov 15, 2024Updated last year
- Knowledge extraction from semi-structured web.☆13Mar 25, 2024Updated last year
- Code for EMNLP'20 paper "When Hearst Is not Enough: Improving Hypernymy Detection from Corpus with Distributional Models"☆11Nov 10, 2020Updated 5 years ago
- A repository dedicated to learning about ChatGPT training techniques and related knowledge. Contains study notes, code snippets, and reso…☆12Dec 14, 2024Updated last year
- The official code for "OG-HFYOLO :Orientation Gradient Guidance and Heterogeneous Feature Fusion For Deformation Table Cell Instance Segm…☆13Jul 28, 2025Updated 7 months ago
- Citation Extraction and Classifier☆16Jan 15, 2026Updated last month
- The Science knowledge graph ontologies, a.k.a. SKGO, is a suite of OWL ontology models to capture the knowledge of scientific research da…☆15Jul 3, 2025Updated 8 months ago
- A prompt set of ChatGLM-6B☆15Jul 21, 2023Updated 2 years ago
- Mort's dotfiles☆20Feb 14, 2026Updated 2 weeks ago
- Learning Ontologies Via Embeddings☆12Jul 6, 2023Updated 2 years ago
- Young Labeled Faces in the Wild (YLFW): A Dataset for Children Faces Recognition☆14Sep 18, 2024Updated last year
- ☆22Oct 25, 2025Updated 4 months ago
- Parse eslint output into git-blame to unveil the offenders☆14Jan 26, 2017Updated 9 years ago
- Experimental exploration of skills discovery and distribution through MCP primitives. Maintained by the Skills Over MCP Interest Group.☆41Updated this week
- unofficial impelement of the webformer: The Web-page Transformer for Structure Information Extraction☆13Apr 20, 2023Updated 2 years ago
- ☆14Apr 18, 2020Updated 5 years ago
- Implementation of the HTCPCP Protocol☆14Feb 3, 2013Updated 13 years ago
- A Deeplearn Model to rec table in photo with ncnn. 一个深度学习模型用于检测图片中的表格 画像内のテーブルを検出するためのディープラーニング モデル☆20Mar 2, 2025Updated last year
- This is the code repo for the paper <UTC-IE: A Unified Token-pair Classification Architecture for Information Extraction>☆15Aug 10, 2023Updated 2 years ago
- An integration package connecting PyMuPDF4LLM to LangChain☆16Nov 23, 2025Updated 3 months ago
- My attempt to improve the speed of the newton schulz algorithm, starting from the dion implementation.☆32Dec 5, 2025Updated 2 months ago
- ☆57Jan 23, 2024Updated 2 years ago
- Github repo for ICLR-2025 paper, Fine-tuning Large Language Models with Sparse Matrices☆24Feb 2, 2026Updated last month
- ☆16Dec 10, 2023Updated 2 years ago
- ☆13Jun 20, 2022Updated 3 years ago
- All in one PDF Parser Toolkit☆16Sep 15, 2023Updated 2 years ago
- A Notebook App on Ethereum Network☆11Feb 23, 2026Updated last week