data-liberation / data-liberation-resources
liberate all kinds of data from PDF and other unstructural format and make the information machine-readable and visualizeable for popular tools.
☆31Updated 6 years ago
Alternatives and similar repositories for data-liberation-resources:
Users that are interested in data-liberation-resources are comparing it to the libraries listed below
- ICDAR 2021 Competition on Scientific Literature Parsing☆34Updated 4 years ago
- table understanding dataset for comparative evaluation of different table understanding algorithms☆14Updated 6 years ago
- ☆12Updated 4 years ago
- ☆87Updated 5 years ago
- Code for ICPR2022 paper: "Graph Neural Networks and Representation Embedding for table extraction in PDF Documents"☆35Updated last year
- ☆69Updated 7 years ago
- ☆79Updated 3 years ago
- Framework for information extraction from tables☆41Updated 6 years ago
- XFUND: A Multilingual Form Understanding Benchmark☆200Updated 2 years ago
- Tools for extract figure, table, text, .. from a pdf document.☆32Updated 4 years ago
- A project about benchmarking and evaluating existing PDF extraction tools on their semantic abilities to extract the body texts from PDF …☆66Updated 4 years ago
- PDF table extraction☆10Updated 3 years ago
- baike schema crawler for baidu baike , hudongbaike. 面向百度百科与互动百科的概念分类体系抓取脚本☆36Updated 7 years ago
- 中文环境突发事件语料库(Chinese Environment Emergency Corpus)-上海大学-语义智能实验室☆46Updated 9 years ago
- Java command-line tools for comparing results to ground truth for table location and structure detection as used in the ICDAR 2013 Table …☆33Updated 4 years ago
- detect the table image in pdf or other format image by opencv and python .☆53Updated 5 years ago
- Extract templated Open Information Extraction☆16Updated 7 years ago
- A tool for extracting arbitrary tables from untagged PDF documents☆38Updated 4 years ago
- Publicly released code for the LAMBERT model☆103Updated 3 years ago
- This repository contains a 403 images dataset for table detection in documents.☆83Updated 6 years ago
- an unofficial code for augment-XY-CUT in XYLayoutLM☆27Updated 2 years ago
- ReadingBank: A Benchmark Dataset for Reading Order Detection☆104Updated 7 months ago
- ☆38Updated 4 years ago
- ☆57Updated 3 years ago
- Table Extraction Tool☆90Updated 7 years ago
- Summary of Responses to Questionnaire on Annotation Platform https://forms.gle/iZk8kehkjAWmB8xe9☆59Updated 4 years ago
- schemakg, a knowledge graph for schema that seeks to cover a range of things as much as possible including entity schema and event schema…☆30Updated 3 years ago
- Layout Analysis Evaluator for the ICDAR 2017 competition on Layout Analysis for Challenging Medieval Manuscripts☆22Updated 5 years ago
- SegPhrase working on Chinese and Arabic☆35Updated 8 years ago
- Functional and structural analysis of tables in research papers (Table disentangling)☆20Updated 7 years ago