data-liberation / data-liberation-resources
liberate all kinds of data from PDF and other unstructural format and make the information machine-readable and visualizeable for popular tools.
☆31Updated 6 years ago
Alternatives and similar repositories for data-liberation-resources:
Users that are interested in data-liberation-resources are comparing it to the libraries listed below
- table understanding dataset for comparative evaluation of different table understanding algorithms☆14Updated 6 years ago
- AI_DocumentLayoutAnalysis☆38Updated 4 years ago
- ☆87Updated 5 years ago
- Framework for information extraction from tables☆41Updated 5 years ago
- ☆69Updated 6 years ago
- ☆38Updated 4 years ago
- ICDAR 2021 Competition on Scientific Literature Parsing☆34Updated 4 years ago
- A tool for extracting arbitrary tables from untagged PDF documents☆38Updated 4 years ago
- OCR & Ground Truth Resources☆74Updated 2 years ago
- ☆93Updated 4 years ago
- Java command-line tools for comparing results to ground truth for table location and structure detection as used in the ICDAR 2013 Table …☆33Updated 4 years ago
- Optical table recognition - recognize tables in scan images using OpenCV☆111Updated 5 years ago
- Table Extraction Tool☆90Updated 7 years ago
- The ICDAR 2019 cTDaR is to evaluate the performance of methods for table detection (TRACK A) and table recognition (TRACK B). For the fir…☆173Updated 2 years ago
- detect the table image in pdf or other format image by opencv and python .☆53Updated 5 years ago
- This repository contains a 403 images dataset for table detection in documents.☆83Updated 6 years ago
- ReadingBank: A Benchmark Dataset for Reading Order Detection☆104Updated 7 months ago
- ☆22Updated 5 years ago
- baike schema crawler for baidu baike , hudongbaike. 面向百度百科与互动百科的概念分类体系抓取脚本☆36Updated 6 years ago
- Tools for extract figure, table, text, .. from a pdf document.☆32Updated 4 years ago
- 1st Solution For ICDAR 2021 Competition on Mathematical Formula Detection(公式检测冠军方案)☆130Updated last year
- PDF table extraction☆10Updated 3 years ago
- Extract templated Open Information Extraction☆16Updated 7 years ago
- ☆78Updated 2 years ago
- A tutorial on the PyTorch-based ocropus components.☆73Updated 4 years ago
- DFKI Layout Detection for OCR-D☆47Updated this week
- Publicly released code for the LAMBERT model☆103Updated 3 years ago
- Layout Analysis Evaluator for the ICDAR 2017 competition on Layout Analysis for Challenging Medieval Manuscripts☆22Updated 5 years ago
- A dataset of region-annotated scientific articles.☆21Updated 5 years ago
- SegPhrase working on Chinese and Arabic☆35Updated 8 years ago