data-liberation / data-liberation-resourcesLinks
liberate all kinds of data from PDF and other unstructural format and make the information machine-readable and visualizeable for popular tools.
☆31Updated 7 years ago
Alternatives and similar repositories for data-liberation-resources
Users that are interested in data-liberation-resources are comparing it to the libraries listed below
Sorting:
- table understanding dataset for comparative evaluation of different table understanding algorithms☆14Updated 7 years ago
- baike schema crawler for baidu baike , hudongbaike. 面向百度百科与互动百科的概念分类体系抓取脚本☆37Updated 7 years ago
- 中文环境突发事件语料库(Chinese Environment Emergency Corpus)-上海大学-语义智能实验室☆46Updated 9 years ago
- Framework for information extraction from tables☆41Updated 6 years ago
- Extract templated Open Information Extraction☆17Updated 8 years ago
- ☆23Updated 5 years ago
- ☆69Updated 7 years ago
- ☆95Updated 5 years ago
- SegPhrase working on Chinese and Arabic☆36Updated 8 years ago
- detect the table image in pdf or other format image by opencv and python .☆54Updated 5 years ago
- Tools for extract figure, table, text, .. from a pdf document.☆32Updated 4 years ago
- 基于CEC语料库挖掘要素识别规则,对新闻报道类生语料进行自动标注☆20Updated 10 years ago
- ReadingBank: A Benchmark Dataset for Reading Order Detection☆107Updated 11 months ago
- 医疗语料库。医疗机构名语料库。药品本位码。☆70Updated last year
- Optical table recognition - recognize tables in scan images using OpenCV☆112Updated 6 years ago
- ☆40Updated 4 years ago
- Summary of Responses to Questionnaire on Annotation Platform https://forms.gle/iZk8kehkjAWmB8xe9☆60Updated 5 years ago
- An open-source classical Chinese information processing toolkit developed by Tsinghua Natural Language Processing Group☆51Updated 6 years ago
- Table Extraction Tool☆90Updated 7 years ago
- MNBVC项目-ShareGPT语料清洗☆15Updated last year
- ICDAR 2021 Competition on Scientific Literature Parsing☆35Updated 4 years ago
- ☆81Updated 3 years ago
- BlackLab Frontend, a feature-rich corpus search interface for BlackLab.☆22Updated 2 weeks ago
- ☆87Updated 5 years ago
- A tool for extracting arbitrary tables from untagged PDF documents☆39Updated 4 years ago
- EventKGNELL, event knowlege graph never end learning system, a event-centric knowledge base search system,实时事理逻辑知识库终身学习系统项目和事件为核心的知识库搜索系统…☆72Updated 5 years ago
- Data collection, alignment and TAUS repository☆23Updated 7 years ago
- PDF table extraction☆10Updated 3 years ago
- 物种名称语料库。植物名,动物名。☆50Updated last year
- WordForm,针对中文词语的笔画拆解,偏旁查询,拼音转换接口☆65Updated 6 years ago