ddenron / deco_datasetLinks
This repository holds the annotated spreadsheet files, comprising the DECO dataset.
☆13Updated 6 years ago
Alternatives and similar repositories for deco_dataset
Users that are interested in deco_dataset are comparing it to the libraries listed below
Sorting:
- ☆45Updated 2 months ago
- Code and data for "TURL: Table Understanding through Representation Learning"☆131Updated last month
- Incorporating VIsual LAyout Structures for Scientific Text Classification☆179Updated 2 years ago
- Code and experiment data for ICDM'19 paper, tabular cell classification using pre-trained cell embeddings. Note that the code and data is…☆30Updated 2 years ago
- MTab: Entity Search and Table Annotation with Wikidata, Wikipedia, and DBpedia☆32Updated 3 years ago
- multimodal document analysis☆166Updated last month
- Publicly released code for the LAMBERT model☆103Updated 4 years ago
- TUTA and ForTaP for Structure-Aware and Numerical-Reasoning-Aware Table Pre-Training☆125Updated last week
- Neighborhood Contrastive Learning for Scientific Document Representations with Citation Embeddings (EMNLP 2022 paper)☆74Updated 3 years ago
- TAT-QA (Tabular And Textual dataset for Question Answering) contains 16,552 questions associated with 2,757 hybrid contexts from real-wor…☆121Updated last year
- Zero-shot entity linking with less data☆15Updated 3 years ago
- ☆22Updated 2 years ago
- Structured Prediction for Entity Linking☆38Updated last year
- ☆37Updated 3 years ago
- JSON Schema format for storing datasets details, documents processed contents, and documents annotations in the document understanding do…☆13Updated last year
- ☆40Updated 4 years ago
- [SIGIR 2021] Retrieving Complex Tables with Multi-Granular Graph Representation Learning.☆48Updated 3 years ago
- ☆82Updated 3 years ago
- [ACL 2022] A hierarchical table dataset for question answering and data-to-text generation.☆103Updated last week
- Implementation, trained models and result data for the paper "Pairwise Multi-Class Document Classification for Semantic Relations between…☆31Updated 2 years ago
- Code & Data for the Paper "Time Masking for Temporal Language Models", WSDM 2022☆20Updated 2 years ago
- ☆58Updated 4 years ago
- Implementation of paper: HLATR: Enhance Multi-stage Text Retrieval with Hybrid List Aware Transformer Reranking☆73Updated 2 years ago
- A set of Python scripts for preprocessing the Wikidata JSON dump and running simple queries in an efficient manner.☆138Updated last year
- A tool for extracting arbitrary tables from untagged PDF documents☆40Updated 4 years ago
- Mapping Wikipedia pages to Wikidata IDs and vice versa.☆172Updated 2 years ago
- The Semantic Scholar Search Reranker☆107Updated 5 years ago
- Dataset and code for EMNLP2020 paper "HybridQA: A Dataset of Multi-Hop Question Answeringover Tabular and Textual Data"☆241Updated 2 years ago
- SemEval-2021 Task 8: MeasEval data and other bits☆48Updated 3 years ago
- ReFinED is an efficient and accurate entity linking (EL) system.☆230Updated last year