This repository holds the annotated spreadsheet files that make up the DECO dataset.
☆13Mar 21, 2019Updated 7 years ago
Alternatives and similar repositories for deco_dataset
Users that are interested in deco_dataset are comparing it to the libraries listed below.
- ☆48Oct 6, 2025Updated 6 months ago
- ☆10Oct 31, 2019Updated 6 years ago
- Code and experiment data for ICDM'19 paper, tabular cell classification using pre-trained cell embeddings. Note that the code and data is…☆29Jul 6, 2023Updated 2 years ago
- SOTA on TabFact: Graph Neural Network for Table-based Fact Checking☆18Dec 10, 2020Updated 5 years ago
- 🌮 Table-based KB Completer☆16Mar 13, 2024Updated 2 years ago
- Official code repository for Findings of EMNLP 2022 paper: PseudoReasoner: Leveraging Pseudo Labels for Commonsense Knowledge Base Popula…☆11Oct 18, 2022Updated 3 years ago
- init☆13Feb 3, 2021Updated 5 years ago
- source code and data☆15Jan 16, 2019Updated 7 years ago
- ☆26May 24, 2018Updated 7 years ago
- Spreadsheets from the Enron Corpus☆39Oct 16, 2022Updated 3 years ago
- This repository contains code and data for reproducing the experiments of three papers that focus on two subtasks of table annotation: co…☆12Mar 5, 2025Updated last year
- Code to extract functional dependencies (FDs) and conditional functional dependencies (CFDs) from data☆37Mar 24, 2021Updated 5 years ago
- Neural Solr = Solr 9 + Mighty Inference + Node☆18Jun 9, 2022Updated 3 years ago
- This repository contains code and data for the paper "TableEval: A Real-World Benchmark for Complex, Multilingual, and Multi-Structured T…☆28Jun 12, 2025Updated 10 months ago
- Code and data for the VLDB 2023 paper: RECA: Related Tables Enhanced Column Semantic Type Annotation Framework☆12May 7, 2025Updated 11 months ago
- Glottolog data as CLDF StructureDataset☆16Mar 2, 2026Updated last month
- Create a QnA bot on a pdf☆16May 27, 2023Updated 2 years ago
- ☆19Sep 3, 2024Updated last year
- ☆17Dec 8, 2022Updated 3 years ago
- This repository contains the code for the perspective paper "Multimodal Neural Databases" accepted at SIGIR 2023.☆20Nov 19, 2024Updated last year
- ☆28May 27, 2024Updated last year
- "Head-to-Tail How Knowledgeable are Large Language Models (LLMs)? A.K.A. Will LLMs Replace Knowledge Graphs?" (NAACL 2024)☆19Jul 1, 2024Updated last year
- Code for the paper "PICK: Processing Key Information Extraction from Documents using Improved Graph Learning-Convolutional Networks" (ICP…☆10Nov 20, 2020Updated 5 years ago
- ArcheType uses LLMs to automatically assign custom labels to your tabular data☆19May 21, 2025Updated 10 months ago
- CVPR 2022: Table Structure Recognition☆40Apr 19, 2022Updated 3 years ago
- GloSAT Historical Measurement Table Dataset☆11Dec 3, 2025Updated 4 months ago
- Code and data for HEF, published in The Web Conference 2021.☆16Mar 31, 2021Updated 5 years ago
- The source code of the Sudowoodo paper in ICDE 2023☆18May 24, 2023Updated 2 years ago
- This robot processes randomly generated PDF invoices with Amazon Textract and saves the extracted invoice data in an Excel file.☆14Jan 27, 2023Updated 3 years ago
- ☆16Sep 6, 2022Updated 3 years ago
- JavaScript library for getting geojson from the Wikipedia API☆22Sep 25, 2015Updated 10 years ago
- Unofficial implementation of the paper "MetaHTR: Towards Writer-Adaptive Handwritten Text Recognition" by Bhunia et al. (2021).☆13Jun 22, 2022Updated 3 years ago
- Some realistic tabular datasets for testing (CSV)☆21Mar 7, 2018Updated 8 years ago
- Jupyter server proxy for OpenRefine☆10Oct 18, 2024Updated last year
- Python library for validating and managing binary array linked data files, e.g. HDF, netCDF.☆12Oct 14, 2022Updated 3 years ago
- PDF Extraction Toolkit (wraps and trains LayoutLM)☆10Oct 8, 2021Updated 4 years ago
- Evaluation Tool for the ICDAR 2019 Competition on Table Detection and Recognition☆42May 8, 2022Updated 3 years ago
- Code for "Memory Efficient Meta-Learning with Large Images"☆11Nov 24, 2021Updated 4 years ago
- ☆54Jan 18, 2023Updated 3 years ago