AILab-UniFI/cte-dataset

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/AILab-UniFI/cte-dataset)

AILab-UniFI / cte-dataset

CTE: Contextualized Table Extraction Dataset

☆17

Alternatives and similar repositories for cte-dataset

Users that are interested in cte-dataset are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

andreagemelli / doc2graph
View on GitHub
Doc2Graph transforms documents into graphs and exploit a GNN to solve several tasks.
☆139Oct 18, 2025Updated 9 months ago
abdoelsayed2016 / Table-Detection-Structure-Recognition
View on GitHub
https://dl.acm.org/doi/10.1145/3657281
☆97Apr 25, 2024Updated 2 years ago
pyxploiter / deep-splerge
View on GitHub
Implementation of research paper "Deep Splitting and Merging for Table Structure Decomposition"
☆61Nov 9, 2022Updated 3 years ago
phucty / wtabhtml
View on GitHub
Tool to parse wiki tables from the HTML dump of Wikipedia
☆11Jun 12, 2022Updated 4 years ago
AILab-UniFI / GNN-TableExtraction
View on GitHub
Code for ICPR2022 paper: "Graph Neural Networks and Representation Embedding for table extraction in PDF Documents"
☆37Jul 13, 2023Updated 3 years ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
yilunzhao / Awsome-Table-Reasoning
View on GitHub
A comprehensive paper list of Reasoning over Tables.
☆30Nov 6, 2022Updated 3 years ago
herobd / FUDGE
View on GitHub
Code for the ICDAR2021 paper "Visual FUDGE: Form Understanding via Dynamic Graph Editing"
☆33Mar 4, 2022Updated 4 years ago
xuewenyuan / ReS2TIM
View on GitHub
ReS2TIM: Reconstruct Syntactic Structures from Table Images
☆23Sep 10, 2020Updated 5 years ago
IBM / SynthTabNet
View on GitHub
Dataset of PNG images from synthetically generated table layouts with annotations in JSONL files
☆154Sep 17, 2025Updated 10 months ago
Form2Seq-Data / Dataset
View on GitHub
Dataset corresponding to the paper: "Form2Seq : A Framework for Higher-Order Form Structure Extraction"
☆10Feb 17, 2021Updated 5 years ago
wanghaisheng / ocr-arxiv-daily
View on GitHub
☆19Jun 7, 2023Updated 3 years ago
gmarus777 / Printed-Latex-Data-Generation
View on GitHub
Python and JS tools to generate Printed LaTex formulas and images
☆16Oct 26, 2023Updated 2 years ago
jtonglet / Numerical-Hybrid-QA-Literature
View on GitHub
A list of Numerical Multimodal reasoning papers and their implementation
☆11May 13, 2024Updated 2 years ago
Line-Kite / GraphLayoutLM
View on GitHub
☆14Sep 6, 2024Updated last year
Virtual machines for every use case on DigitalOcean • Ad
Get dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
uakarsh / TiLT-Implementation
View on GitHub
Implementation of the paper: Going Full-TILT Boogie on Document Understanding with Text-Image-Layout Transformer.
☆18Apr 23, 2023Updated 3 years ago
nehamjain10 / Finding_Tables
View on GitHub
An end to end Deep Learning Solution for table detection and structure recognition
☆12Feb 26, 2021Updated 5 years ago
yakuza8 / first-order-predicate-logic-theorem-prover
View on GitHub
Autonomous Theorem Prover for First Order Predicate Logic
☆12Jun 29, 2020Updated 6 years ago
Bai-YT / AdaptiveSmoothing
View on GitHub
Implementation of the paper "Improving the Accuracy-Robustness Trade-off of Classifiers via Adaptive Smoothing".
☆10Feb 6, 2024Updated 2 years ago
earth2observe / downscaling-tools
View on GitHub
python programs and procedures that facilitate local application of the earth2observe global water resources reanalysis
☆10Nov 21, 2017Updated 8 years ago
daddydrac / NVIDIA-Rapids-NeMo-PyTorch-Tensorboard
View on GitHub
Ultimate NLP Toolkit for GPUs: RAPIDS-AI, PyTorch, NeMo, Tensorboard, TensorRT, CUDA 10.1
☆10Mar 19, 2020Updated 6 years ago
strikingly / blog
View on GitHub
☆10Jan 28, 2016Updated 10 years ago
NeuraSearch / NeurIPS-2022-Submission-3358
View on GitHub
This is the code for the Submission 3358 at NeurIPS 2022.
☆22Dec 21, 2022Updated 3 years ago
wangwen-whu / WTW-Dataset
View on GitHub
This is an official implementation for the WTW Dataset in "Parsing Table Structures in the Wild " on table detection and table structure …
☆184Sep 15, 2021Updated 4 years ago
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
tmbdev-archive / torchmore
View on GitHub
☆25Oct 9, 2022Updated 3 years ago
sidgairo18 / unsupervised-style-learning
View on GitHub
This repository contains the source code, models and data files for the work titled: "Unsupervised Image Style Embeddings for Retrieval a…
☆13May 29, 2021Updated 5 years ago
h-munakata / Lighthouse-Wrapper-for-Audio-Moment-Retrieval
View on GitHub
☆13Mar 23, 2026Updated 4 months ago
zhongwanjun / CARP
View on GitHub
code for the table-based open domain question answering project, with paper title: "Reasoning over Hybrid Chain for Table-and-Text Open D…
☆12Sep 16, 2022Updated 3 years ago
wutong8023 / SpeechRE
View on GitHub
☆11Nov 11, 2022Updated 3 years ago
kjw11 / CSEnet-ASR
View on GitHub
Cross-Speaker Encoding Network for Multi-talker Speech Recognition
☆12Mar 14, 2025Updated last year
doc-analysis / DocBank
View on GitHub
DocBank: A Benchmark Dataset for Document Layout Analysis
☆654Aug 12, 2024Updated last year
sairin1202 / SciXGen
View on GitHub
Dataset and model in the paper "SciXGen: A Scientific Paper Dataset for Context-Aware Text Generation"
☆13Feb 14, 2022Updated 4 years ago
azozello / incrementalDBSCAN
View on GitHub
Py implementation of incremental Density-based spatial clustering of applications with noise
☆12Jul 1, 2018Updated 8 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
priba / graph_metric.pytorch
View on GitHub
Graph Metric Learning in PyTorch
☆10Apr 7, 2021Updated 5 years ago
WenmuZhou / TableGeneration
View on GitHub
通过浏览器渲染生成表格图像
☆238Apr 10, 2024Updated 2 years ago
namtuanly / MTL-TabNet
View on GitHub
MTL-TabNet: Multi-task Learning based Model for Image-based Table Recognition
☆103May 30, 2024Updated 2 years ago
zzh-SJTU / CRT-QA
View on GitHub
The official data and code for EMNLP 2023 main conference paper: CRT-QA: A Dataset of Complex Reasoning Question Answering over Tabular D…
☆13May 19, 2025Updated last year
zhongwanjun / ProQA
View on GitHub
The code for paper "ProQA: Structural Prompt-based Pre-training for Unified Question Answering"
☆11Feb 7, 2023Updated 3 years ago
adsarwate / mergetex
View on GitHub
Script for merging LaTeX files and stripping comments, in preparation for submission to ArXiV
☆10May 23, 2014Updated 12 years ago
zh1qun / aecid_incremental_clustering
View on GitHub
日志增量聚类算法，用于日志异常检测
☆12Aug 20, 2022Updated 3 years ago