phucty/wtabhtml

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/phucty/wtabhtml)

phucty / wtabhtml

Tool to parse wiki tables from the HTML dump of Wikipedia

☆11

Alternatives and similar repositories for wtabhtml

Users that are interested in wtabhtml are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

namtuanly / WikiTableSet
View on GitHub
WikiTableSet: A largest publicly available image-based table recognition dataset in three languages built from Wikipedia
☆32Jun 12, 2025Updated last year
jzbjyb / OmniTab
View on GitHub
Pretraining with Natural and Synthetic Data for Few-shot Table-based Question Answering
☆31Dec 2, 2022Updated 3 years ago
namtuanly / MTL-TabNet
View on GitHub
MTL-TabNet: Multi-task Learning based Model for Image-based Table Recognition
☆103May 30, 2024Updated 2 years ago
koikezlemma / phd_thesis_template
View on GitHub
PhD thesis template for Sokendai students
☆13Dec 8, 2015Updated 10 years ago
AILab-UniFI / cte-dataset
View on GitHub
CTE: Contextualized Table Extraction Dataset
☆17Feb 23, 2023Updated 3 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
MaxKinny / TabRecSet
View on GitHub
A large scale camera-taken table detection and recognition dataset.
☆150Apr 9, 2026Updated 3 months ago
abdoelsayed2016 / Table-Detection-Structure-Recognition
View on GitHub
https://dl.acm.org/doi/10.1145/3657281
☆97Apr 25, 2024Updated 2 years ago
uunnhh / TextMountain
View on GitHub
TextMountain
☆23Oct 25, 2020Updated 5 years ago
IBM / SynthTabNet
View on GitHub
Dataset of PNG images from synthetically generated table layouts with annotations in JSONL files
☆154Sep 17, 2025Updated 10 months ago
KnowledgeBaseCompleter / eval-ConvKB
View on GitHub
☆13Oct 18, 2019Updated 6 years ago
IITB-LEAP-OCR / SPRINT
View on GitHub
SPRINT: Script-agnostic Structure Recognition in Tables
☆16Mar 26, 2025Updated last year
wangwen-whu / WTW-Dataset
View on GitHub
This is an official implementation for the WTW Dataset in "Parsing Table Structures in the Wild " on table detection and table structure …
☆184Sep 15, 2021Updated 4 years ago
phucty / wikidb
View on GitHub
WikiDB: Build a DB (key-value store - LMDB style) from Wikidata dump
☆25Nov 26, 2022Updated 3 years ago
marcotchen / SimpleGPT
View on GitHub
[ICML 2026] Improving GPT via a simple normalization strategy
☆15May 22, 2026Updated last month
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
Chunchunwumu / SEMv3
View on GitHub
The official PyTorch implementation of SEMv3.
☆53May 26, 2024Updated 2 years ago
usc-isi-i2 / sand
View on GitHub
Semantic ANotation of tabular Data
☆24Dec 9, 2025Updated 7 months ago
bernhardschaefer / handwritten-diagram-datasets
View on GitHub
☆20Sep 1, 2022Updated 3 years ago
DWCTOD / awesome-computer-vision
View on GitHub
A curated list of awesome computer vision resources（深度学习、计算机视觉优质资料整理），包含各个视觉方向，常用的框架使用手册，经典的教程代码实战和公式推导
☆20Jun 3, 2020Updated 6 years ago
WenmuZhou / TableGeneration
View on GitHub
通过浏览器渲染生成表格图像
☆238Apr 10, 2024Updated 2 years ago
ddddwee1 / Octave-Convolution
View on GitHub
https://arxiv.org/abs/1904.05049
☆13Apr 23, 2019Updated 7 years ago
JiaquanYe / MASTER-mmocr
View on GitHub
Re-implementation of MASTER by mmocr
☆90Sep 9, 2021Updated 4 years ago
L597383845 / row-col-table-recognition
View on GitHub
time-series row column classification
☆14Jan 7, 2022Updated 4 years ago
HagopB / cyclegan
View on GitHub
Keras implementation of CycleGAN
☆13Dec 11, 2017Updated 8 years ago
Simple, predictable pricing with DigitalOcean hosting • Ad
Always know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
XH-B / ABM
View on GitHub
☆105Aug 22, 2024Updated last year
phucty / mtab_tool
View on GitHub
MTab: Entity Search and Table Annotation with Wikidata, Wikipedia, and DBpedia
☆32May 30, 2022Updated 4 years ago
Yuxiang1995 / ICDAR2021_MFD
View on GitHub
1st Solution For ICDAR 2021 Competition on Mathematical Formula Detection（公式检测冠军方案）
☆134Sep 4, 2023Updated 2 years ago
tal-tech / SAN
View on GitHub
Syntax-Aware Network for Handwritten Mathematical Expression Recognition
☆103Feb 21, 2023Updated 3 years ago
LARS-research / TREFE
View on GitHub
Searching a High Performance Feature Extractor for Text Recognition Network. TPAMI 2022
☆13Nov 25, 2022Updated 3 years ago
Hambaobao / Marathon
View on GitHub
Marathon: A Multiple-choice Long Context Evaluation Benchmark for Large Language Models.
☆10May 16, 2024Updated 2 years ago
hjbplayer / HAM
View on GitHub
The code of 《HAM: Hidden Anchor Mechanism for Scene Text Detection》
☆11Sep 22, 2020Updated 5 years ago
the-qa-company / WikibaseSync
View on GitHub
Library to copy entities from one Wikibase to another and to keep them in sync
☆34Sep 10, 2025Updated 10 months ago
helayzhang / utf8unicode
View on GitHub
Convert utility between utf8 and unicode for C++.
☆15Sep 9, 2013Updated 12 years ago
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
sarkhelritesh / vrd_resource
View on GitHub
☆22May 5, 2021Updated 5 years ago
cuppersd / table_recognition
View on GitHub
表格线检测
☆27Sep 3, 2019Updated 6 years ago
felix-schmitt / MathNet
View on GitHub
MathNet: A Data-Centric Approach, Dataset and Benchmark Model to Advance Mathematical Expression Recognition
☆10Mar 19, 2025Updated last year
ronghanghu / vit_10b_fsdp_example
View on GitHub
See details in https://github.com/pytorch/xla/blob/r1.12/torch_xla/distributed/fsdp/README.md
☆25Dec 22, 2022Updated 3 years ago
pyxploiter / deep-splerge
View on GitHub
Implementation of research paper "Deep Splitting and Merging for Table Structure Decomposition"
☆61Nov 9, 2022Updated 3 years ago
Tan-Junwen / awesome-table-structure-recognition
View on GitHub
A Curated List of Awesome Table Structure Recognition (TSR) Research. Including models, papers, datasets and codes. Continuously updating…
☆232Sep 9, 2024Updated last year
jfma-USTC / HRDoc
View on GitHub
Dataset and scripts for HRDoc
☆42Jun 21, 2023Updated 3 years ago