sakura2233565548/TabPedia

This repository is the codebase of TabPedia: Towards Comprehensive Visual Table Understanding with Concept Synergy

☆51

Alternatives and similar repositories for TabPedia

Users who are interested in TabPedia are comparing it to the repositories listed below.

TenMilesLotus / DTSM
Code and data for the paper: DTSM: Toward Dense Table Structure Recognition with Text Query Encoder and Adjacent Feature Aggregator
☆13 · Apr 28, 2024 · Updated 2 years ago
ZZR8066 / SEM
☆19 · Mar 10, 2023 · Updated 3 years ago
bytedance / WildDoc
The official repo for “WildDoc: How Far Are We from Achieving Comprehensive and Robust Document Understanding in the Wild?”
☆74 · May 19, 2025 · Updated last year
fh2019ustc / DeepEraser
The official code for “DeepEraser: Deep Iterative Context Mining for Generic Text Eraser”, TMM, 2024.
☆52 · Aug 26, 2024 · Updated last year
adobe-research / pdftriage
☆16 · Oct 6, 2024 · Updated last year
SpursGoZmy / Tabular-LLM
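This project aims to collect open-source datasets for table intelligence tasks (such as table question answering and table-to-text generation), convert the raw data into an instruction-tuning format, and fine-tune LLMs to strengthen their understanding of tabular data, ultimately building a large language model specialized for table intelligence tasks.
☆643 · Apr 22, 2024 · Updated 2 years ago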
bytedance / MTVQA
MTVQA: Benchmarking Multilingual Text-Centric Visual Question Answering. A comprehensive evaluation of multimodal large model multilingua…
☆64 · May 15, 2025 · Updated last year
bytedance / E2STR
The official code for the CVPR 2024 paper: Multi-modal In-Context Learning Makes an Ego-evolving Scene Text Recognizer
☆55 · Jun 14, 2024 · Updated 2 years ago
Dawars / DocMAE
Unofficial implementation of DocMAE (WIP): Document Image Rectification via Self-supervised Representation Learning
☆20 · Dec 20, 2023 · Updated 2 years ago
SpursGoZmy / Table-LLaVA
Dataset and Code for our ACL 2024 paper: "Multimodal Table Understanding". We propose the first large-scale Multimodal IFT and Pre-Train …
☆227 · Jun 12, 2025 · Updated last year
lqzxt / NGTR
☆14 · May 26, 2025 · Updated last year
Chunchunwumu / SEMv3
The official PyTorch implementation of SEMv3.
☆53 · May 26, 2024 · Updated 2 years ago
FelixHertlein / inv3d
Project page for the ICDAR 2023 Paper "Inv3D: a high-resolution 3D invoice dataset for template-guided single-image document unwarping".
☆13 · Dec 21, 2023 · Updated 2 years ago
Xiaomeng-Yang / STR_benchmark_cleansed
☆14 · May 26, 2023 · Updated 3 years ago
SII-sc22mc / DocFusion
A Unified Framework for Document Parsing Tasks (Including Document Layout Analysis, OCR, Formula Recognition, and Table Recognition)
☆15 · Jul 1, 2025 · Updated last year
neulab / VisualPuzzles
☆18 · Nov 30, 2025 · Updated 7 months ago
wzx99 / CLIPOCR
☆38 · Oct 20, 2023 · Updated 2 years ago
fh2019ustc / DocTr
The official code for “DocTr: Document Image Transformer for Geometric Unwarping and Illumination Correction”, ACM MM, Oral Paper, 2021.
☆436 · Jul 10, 2026 · Updated last week
keepfoolisher / My-DocTr-Plus
☆43 · Nov 13, 2023 · Updated 2 years ago
neumason / Font-Component
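Western scholars generally approach Chinese characters through their components; this repository provides detailed documentation and a database of Chinese character component decomposition.
☆13 · Jul 20, 2023 · Updated 3 years ago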
husterpzh / PSSR
Official code for the paper: "Perception and Semantic Aware Regularization for Sequential Confidence Calibration (CVPR2023)"
☆10 · May 15, 2024 · Updated 2 years ago
USTCAGI / Awesome-LLM-Table-Mining
☆45 · Mar 19, 2025 · Updated last year
MaxKinny / TabRecSet
A large scale camera-taken table detection and recognition dataset.
☆150 · Apr 9, 2026 · Updated 3 months ago
superxjm / HybridTransparentRecon
☆13 · May 26, 2023 · Updated 3 years ago
WenmuZhou / TableGeneration
Generates table images by rendering them in a browser.
☆238 · Apr 10, 2024 · Updated 2 years ago
Hanzhang-lang / ALTER
Official implementation of the paper "ALTER: Augmentation for Large-Table-Based Reasoning"
☆15 · Aug 26, 2024 · Updated last year
HollyLee2000 / SeBoW-paddle
This is the Paddle code for SeBoW (Self-Born wiring for neural trees), a kind of neural tree born from a large search space
☆11 · Dec 10, 2021 · Updated 4 years ago
fh2019ustc / SimFIR
The official code for “SimFIR: A Simple Framework for Fisheye Image Rectification with Self-supervised Representation Learning”, ICCV, 20…
☆33 · Jul 21, 2024 · Updated last year
maoyunyao / JOINT
Official implementation of the ICCV 2021 paper "Joint Inductive and Transductive Learning for Video Object Segmentation"
☆32 · Aug 27, 2021 · Updated 4 years ago
PigeonDan1 / ps-slm
TASU: A New Style of Alignment of Speech LLM with only Text Training Data, zero-shot on ASR and Other SU tasks
☆27 · Jan 19, 2026 · Updated 6 months ago
harrytea / UDoc-GAN
Official PyTorch implementation for ACM MM22 "UDoc-GAN: Unpaired Document Illumination Correction with Background Light Prior"
☆25 · Aug 5, 2024 · Updated last year
2bgm / KIE-HVQA
☆13 · Jun 10, 2025 · Updated last year
claudiom4sir / MdVRNet
[VISAPP 2022] MdVRNet: Deep Video Restoration under Multiple Distortions
☆12 · Aug 7, 2024 · Updated last year
bytedance / SPTSv2
The official implementation of SPTS v2: Single-Point Text Spotting
☆138 · Jun 29, 2023 · Updated 3 years ago
none1113350 / CMMTR_resilience-assessment-for-BRTN
A project on resilience assessment for a bus-rail transit network
☆14 · Jul 4, 2023 · Updated 3 years ago
MCC-WH / Token
Official implementation of the AAAI 2022 paper "Learning Token-based Representation for Image Retrieval"
☆70 · Feb 11, 2023 · Updated 3 years ago
LayTextLLM / LayTextLLM
☆103 · Dec 23, 2024 · Updated last year
360AILAB-NLP / 360LayoutAnalysis
360LayoutAnalysis, a series of document analysis models and datasets developed by 360 AI Research Institute
☆305 · Sep 10, 2024 · Updated last year
xhli-git / DocSAM
☆33 · Apr 8, 2025 · Updated last year