zhaowc-ustc / TabPedia

This repository is the codebase of TabPedia: Towards Comprehensive Visual Table Understanding with Concept Synergy

☆15

Related projects ⓘ

Alternatives and complementary repositories for TabPedia

Yuliang-Liu / SPTSv2
☆22Updated last year
bytedance / E2STR
The official code for the CVPR 2024 paper: Multi-modal In-Context Learning Makes an Ego-evolving Scene Text Recognizer
☆45Updated 5 months ago
ayumiymk / DiG
Official PyTorch implementation of `Reading and Writing: Discriminative and Generative Modeling for Self-Supervised Text Recognition`
☆67Updated last year
wenwenyu / TCM
Turning a CLIP Model into a Scene Text Detector (CVPR2023) | Turning a CLIP Model into a Scene Text Spotter (TPAMI)
☆181Updated 5 months ago
shannanyinxiang / SPTS
Official implementation of SPTS: Single-Point Text Spotting (ACM MM 2022 Oral)
☆137Updated last year
clovaai / units
☆72Updated last year
Mountchicken / Union14M
[ICCV 2023] Code base for Revisiting Scene Text Recognition: A Data Perspective
☆170Updated last year
MAEHCM / ICL-D3IE
Code for ICCV 2023 Paper : “ICL-D3IE: In-Context Learning with Diverse Demonstrations Updating for Document Information Extraction”
☆50Updated last year
weijiawu / TransDETR
[IJCV 2024] TransDETR: End-to-end Video Text Spotting with Transformer
☆102Updated 7 months ago
xdxie / WordArt
The official code of CornerTransformer (ECCV 2022, Oral) on top of MMOCR.
☆138Updated last year
ymy-k / DPText-DETR
[AAAI'23 Oral] DPText-DETR: Towards Better Scene Text Detection with Dynamic Points in Transformer
☆174Updated last year
johnning2333 / M2Doc
☆31Updated 5 months ago
mxin262 / Bridging-Text-Spotting
(CVPR 2024) Bridging the Gap Between End-to-End and Two-Step Text Spotting.
☆50Updated 5 months ago
mxin262 / ESTextSpotter
(ICCV 2023) ESTextSpotter: Towards Better Scene Text Spotting with Explicit Synergy in Transformer
☆72Updated 7 months ago
FangShancheng / ABINet-PP
ABINet++: Autonomous, Bidirectional and Iterative Language Modeling for Scene Text Spotting
☆80Updated last year
bytedance / oclip
☆50Updated 2 years ago
yh-hust / PDF-Wukong
【ArXiv】PDF-Wukong: A Large Multimodal Model for Efficient Long PDF Reading with End-to-End Sparse Sampling
☆98Updated last month
mlpc-ucsd / TESTR
(CVPR 2022) Text Spotting Transformers
☆179Updated last year
wangyuxin87 / VisionLAN
A PyTorch implementation of "From Two to One: A New Scene Text Recognizer with Visual Language Modeling Network" (ICCV2021)
☆98Updated 2 years ago
weijiawu / BOVText-Benchmark
[NeurIPS2021] BOVText: A Large-Scale, Multidimensional Multilingual Dataset for Video Text Spotting
☆67Updated last year
ZZR8066 / SEM
☆13Updated last year
large-ocr-model / large-ocr-model.github.io
☆156Updated 8 months ago
lanfeng4659 / STR-TDSL
☆82Updated last year
Yuliang-Liu / Open-Oracle
AI-assisted Deciphering Oracle Bone Script
☆38Updated 2 months ago
yufanchen96 / RoDLA
RoDLA: Benchmarking the Robustness of Document Layout Analysis Models
☆28Updated 7 months ago
shannanyinxiang / PageNet
Official implementation of PageNet (IJCV 2022)
☆78Updated 2 years ago
HCIILAB / M6Doc
☆106Updated 9 months ago
weijiawu / TransVTSpotter
A new video text spotting framework with Transformer
☆77Updated 2 years ago
SCUT-DLVCLab / GPT-4V_OCR
Evaluation of the Optical Character Recognition (OCR) capabilities of GPT-4V(ision)
☆121Updated last year
fh2019ustc / DeepEraser
The official code for “DeepEraser: Deep Iterative Context Mining for Generic Text Eraser”, TMM, 2024.
☆28Updated 2 months ago