SJTU-DeepVisionLab / PosFormerLinks

[ECCV2024] PosFormer: Recognizing Complex Handwritten Mathematical Expression with Position Forest Transformer

☆82

Alternatives and similar repositories for PosFormer

Users that are interested in PosFormer are comparing it to the libraries listed below

Sorting:

qingzhenduyu / TAMER
Official implementation for AAAI 2025 paper: TAMER: Tree-Aware Transformer for Handwritten Mathematical Expression Recognition
☆31Updated last week
PriNing / ODM
ODM: A Text-Image Further Alignment Pre-training Approach for Scene Text Detection and Spotting
☆39Updated 3 months ago
shannanyinxiang / UPOCR
Official implementation of UPOCR: Towards unified pixel-level OCR interface (ICML 2024)
☆59Updated last year
SCUT-DLVCLab / MegaHan97K
[PR 2025] The official GitHub page of "MegaHan97K: A Large-Scale Dataset for Mega-Category Chinese Character Recognition with over 97K Ca…
☆62Updated 3 weeks ago
bytedance / E2STR
The official code for the CVPR 2024 paper: Multi-modal In-Context Learning Makes an Ego-evolving Scene Text Recognizer
☆53Updated last year
TenMilesLotus / DTSM
Code and data for the paper: DTSM: Toward Dense Table Structure Recognition with Text Query Encoder and Adjacent Feature Aggregator
☆12Updated last year
Green-Wood / CoMER
Official implementation for ECCV 2022 paper "CoMER: Modeling Coverage for Transformer-based Handwritten Mathematical Expression Recogniti…
☆125Updated 2 years ago
tal-tech / SAN
Syntax-Aware Network for Handwritten Mathematical Expression Recognition
☆94Updated 2 years ago
TongkunGuan / Text-Related-Papers
Update the latest text-related papers from top conferences
☆25Updated 4 months ago
liuzhuang1024 / SAM
Semantic Graph Representation Learning for Handwritten Mathematical Expression Recognition (ICDAR 2023)
☆15Updated last year
HCIILAB / LAST
Read Ten Lines at One Glance: Line-Aware Semi-Autoregressive Transformer for Multi-Line Handwritten Mathematical Expression Recognition
☆28Updated last year
qingzhenduyu / ICAL
Official implementation for ICDAR 2024 Oral paper "ICAL: Implicit Character-Aided Learning for Enhanced Handwritten Mathematical Expressi…
☆27Updated 11 months ago
shannanyinxiang / ViTEraser
Official implementation of ViTEraser: Harnessing the Power of Vision Transformers for Scene Text Removal with SegMIM Pretraining (AAAI 20…
☆55Updated last year
CyrilSterling / LPV
The official code of Linguistic More: Taking a Further Step toward Efficient and Accurate Scene Text Recognition (IJCAI2023)
☆26Updated last year
Mountchicken / Union14M
[ICCV 2023] Code base for Revisiting Scene Text Recognition: A Data Perspective
☆189Updated last year
mxin262 / Bridging-Text-Spotting
(CVPR 2024) Bridging the Gap Between End-to-End and Two-Step Text Spotting.
☆66Updated last year
wenwenyu / TCM
Turning a CLIP Model into a Scene Text Detector (CVPR2023) | Turning a CLIP Model into a Scene Text Spotter (TPAMI)
☆193Updated last year
zzyhlyoko / DCTC
☆42Updated last year
yufanchen96 / RoDLA
RoDLA: Benchmarking the Robustness of Document Layout Analysis Models
☆36Updated 4 months ago
ispamm / NAF-DPM
NAF-DPM: A Nonlinear Activation-Free Diffusion Probabilistic Model for Document Enhancement
☆45Updated last year
wzx99 / CLIPOCR
☆38Updated last year
XiiZhao / cbn.pytorch
Official PyTorch implementation of "CBNet: A Plug-and-Play Network for Segmentation-Based Scene Text Detection"
☆21Updated last year
fh2019ustc / DeepEraser
The official code for “DeepEraser: Deep Iterative Context Mining for Generic Text Eraser”, TMM, 2024.
☆42Updated 11 months ago
xdxie / WordArt
The official code of CornerTransformer (ECCV 2022, Oral) on top of MMOCR.
☆145Updated 2 years ago
SCUT-DLVCLab / OCR-Reasoning
[arXiv: 2505.17163] OCR-Reasoning Benchmark: Unveiling the True Capabilities of MLLMs in Complex Text-Rich Image Reasoning
☆60Updated 2 months ago
mxin262 / ESTextSpotter
(ICCV 2023) ESTextSpotter: Towards Better Scene Text Spotting with Explicit Synergy in Transformer
☆77Updated last year
retsuh-bqw / SRFormer-Text-Det
[AAAI 2024] SRFormer: Text Detection Transformer with Incorporated Segmentation and Regression
☆66Updated 5 months ago
lcy0604 / CTRNet
This repository is the implementation of "Don't Forget Me: Accurate Background Recovery for Text Removal via Modeling Local-Global Contex…
☆87Updated 2 years ago
HCIILAB / MSDS
[NeurIPS 2022 Spotlight] The official GitHub page of "MSDS: A Large-Scale Chinese Signature and Token Digit String Dataset for Handwritin…
☆87Updated this week
csguoh / LEMMA
[IJCAI2023] Your text images can be more clearer!
☆58Updated last year