Hxyz-123/ReasoningOCR

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/Hxyz-123/ReasoningOCR)

Hxyz-123 / ReasoningOCR

☆18

Alternatives and similar repositories for ReasoningOCR

Users that are interested in ReasoningOCR are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

MiliLab / LogicOCR
View on GitHub
[arXiv: 2505.12307] LogicOCR: Do Your Large Multimodal Models Excel at Logical Reasoning on Text-Rich Images?
☆35Dec 1, 2025Updated 7 months ago
ViTAE-Transformer / SAMText
View on GitHub
The official repo for the technical report "Scalable Mask Annotation for Video Text Spotting"
☆16May 3, 2023Updated 3 years ago
DREAMXFAR / FCL-Net
View on GitHub
This is the pytorch implementation of FCL-Net, accepted by NN'2022.
☆15May 25, 2022Updated 4 years ago
Hhaizee / RFL-CDNet
View on GitHub
☆12Apr 26, 2024Updated 2 years ago
wangbing1416 / C3DA
View on GitHub
Source code of COLING 2022 paper "A Contrastive Cross-channel Data Augmentation Framework for Aspect-based Sentiment Analysis"
☆22Feb 18, 2023Updated 3 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
Hxyz-123 / GoMatching
View on GitHub
[NeurIPS'24] GoMatching: A Simple Baseline for Video Text Spotting via Long and Short Term Matching
☆34May 29, 2025Updated last year
WHU-ZQH / UIKA
View on GitHub
Unified Instance and Knowledge Alignment Pretraining for Aspect-based Sentiment Analysis
☆17Mar 27, 2023Updated 3 years ago
SCUT-DLVCLab / OCR-Reasoning
View on GitHub
[ICLR 2026] OCR-Reasoning Benchmark: Unveiling the True Capabilities of MLLMs in Complex Text-Rich Image Reasoning
☆76May 26, 2026Updated 2 months ago
Wei-ucas / TPSNet
View on GitHub
☆28Nov 29, 2023Updated 2 years ago
adeline-cs / GTR
View on GitHub
Scene text recognition
☆108Jul 7, 2022Updated 4 years ago
WHU-ZQH / KGAN
View on GitHub
[TKDE] Knowledge Graph Augmented Network Towards Multiview Representation Learning for Aspect-based Sentiment Analysis
☆52Apr 4, 2024Updated 2 years ago
Perfect-You / SDACD
View on GitHub
☆60Oct 21, 2022Updated 3 years ago
ViTAE-Transformer / ViTAE-Transformer-Scene-Text-Detection
View on GitHub
A comprehensive list [Hi-SAM@TPAMI'24, GoMatching@NeurIPS'24, DeepSolo(++)@ CVPR'23, DPText-DETR@AAAI'23, I3CL@IJCV'22] of our research w…
☆94Nov 12, 2024Updated last year
SCUT-DLVCLab / RFUND
View on GitHub
[MM'2024] Official release of RFUND introduced in the MM'2024 paper "PEneo: Unifying Line Extraction, Line Grouping, and Entity Linking f…
☆21Dec 4, 2024Updated last year
Virtual machines for every use case on DigitalOcean • Ad
Get dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
ViTAE-Transformer / DeepSolo
View on GitHub
The official repo for [CVPR'23] "DeepSolo: Let Transformer Decoder with Explicit Points Solo for Text Spotting" & [ArXiv'23] "DeepSolo++:…
☆294May 30, 2025Updated last year
lcy0604 / QT-TextSR
View on GitHub
This repository is the implementation of "QT-TextSR: Enhancing scene text image super-resolution via efficient interaction with text reco…
☆20Jul 9, 2025Updated last year
whlscut / DocLayLLM
View on GitHub
[CVPR 2025] DocLayLLM: An Efficient Multi-modal Extension of Large Language Models for Text-rich Document Understanding
☆30Dec 18, 2025Updated 7 months ago
MiliLab / S5
View on GitHub
Official repo for [AAAI 2026 Oral] "S5: Scalable Semi-Supervised Semantic Segmentation in Remote Sensing"
☆37Dec 4, 2025Updated 7 months ago
Dedsec-Xu / DatasetImgLabel-ICDAR2015
View on GitHub
DatasetImgLabeler is a image annotation tool for researchers to prepare datasets in ICDAR2015 format
☆12Dec 7, 2019Updated 6 years ago
WHU-ZQH / ChatGPT-vs.-BERT
View on GitHub
🎁[ChatGPT4NLU] A Comparative Study on ChatGPT and Fine-tuned BERT
☆191Apr 17, 2023Updated 3 years ago
weijiawu / TransDETR
View on GitHub
[IJCV 2024] TransDETR: End-to-end Video Text Spotting with Transformer
☆114Mar 28, 2024Updated 2 years ago
MiliLab / AnesSuite
View on GitHub
Official repo for [ICLR 2026] "AnesSuite: A Comprehensive Benchmark and Dataset Suite for Anesthesiology Reasoning in LLMs"
☆25Feb 28, 2026Updated 5 months ago
weijiawu / TransVTSpotter
View on GitHub
A new video text spotting framework with Transformer
☆82May 23, 2022Updated 4 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
WHU-ZQH / PANDA
View on GitHub
PANDA: Prompt Transfer Meets Knowledge Distillation for Efficient Model Adaptation
☆16Mar 28, 2023Updated 3 years ago
DrLuo / SemiETS
View on GitHub
【CVPR 2025】SemiETS: Integrating Spatial and Content Consistencies for Semi-Supervised End-to-end Text Spotting
☆17Jul 1, 2025Updated last year
ymy-k / DPText-DETR
View on GitHub
[AAAI'23 Oral] DPText-DETR: Towards Better Scene Text Detection with Dynamic Points in Transformer
☆204Aug 31, 2023Updated 2 years ago
luyang-NWPU / HGA-STR
View on GitHub
It's the code for <A holistic representation guided attention network for scene text recognition>Neurocomputing 2020
☆17Dec 1, 2020Updated 5 years ago
Quanato607 / XLSTM-HVED
View on GitHub
[ISBI 2025] XLSTM-HVED: Cross-Modal Brain Tumor Segmentation and MRI Reconstruction Method Using Vision XLSTM and Heteromodal Variational…
☆18Jul 9, 2025Updated last year
HDETR / H-PETR-Pose
View on GitHub
[CVPR2023] This is an official implementation of paper "DETRs with Hybrid Matching".
☆14Sep 1, 2022Updated 3 years ago
JizhiziLi / RIM
View on GitHub
[CVPR 2023] Referring Image Matting
☆208Apr 17, 2023Updated 3 years ago
lqzxt / NGTR
View on GitHub
☆14May 26, 2025Updated last year
rongzhou7 / TTT-Unet
View on GitHub
☆24Dec 12, 2025Updated 7 months ago
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
encounter1997 / DE-CondDETR
View on GitHub
Official Implementation of DE-CondDETR and DELA-CondDETR in "Towards Data-Efficient Detection Transformers"
☆45Aug 25, 2022Updated 3 years ago
loong8888 / WAIR
View on GitHub
Wide-angle Image Rectification
☆12Oct 20, 2020Updated 5 years ago
shi-yx / URaG
View on GitHub
Official implementation of URaG: Unified Retrieval and Generation in Multimodal LLMs for Efficient Long Document Understanding (AAAI 2026…
☆43Feb 4, 2026Updated 5 months ago
xdxie / WAS_WordArt-Segmentation
View on GitHub
The official codes and datasets for Artistic Text Segmentation (ECCV 2024).
☆30Sep 24, 2025Updated 10 months ago
srl-freiburg / pedsim
View on GitHub
standalone pedsim library (pedestrian simulator using social force model)
☆12Sep 23, 2015Updated 10 years ago
JerryXu0129 / HHF
View on GitHub
☆12Sep 8, 2022Updated 3 years ago
lmaxwell / Armednn
View on GitHub
cross-platform modular neural network inference library, small and efficient
☆13May 15, 2023Updated 3 years ago