guoxy25/Ocean-OCR

guoxy25 / Ocean-OCR

☆48

Alternatives and similar repositories for Ocean-OCR

Users who are interested in Ocean-OCR are comparing it to the libraries listed below.

Hxyz-123 / ReasoningOCR
☆18 · Jul 24, 2025 · Updated last year
GbotHQ / ocr-dataset-rendering
☆39 · Oct 7, 2023 · Updated 2 years ago
IITB-LEAP-OCR / SPRINT
SPRINT: Script-agnostic Structure Recognition in Tables
☆17 · Mar 26, 2025 · Updated last year
Tencent / POINTS-Reader
☆197 · Dec 7, 2025 · Updated 7 months ago
Mountchicken / Union14M
[ICCV 2023] Code base for Revisiting Scene Text Recognition: A Data Perspective
☆206 · Nov 1, 2023 · Updated 2 years ago
yeungchenwa / HDR
[AAAI2025 Oral] Predicting the Original Appearance of Damaged Historical Documents
☆111 · Jun 28, 2026 · Updated last month
ucaslcl / Fox
Official code for "Fox: Focus Anywhere for Fine-grained Multi-page Document Understanding"
☆197 · May 31, 2024 · Updated 2 years ago
LukeForeverYoung / UReader
☆142 · Feb 13, 2024 · Updated 2 years ago
VLM-RL / Ocean-R1
☆26 · Apr 9, 2025 · Updated last year
FudanVI / benchmarking-chinese-text-recognition
This repository contains datasets and baselines for benchmarking Chinese text recognition.
☆509 · Dec 2, 2022 · Updated 3 years ago
dreamy-xay / TableCenterNet
The source code repository for the paper.
☆26 · Sep 8, 2025 · Updated 10 months ago
raphael-baena / DTLR
Handwritten Text Recognition and Character Detection
☆169 · Sep 28, 2025 · Updated 10 months ago
HCIILAB / Scene-Text-Recognition-Recommendations
Papers, Datasets, Algorithms, SOTA for STR. Long-time Maintaining
☆353 · Nov 29, 2023 · Updated 2 years ago
HCIILAB / M6Doc
☆166 · May 8, 2025 · Updated last year
ymy-k / DPText-DETR
[AAAI'23 Oral] DPText-DETR: Towards Better Scene Text Detection with Dynamic Points in Transformer
☆204 · Aug 31, 2023 · Updated 2 years ago
FutureRising007 / Table_Structure_Recognition
Table Structure Recognition
☆83 · Mar 11, 2023 · Updated 3 years ago
zzyhlyoko / DCTC
☆42 · Sep 2, 2023 · Updated 2 years ago
wzx99 / CLIPOCR
☆38 · Oct 20, 2023 · Updated 2 years ago
Yuliang-Liu / MultimodalOCR
On the Hidden Mystery of OCR in Large Multimodal Models (OCRBench)
☆873 · Jul 22, 2026 · Updated last week
EDM-Research / VATr-pp
☆18 · Jul 9, 2024 · Updated 2 years ago
SCUT-DLVCLab / RFUND
[MM'2024] Official release of RFUND introduced in the MM'2024 paper "PEneo: Unifying Line Extraction, Line Grouping, and Entity Linking f…
☆21 · Dec 4, 2024 · Updated last year
Dedsec-Xu / DatasetImgLabel-ICDAR2015
DatasetImgLabeler is an image annotation tool for researchers to prepare datasets in ICDAR2015 format
☆12 · Dec 7, 2019 · Updated 6 years ago
mittagessen / curt
☆15 · Jul 11, 2022 · Updated 4 years ago
yeungchenwa / Recommendations-Diffusion-Text-Image
A paper collection of recent diffusion models for text-image generation tasks, e.g., visual text generation, font generation, text remova…
☆273 · Dec 19, 2024 · Updated last year
lilingxi01 / nougat-replication
A full codebase for replicating the results of Nougat from downloading arXiv dataset to the final evaluation. It also contains a few fixe…
☆11 · Dec 11, 2023 · Updated 2 years ago
MAmmoTH-VL / MAmmoTH-VL
(ACL 2025) MAmmoTH-VL: Eliciting Multimodal Reasoning with Instruction Tuning at Scale
☆50 · Jun 4, 2025 · Updated last year
SII-sc22mc / DocFusion
A Unified Framework for Document Parsing Tasks (Including Document Layout Analysis, OCR, Formula Recognition, and Table Recognition)
☆15 · Jul 1, 2025 · Updated last year
yqingli123 / TDv2
The source codes of TDv2 in paper: TDv2: A Novel Tree-Structured Decoder for Offline Mathematical Expression Recognition.
☆12 · Jul 28, 2022 · Updated 4 years ago
Sanster / OhMyTable
Table Structure Recognition
☆28 · Jul 25, 2024 · Updated 2 years ago
InternScience / SimChart9K
The proposed simulated dataset consisting of 9,536 charts and associated data annotations in CSV format.
☆26 · Feb 22, 2024 · Updated 2 years ago
LianaWang / TextRay
Project code for ACM MM2020 paper: "TextRay: Contour-based Geometric Modeling for Arbitrary-shaped Scene Text Detection"
☆47 · Oct 3, 2023 · Updated 2 years ago
infly-ai / INF-MLLM
INF Tech's open-source MLLMs for SOTA visual-language understanding and advanced document intelligence.
☆238 · Jul 22, 2026 · Updated last week
google-research-datasets / vrdu
We identify the desiderata for a comprehensive benchmark and propose Visually Rich Document Understanding (VRDU). VRDU contains two datas…
☆83 · Feb 8, 2023 · Updated 3 years ago
effect-handlers / wasm-spec
WebAssembly specification, reference interpreter, and test suite.
☆13 · Aug 31, 2023 · Updated 2 years ago
FaltingsA / SSM
[IJCAI-2024] The official code of Self-Supervised Pre-training with Symmetric Superimposition Modeling for Scene Text Recognition
☆10 · Aug 10, 2025 · Updated 11 months ago
vayvi / HDV
Historical Diagram Vectorization
☆20 · Nov 25, 2025 · Updated 8 months ago
ChenyuGAO-CS / SMA
The imdb files with SBD-Trans OCR for TextVQA dataset.
☆11 · Nov 30, 2021 · Updated 4 years ago
taolusi / SECURE
ACL'2024-Main: Synergetic Event Understanding: A Collaborative Approach to Cross-Document Event Coreference Resolution with Large Languag…
☆12 · Sep 19, 2025 · Updated 10 months ago
ali-vilab / CAPability
What Is a Good Caption? A Comprehensive Visual Caption Benchmark for Evaluating Both Correctness and Thoroughness
☆28 · May 16, 2025 · Updated last year