☆48Feb 7, 2025Updated last year
Alternatives and similar repositories for Ocean-OCR
Users that are interested in Ocean-OCR are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆38Oct 7, 2023Updated 2 years ago
- SPRINT: Script-agnostic Structure Recognition in Tables☆16Mar 26, 2025Updated last year
- [ICCV 2023] Code base for Revisiting Scene Text Recognition: A Data Perspective☆205Nov 1, 2023Updated 2 years ago
- official code for "Fox: Focus Anywhere for Fine-grained Multi-page Document Understanding"☆195May 31, 2024Updated last year
- ☆142Feb 13, 2024Updated 2 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- This repository contains datasets and baselines for benchmarking Chinese text recognition.☆509Dec 2, 2022Updated 3 years ago
- ☆33Jan 17, 2026Updated 4 months ago
- Papers, Datasets, Algorithms, SOTA for STR. Long-time Maintaining☆354Nov 29, 2023Updated 2 years ago
- Handwritten Text Recognition and Character Detection☆166Sep 28, 2025Updated 7 months ago
- [AAAI'23 Oral] DPText-DETR: Towards Better Scene Text Detection with Dynamic Points in Transformer☆201Aug 31, 2023Updated 2 years ago
- Table Structure Recognition☆28Jul 25, 2024Updated last year
- ☆161May 8, 2025Updated last year
- [ICASSP2024] An official implement of the paper "EFFICIENT SCENE TEXT IMAGE SUPER-RESOLUTION WITH SEMANTIC GUIDANCE"☆24May 12, 2024Updated 2 years ago
- Official implementation for AAAI 2025 paper: SSAN: A Symbol Spatial-Aware Network for Handwritten Mathematical Expression Recognition☆16Jan 21, 2025Updated last year
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- ☆15Jul 11, 2022Updated 3 years ago
- The source code repository for the paper.☆24Sep 8, 2025Updated 8 months ago
- What Is a Good Caption? A Comprehensive Visual Caption Benchmark for Evaluating Both Correctness and Thoroughness☆27May 16, 2025Updated last year
- ☆42Sep 2, 2023Updated 2 years ago
- A paper collection of recent diffusion models for text-image generation tasks, e,g., visual text generation, font generation, text remova…☆274Dec 19, 2024Updated last year
- ☆18Jul 9, 2024Updated last year
- DatasetImgLabeler is a image annotation tool for researchers to prepare datasets in ICDAR2015 format☆12Dec 7, 2019Updated 6 years ago
- [MM'2024] Official release of RFUND introduced in the MM'2024 paper "PEneo: Unifying Line Extraction, Line Grouping, and Entity Linking f…☆21Dec 4, 2024Updated last year
- [CVPR 2025] DocLayLLM: An Efficient Multi-modal Extension of Large Language Models for Text-rich Document Understanding☆29Dec 18, 2025Updated 5 months ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- (ACL 2025) MAmmoTH-VL: Eliciting Multimodal Reasoning with Instruction Tuning at Scale☆49Jun 4, 2025Updated 11 months ago
- A full codebase for replicating the results of Nougat from downloading arXiv dataset to the final evaluation. It also contains a few fixe…☆11Dec 11, 2023Updated 2 years ago
- ☆34Jan 13, 2025Updated last year
- [PR 2025] The official GitHub page of "MegaHan97K: A Large-Scale Dataset for Mega-Category Chinese Character Recognition with over 97K Ca…☆81May 18, 2026Updated last week
- Official PyTorch Implementation of "Better Source, Better Flow: Learning Condition-Dependent Source Distribution for Flow Matching"☆31Mar 1, 2026Updated 2 months ago
- Historical Diagram Vectorization☆19Nov 25, 2025Updated 6 months ago
- The source codes of TDv2 in paper: TDv2: A Novel Tree-Structured Decoder for Offline Mathematical Expression Recognition.☆12Jul 28, 2022Updated 3 years ago
- OCRVerse: Towards Holistic OCR in End-to-End Vision-Language Models☆30Feb 4, 2026Updated 3 months ago
- The proposed simulated dataset consisting of 9,536 charts and associated data annotations in CSV format.☆26Feb 22, 2024Updated 2 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- ☆14Jun 10, 2025Updated 11 months ago
- We identify the desiderata for a comprehensive benchmark and propose Visually Rich Document Understanding (VRDU). VRDU contains two datas…☆82Feb 8, 2023Updated 3 years ago
- Project code for ACM MM2020 paper: "TextRay: Contour-based Geometric Modeling for Arbitrary-shaped Scene Text Detection"☆47Oct 3, 2023Updated 2 years ago
- [IJCAI-2024] The official code of Self-Supervised Pre-training with Symmetric Superimposition Modeling for Scene Text Recognition☆10Aug 10, 2025Updated 9 months ago
- Hanja Understanding Evaluation Dataset☆15May 2, 2022Updated 4 years ago
- Used in M4C feature extraction script: https://github.com/facebookresearch/mmf/blob/project/m4c/projects/M4C/scripts/extract_ocr_frcn_fea…☆13Jan 30, 2020Updated 6 years ago
- WebAssembly specification, reference interpreter, and test suite.☆13Aug 31, 2023Updated 2 years ago