TommyZihao/MMOCR_tutorials

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/TommyZihao/MMOCR_tutorials)

TommyZihao / MMOCR_tutorials

Jupyter notebook tutorials for MMOCR

☆44

Alternatives and similar repositories for MMOCR_tutorials

Users that are interested in MMOCR_tutorials are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

zzyhlyoko / DCTC
View on GitHub
☆42Sep 2, 2023Updated 2 years ago
open-mmlab / mmocr
View on GitHub
OpenMMLab Text Detection, Recognition and Understanding Toolbox
☆4,747Nov 27, 2024Updated last year
TommyZihao / MMClassification_Tutorials
View on GitHub
Jupyter notebook tutorials for MMClassification
☆28Aug 23, 2022Updated 3 years ago
MelosY / CAM
View on GitHub
☆27Feb 20, 2024Updated 2 years ago
bytedance / SPTSv2
View on GitHub
The official implementation of SPTS v2: Single-Point Text Spotting
☆138Jun 29, 2023Updated 3 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
JD-GenX / cbn.pytorch
View on GitHub
Official PyTorch implementation of "CBNet: A Plug-and-Play Network for Segmentation-Based Scene Text Detection"
☆23Mar 30, 2024Updated 2 years ago
TommyZihao / MMPretrain_Tutorials
View on GitHub
Jupyter notebook tutorials for MMPretrain
☆15Oct 10, 2023Updated 2 years ago
win5923 / TrOCR-Handwritten-Mathematical-Expression-Recognition
View on GitHub
Handwritten mathematical symbols recognition with TrOCR
☆21Jul 11, 2023Updated 3 years ago
wenwenyu / TCM
View on GitHub
Turning a CLIP Model into a Scene Text Detector (CVPR2023) | Turning a CLIP Model into a Scene Text Spotter (TPAMI)
☆202Jun 17, 2024Updated 2 years ago
clovaai / units
View on GitHub
☆78Aug 7, 2023Updated 2 years ago
espnet / warp-ctc
View on GitHub
Pytorch Bindings for warp-ctc maintained by ESPnet
☆17Feb 20, 2021Updated 5 years ago
shuyansy / MLLM-Semantic-Hallucination
View on GitHub
🔥🔥[NeurIPS2025]Exploring and mitigating semantic hallucinations in scene text perception and reasoning
☆30Dec 11, 2025Updated 7 months ago
zhiminzhang0830 / FCENet_Paddle
View on GitHub
☆12Dec 29, 2021Updated 4 years ago
FelixHertlein / doc-matcher
View on GitHub
Inference, training and evaluation code for our paper "DocMatcher: Document Image Dewarping via Structural and Textual Line Matching" (WA…
☆55Jul 1, 2025Updated last year
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
wzx99 / CLIPOCR
View on GitHub
☆38Oct 20, 2023Updated 2 years ago
lcy0604 / CTRNet-plus
View on GitHub
The official implement of CTRNet++.
☆15Dec 30, 2024Updated last year
openvino-book / PaddleOCR-VL-SFT-for-Japanese-Manga-on-RTX-3060
View on GitHub
Fine-tune PaddleOCR-VL on the Manga109s dataset for Japanese manga text recognition. The base model struggles with vertical Japanese text…
☆15Dec 7, 2025Updated 7 months ago
PRIS-CV / FourierSR
View on GitHub
[TIP 2026] "FourierSR: A Fourier Token-based Plugin for Efficient Image Super-Resolution"
☆16Feb 4, 2026Updated 5 months ago
IITB-LEAP-OCR / SPRINT
View on GitHub
SPRINT: Script-agnostic Structure Recognition in Tables
☆17Mar 26, 2025Updated last year
tertiarycourses / SckitLearn
View on GitHub
Exercise files for Python SciKit Learn Machine Learning Training
☆12Dec 20, 2020Updated 5 years ago
rayg1234 / pytlib
View on GitHub
A pytorch framework for building neurals networks for visual recognition, encoding, and detection tasks. The goal is to bridge the gap be…
☆10Dec 20, 2019Updated 6 years ago
LukasHedegaard / datasetops
View on GitHub
Fluent dataset operations, compatible with your favorite libraries
☆11Sep 4, 2025Updated 10 months ago
shenzy08 / PDO-eConvs
View on GitHub
Official implementation of the ICML 2020 paper "PDO-eConvs: Partial Differential Operator Based Equivariant Convolutions".
☆14Jun 2, 2021Updated 5 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
WenmuZhou / OCR_DataSet
View on GitHub
收集并整理有关OCR的数据集并统一标注格式，以便实验需要
☆970Nov 28, 2023Updated 2 years ago
RNGesus-exe / Yolov8_ByteTracker_cpp
View on GitHub
Implementation of YoLov5 in C++, To detect objects in a video (.mp4)
☆15Aug 10, 2023Updated 2 years ago
HCIILAB / Scene-Text-Recognition-Recommendations
View on GitHub
Papers, Datasets, Algorithms, SOTA for STR. Long-time Maintaining
☆353Nov 29, 2023Updated 2 years ago
weijiawu / SyntoReal_STD
View on GitHub
HHH
☆38May 2, 2022Updated 4 years ago
SJTU-DeepVisionLab / FreeReal
View on GitHub
[ECCV2024] Bridging Synthetic and Real Worlds for Pre-training Scene Text Detectors
☆19Sep 7, 2024Updated last year
OpenGVLab / Official-ConvMAE-Det
View on GitHub
☆18Aug 23, 2022Updated 3 years ago
yyyyyxie / DNTextSpotter
View on GitHub
[ACMMM 2024]: Official implementation of the paper "DNTextSpotter: Arbitrary-Shaped Scene Text Spotting via Improved Denoising Training"
☆38Jan 14, 2026Updated 6 months ago
Eurus-Holmes / SynthText_CH
View on GitHub
[SynthText Chinese] Improved code for generating synthetic text images as described in "Synthetic Data for Text Localisation in Natural I…
☆14Dec 8, 2022Updated 3 years ago
blackprotoss / CIRI
View on GitHub
Reproducing the Past: A Dataset for Benchmarking Inscription Restoration (ACM MM'24)
☆14Oct 15, 2025Updated 9 months ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
lcy0604 / QT-TextSR
View on GitHub
This repository is the implementation of "QT-TextSR: Enhancing scene text image super-resolution via efficient interaction with text reco…
☆20Jul 9, 2025Updated last year
jack139 / ocr-rare-chars
View on GitHub
生僻字OCR识别优化训练
☆16Feb 16, 2023Updated 3 years ago
EasonChen99 / 2D3DPoseTracking
View on GitHub
Official code for "Tracking Camera Poses in LiDAR Maps with Multi-View Geometric Constraints"
☆12Dec 3, 2024Updated last year
SCUT-DLVCLab / OCR-Reasoning
View on GitHub
[ICLR 2026] OCR-Reasoning Benchmark: Unveiling the True Capabilities of MLLMs in Complex Text-Rich Image Reasoning
☆76May 26, 2026Updated last month
svikramank / DensePose
View on GitHub
In this repo, I tried replicating the famous Facebook's DensePose R-CNN model and tried to visualize the collected DensePose-COCO datase…
☆10Sep 6, 2018Updated 7 years ago
whai362 / pan_pp_stable
View on GitHub
☆27Oct 9, 2022Updated 3 years ago
harrytea / UDoc-GAN
View on GitHub
Official PyTorch implementation for ACM MM22 "UDoc-GAN: Unpaired Document Illumination Correction with Background Light Prior"
☆25Aug 5, 2024Updated last year