dali92002/OCR-TR

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/dali92002/OCR-TR)

dali92002 / OCR-TR

Optocal Character Recognition (OCR / HTR) using Transformers

☆11

Alternatives and similar repositories for OCR-TR

Users that are interested in OCR-TR are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

dali92002 / SSL-OCR
View on GitHub
Text-DIAE: A Self-Supervised Degradation Invariant Autoencoders for Text Recognition and Document Enhancement - AAAI 2023
☆30Jul 12, 2023Updated 3 years ago
furkanbiten / object-bias
View on GitHub
Let there be clock in the beach - WACV 2022
☆15Nov 15, 2021Updated 4 years ago
dali92002 / HTRbyMatching
View on GitHub
Hadwritten Text Recognition in Few-shot Scenario
☆22Mar 25, 2023Updated 3 years ago
AndresPMD / semantic_adaptive_margin
View on GitHub
WACV 2022 Paper - Is An Image Worth Five Sentences? A New Look into Semantics for Image-Text Matching
☆16Dec 10, 2021Updated 4 years ago
ayanban011 / SVGCraft
View on GitHub
[WACV 2026 Round 1] Beyond Single Object Text-to-SVG Synthesis with Comprehensive Canvas Layout
☆24Oct 11, 2025Updated 9 months ago
Proton VPN Special Offer - Get 70% off • Ad
Special partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
theitzin / FUS3D
View on GitHub
Official code of our ICCV paper "A Fast Unified System for 3D Object Detection and Tracking"
☆10Sep 29, 2023Updated 2 years ago
mwoedlinger / ecsic
View on GitHub
Official code of our WACV paper "ECSIC: Epipolar Cross Attention for Stereo Image Compression"
☆15Dec 27, 2023Updated 2 years ago
DIVA-DIA / DIVA-DAF
View on GitHub
Repository for the deep-learning framework DIVA-DAF which is build with historical document image analysis in mind.
☆19Nov 7, 2024Updated last year
biswassanket / DocSegTr
View on GitHub
A Bottom-Up Instance Segmentation Strategy for segmenting document instances using Transformers
☆59Sep 9, 2024Updated last year
mongoose54 / negex
View on GitHub
Automatically exported from code.google.com/p/negex
☆14Sep 29, 2015Updated 10 years ago
Zhenhang-Li / GlyphOnly
View on GitHub
【2024 ECAI】First Creating Backgrounds Then Rendering Texts: A New Paradigm for Visual Text Blending
☆14Jun 16, 2025Updated last year
ayanban011 / GraphKD
View on GitHub
[ICDAR 2024] (Best Student Paper🏆) Exploring Knowledge Distillation Towards Document Object Detection with Structured Graph Creation
☆16Sep 6, 2024Updated last year
emanuelevivoli / awesome-comics-understanding
View on GitHub
The official repo of the Comics Survey: "A missing piece in Vision and Language: A Survey on Comics Understanding"
☆139Jan 2, 2025Updated last year
furkanbiten / SelectiveTextStyleTransfer
View on GitHub
ICDAR 2019
☆25Aug 2, 2019Updated 6 years ago
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
omni-us / research-seq2seq-HTR
View on GitHub
☆21Jul 24, 2019Updated 7 years ago
AndresPMD / Fine_Grained_Clf
View on GitHub
Based on the WACV 2020 paper - Fine Grained Classification and Retrieval by Combining Visual and Locally Pooled Textual Features
☆25Nov 15, 2021Updated 4 years ago
mwoedlinger / sasic
View on GitHub
Official code of our CVPR paper "SASIC: Stereo Image Compression with Latent Shifts and Stereo Attention"
☆25Apr 11, 2024Updated 2 years ago
davidserra9 / namedcurves
View on GitHub
[ECCV'24] [TPAMI'26] NamedCurves: Learned Image Enhancement via Color Naming
☆36May 26, 2026Updated last month
coolbutuseless / triangular
View on GitHub
Decompose complex polygons into sets of triangles
☆10Oct 4, 2020Updated 5 years ago
UbiquantAI / Fleming-VL
View on GitHub
Fleming-VL: Towards Universal Medical Visual Understanding with Multimodal LLMs
☆15Nov 6, 2025Updated 8 months ago
waodng / DataGearDashboardTemplate
View on GitHub
静态大屏HTML模板，可作为看板模板导入DataGear数据可视化分析平台，制作大屏展示数据可视化看板
☆21Feb 23, 2022Updated 4 years ago
strikingly / blog
View on GitHub
☆10Jan 28, 2016Updated 10 years ago
furkanbiten / idl_data
View on GitHub
OCR Annotations from Amazon Textract for Industry Documents Library
☆103Aug 20, 2022Updated 3 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
warguns / vacunacovid-catsalud-autofullfill
View on GitHub
Fill the boring catsalud covid vaccine form with a console command
☆16Jul 17, 2021Updated 5 years ago
sidgairo18 / unsupervised-style-learning
View on GitHub
This repository contains the source code, models and data files for the work titled: "Unsupervised Image Style Embeddings for Retrieval a…
☆13May 29, 2021Updated 5 years ago
Sammy20207109 / DyCo-RL
View on GitHub
DyCo-RL: Dynamic Cross-Modal Coordination for Visual Reasoning
☆18Jun 14, 2026Updated last month
andreagemelli / doc2graph
View on GitHub
Doc2Graph transforms documents into graphs and exploit a GNN to solve several tasks.
☆139Oct 18, 2025Updated 9 months ago
shuyansy / Efficient-Ambiguous-Text-Detector
View on GitHub
An official Project related to Paper "Perceiving Ambiguity and Semantics without Recognition: An Efficient and Effective Ambiguous Scene …
☆22Dec 3, 2023Updated 2 years ago
tomiock / macrograd
View on GitHub
Deep learning Framework from scratch.
☆11Jul 23, 2025Updated last year
priba / graph_metric.pytorch
View on GitHub
Graph Metric Learning in PyTorch
☆10Apr 7, 2021Updated 5 years ago
UbiquantAI / IDO
View on GitHub
Turn every moment into momentum
☆22Jun 1, 2026Updated last month
easonnie / ChaosNLI
View on GitHub
[EMNLP 2020] Collective HumAn OpinionS on Natural Language Inference Data
☆42Apr 7, 2022Updated 4 years ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
ihdia / instance-segmentation-v1
View on GitHub
☆10Jan 22, 2023Updated 3 years ago
dali92002 / DE-GAN
View on GitHub
Document Image Enhancement with GANs - TPAMI journal
☆223Mar 24, 2023Updated 3 years ago
AILab-UniFI / cte-dataset
View on GitHub
CTE: Contextualized Table Extraction Dataset
☆17Feb 23, 2023Updated 3 years ago
leftthomas / ClipPrompt
View on GitHub
A PyTorch implementation of ClipPrompt based on CVPR 2023 paper "CLIP for All Things Zero-Shot Sketch-Based Image Retrieval, Fine-Grained…
☆18Nov 5, 2023Updated 2 years ago
DTU-PAS / awesome-dynn-for-cv
View on GitHub
Awesome collection of DyNN papers for Computer Vision and Sensor Fusion applications
☆35May 5, 2026Updated 2 months ago
xdu-jjgs / QingDao-ship-detection
View on GitHub
青岛船舶检测
☆13Apr 16, 2025Updated last year
wangkai930418 / HCV_IIRC
View on GitHub
code for our BMVC 2021 paper "HCV: Hierarchy-Consistency Verification for Incremental Implicitly-Refined Classification"
☆15Oct 28, 2022Updated 3 years ago