dali92002/SSL-OCR

Text-DIAE: A Self-Supervised Degradation Invariant Autoencoder for Text Recognition and Document Enhancement - AAAI 2023

☆30

Alternatives and similar repositories for SSL-OCR

Users who are interested in SSL-OCR are comparing it to the repositories listed below.

dali92002 / OCR-TR
Optical Character Recognition (OCR / HTR) using Transformers
☆11 · Aug 20, 2022 · Updated 3 years ago
dali92002 / HTRbyMatching
Handwritten Text Recognition in Few-shot Scenario
☆22 · Mar 25, 2023 · Updated 3 years ago
LARS-research / TREFE
Searching a High Performance Feature Extractor for Text Recognition Network. TPAMI 2022
☆13 · Nov 25, 2022 · Updated 3 years ago
furkanbiten / object-bias
Let there be a clock on the beach - WACV 2022
☆15 · Nov 15, 2021 · Updated 4 years ago
AndresPMD / semantic_adaptive_margin
WACV 2022 Paper - Is An Image Worth Five Sentences? A New Look into Semantics for Image-Text Matching
☆16 · Dec 10, 2021 · Updated 4 years ago
callsys / FlowText
[ICME 2023] FlowText: Synthesizing Realistic Scene Text Video with Optical Flow Estimation
☆13 · May 13, 2023 · Updated 3 years ago
MelosY / CAM
☆27 · Feb 20, 2024 · Updated 2 years ago
aiintelligentsystems / next-level-bert
☆16 · Jun 14, 2024 · Updated 2 years ago
gsoykan / comics_text_plus
Official repository of the paper: "A Comprehensive Gold Standard and Benchmark for Comics Text Detection and Recognition"
☆26 · Jul 10, 2023 · Updated 3 years ago
csguoh / KD-LTR
[MM2023] An official implementation of the paper "One-stage Low-resolution Text Recognition with High-resolution Knowledge Transfer"
☆16 · Nov 3, 2023 · Updated 2 years ago
bytedance / E2STR
The official code for the CVPR 2024 paper: Multi-modal In-Context Learning Makes an Ego-evolving Scene Text Recognizer
☆55 · Jun 14, 2024 · Updated 2 years ago
ayanban011 / SVGCraft
[WACV 2026 Round 1] Beyond Single Object Text-to-SVG Synthesis with Comprehensive Canvas Layout
☆24 · Oct 11, 2025 · Updated 9 months ago
hecoding / Hyper-Modulation
Official Implementation for "Transferring Unconditional to Conditional GANs with Hyper-Modulation" CVPRW 22 https://arxiv.org/abs/2112.02…
☆13 · Jun 28, 2022 · Updated 4 years ago
shuyansy / Visual-Text-Processing-survey
The official project of paper "Visual Text Processing: A Comprehensive Review and Unified Evaluation"
☆103 · Oct 20, 2025 · Updated 9 months ago
ThunderVVV / RCLSTR
Official PyTorch implementation of `[ACMMM 2023]Relational Contrastive Learning for Scene Text Recognition`
☆17 · Sep 22, 2023 · Updated 2 years ago
AndresPMD / Pytorch-yolo-phoc
PyTorch implementation of the code from the ECCV 2018 paper - Single Shot Scene Text Retrieval
☆13 · Dec 15, 2021 · Updated 4 years ago
dali92002 / DocEnTR
DocEnTr: An end-to-end document image enhancement transformer - ICPR 2022
☆190 · Jan 17, 2025 · Updated last year
CyrilSterling / LPV
The official code of Linguistic More: Taking a Further Step toward Efficient and Accurate Scene Text Recognition (IJCAI2023)
☆26 · Sep 3, 2023 · Updated 2 years ago
amazon-science / textadain-robust-recognition
TextAdaIN: Paying Attention to Shortcut Learning in Text Recognizers
☆21 · Jul 26, 2022 · Updated 3 years ago
theitzin / FUS3D
Official code of our ICCV paper "A Fast Unified System for 3D Object Detection and Tracking"
☆10 · Sep 29, 2023 · Updated 2 years ago
RichSu95 / Document_Binarization_Collection
This repository is a concise collection of well-known deep-learning-based document binarization models.
☆30 · Dec 24, 2022 · Updated 3 years ago
dali92002 / DE-GAN
Document Image Enhancement with GANs - TPAMI journal
☆222 · Mar 24, 2023 · Updated 3 years ago
mwoedlinger / ecsic
Official code of our WACV paper "ECSIC: Epipolar Cross Attention for Stereo Image Compression"
☆15 · Dec 27, 2023 · Updated 2 years ago
ayumiymk / DiG
Official PyTorch implementation of `Reading and Writing: Discriminative and Generative Modeling for Self-Supervised Text Recognition`
☆74 · Feb 27, 2023 · Updated 3 years ago
joanrod / ocr-vqgan
OCR-VQGAN, a discrete image encoder (tokenizer and detokenizer) for figure images in Paper2Fig100k dataset. Implementation of OCR Percept…
☆84 · Jan 30, 2023 · Updated 3 years ago
NiccoBiondi / ContrastiveSupervisedDistillation
This repo contains the code of "Contrastive Supervised Distillation for Continual Representation Learning", Tommaso Barletti, Niccolò Bio…
☆20 · Jul 5, 2022 · Updated 4 years ago
DIVA-DIA / DIVA-DAF
Repository for the deep-learning framework DIVA-DAF, which is built with historical document image analysis in mind.
☆19 · Nov 7, 2024 · Updated last year
furkanbiten / idl_data
OCR Annotations from Amazon Textract for Industry Documents Library
☆103 · Aug 20, 2022 · Updated 3 years ago
furkanbiten / SelectiveTextStyleTransfer
ICDAR 2019
☆25 · Aug 2, 2019 · Updated 6 years ago
AndresPMD / Fine_Grained_Clf
Based on the WACV 2020 paper - Fine Grained Classification and Retrieval by Combining Visual and Locally Pooled Textual Features
☆25 · Nov 15, 2021 · Updated 4 years ago
AndresPMD / StacMR
Scene Text Aware Cross Modal Retrieval (StacMR)
☆24 · Sep 3, 2021 · Updated 4 years ago
shuyansy / Efficient-Ambiguous-Text-Detector
An official Project related to Paper "Perceiving Ambiguity and Semantics without Recognition: An Efficient and Effective Ambiguous Scene …
☆22 · Dec 3, 2023 · Updated 2 years ago
Mountchicken / Text-Recognition-on-Cross-Domain-Datasets
Improved Text recognition algorithms on different text domains like scene text, handwritten, document, Chinese/English, even ancient book…
☆80 · Feb 4, 2023 · Updated 3 years ago
IITB-LEAP-OCR / TEXTRON
Data Programming for Text Detection in Documents using SPEAR
☆12 · Mar 26, 2025 · Updated last year
adeline-cs / GTR
Scene text recognition
☆108 · Jul 7, 2022 · Updated 4 years ago
andreagemelli / doc2graph
Doc2Graph transforms documents into graphs and exploits a GNN to solve several tasks.
☆139 · Oct 18, 2025 · Updated 9 months ago
qhnhynmm / ViOCRVQA-Dataset
The largest VQA dataset for Vietnamese, related to the text content in the image.
☆19 · Apr 9, 2025 · Updated last year
mxin262 / ESTextSpotter
(ICCV 2023) ESTextSpotter: Towards Better Scene Text Spotting with Explicit Synergy in Transformer
☆78 · Apr 9, 2024 · Updated 2 years ago
onealwj / MVLT
PyTorch implementation of BMVC2022 paper Masked Vision-Language Transformers for Scene Text Recognition
☆29 · Nov 11, 2022 · Updated 3 years ago