gsoykan/comics_text_plus

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/gsoykan/comics_text_plus)

gsoykan / comics_text_plus

Official repository of the paper: "A Comprehensive Gold Standard and Benchmark for Comics Text Detection and Recognition"

☆26

Alternatives and similar repositories for comics_text_plus

Users that are interested in comics_text_plus are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

barisbatuhan / DASS_Detector
View on GitHub
Original Full Repository of the Paper: "Domain-Adaptive Self-Supervised Pre-training for Face & Body Detection in Drawings"
☆20Oct 14, 2025Updated 9 months ago
manga109 / panel-order-estimator
View on GitHub
A simple tool to estimate the reading order of comic panels
☆20Nov 14, 2022Updated 3 years ago
amazon-science / textadain-robust-recognition
View on GitHub
TextAdaIN: Paying Attention to Shortcut Learning in Text Recognizers
☆21Jul 26, 2022Updated 3 years ago
LARS-research / TREFE
View on GitHub
Searching a High Performance Feature Extractor for Text Recognition Network. TPAMI 2022
☆13Nov 25, 2022Updated 3 years ago
ku21fan / COO-Comic-Onomatopoeia
View on GitHub
COO: Comic onomatopoeia dataset (ECCV 2022)
☆95Feb 18, 2025Updated last year
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
miyyer / comics
View on GitHub
COMICS data / code / annotations
☆126Feb 20, 2019Updated 7 years ago
dali92002 / SSL-OCR
View on GitHub
Text-DIAE: A Self-Supervised Degradation Invariant Autoencoders for Text Recognition and Document Enhancement - AAAI 2023
☆30Jul 12, 2023Updated 3 years ago
csguoh / KD-LTR
View on GitHub
[MM2023] An official implement of the paper "One-stage Low-resolution Text Recognition with High-resolution Knowledge Transfer"
☆16Nov 3, 2023Updated 2 years ago
DCGM / SoftCTC
View on GitHub
This repository contains source codes for SoftCTC. Original paper can be found here: https://arxiv.org/abs/2212.02135
☆19Mar 7, 2023Updated 3 years ago
Caiyuan-Zheng / Consistency_Regularization_STR
View on GitHub
It's the code for the paper Pushing the Performance Limit of Scene Text Recognizer without Human Annotation, CVPR 2022.
☆28Jul 6, 2022Updated 4 years ago
zzyhlyoko / DCTC
View on GitHub
☆42Sep 2, 2023Updated 2 years ago
ku21fan / CLL-STR
View on GitHub
Cross-lingual learning in scene text recognition (ICASSP2024)
☆19Sep 29, 2024Updated last year
IITB-LEAP-OCR / TEXTRON
View on GitHub
Data Programming for Text Detection in Documents using SPEAR
☆12Mar 26, 2025Updated last year
MelosY / CAM
View on GitHub
☆27Feb 20, 2024Updated 2 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
joanrod / ocr-vqgan
View on GitHub
OCR-VQGAN, a discrete image encoder (tokenizer and detokenizer) for figure images in Paper2Fig100k dataset. Implementation of OCR Percept…
☆84Jan 30, 2023Updated 3 years ago
csitfun / ConTRoL-dataset
View on GitHub
Dataset for AAAI paper "Natural Language Inference in Context - Investigating Contextual Reasoning over Long Texts"
☆11Nov 18, 2022Updated 3 years ago
NickLucche / image_segmentation
View on GitHub
Image Segmentation using k-means, n-cuts and superpixels
☆11Mar 31, 2019Updated 7 years ago
ihdia / seamformer
View on GitHub
Official repository accompaying the ICDAR 2023 paper
☆14Oct 3, 2023Updated 2 years ago
yangbang18 / MultiCapCLIP
View on GitHub
(ACL'2023) MultiCapCLIP: Auto-Encoding Prompts for Zero-Shot Multilingual Visual Captioning
☆36Aug 8, 2024Updated last year
99Franklin / DiffText
View on GitHub
☆16Jan 10, 2025Updated last year
CyrilSterling / LPV
View on GitHub
The official code of Linguistic More: Taking a Further Step toward Efficient and Accurate Scene Text Recognition (IJCAI2023)
☆26Sep 3, 2023Updated 2 years ago
amazon-science / glass-text-spotting
View on GitHub
Official implementation for "GLASS: Global to Local Attention for Scene-Text Spotting" (ECCV'22)
☆102Jun 28, 2024Updated 2 years ago
rubenpt91 / MP-DocVQA-Framework
View on GitHub
☆72Jan 9, 2024Updated 2 years ago
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
koninik / HTG_evaluation
View on GitHub
Official PyTorch Implementation of "Rethinking HTG Evaluation: Bridging Generation and Recognition" (Oral) - 1st Workshop on Critical Eva…
☆17Sep 23, 2024Updated last year
NishantBhavsar / Supplement-Sales-Prediction
View on GitHub
Forecast sales for 350+ supplement retail chain stores for next 2 months. 2nd Rank solution.
☆12Sep 20, 2021Updated 4 years ago
HCIILAB / LAST
View on GitHub
Read Ten Lines at One Glance: Line-Aware Semi-Autoregressive Transformer for Multi-Line Handwritten Mathematical Expression Recognition
☆28Aug 29, 2023Updated 2 years ago
emanuelevivoli / CoMix-dataset
View on GitHub
Repository for "CoMix: Comprehensive Benchmark for Multi-Task Comic Understanding"
☆18Nov 20, 2024Updated last year
kartikgill / taco-box
View on GitHub
An implementation of Tiling and Corruption (TACo) Augmentations for OCR/HTR
☆15Dec 4, 2021Updated 4 years ago
thanhnghiadk / syntactic_HME_generation
View on GitHub
This project aims to generate syntactichandwritten mathematical expression. The dataset is generated from the CROHME 2014 training set.
☆14Feb 24, 2022Updated 4 years ago
danielnbarbosa / soccer_twos
View on GitHub
MADDPG agent with collaboration and competition
☆12Nov 9, 2018Updated 7 years ago
jdfxzzy / DPMN
View on GitHub
Improving Scene Text Image Super-Resolution via Dual Prior Modulation Network (AAAI 2023)
☆61Aug 20, 2024Updated last year
SII-sc22mc / DocFusion
View on GitHub
A Unified Framework for Document Parsing Tasks (Including Document Layout Analysis, OCR, Formula Recognition, and Table Recognition)
☆15Jul 1, 2025Updated last year
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
himanshututeja1998 / Textual-Entailment-Using-BERT
View on GitHub
Textual Entailment Using Pytorch BERT pretrained model
☆11Oct 17, 2022Updated 3 years ago
EDM-Research / VATr-pp
View on GitHub
☆18Jul 9, 2024Updated 2 years ago
wzk1015 / CNMT
View on GitHub
[AAAI 2021] Confidence-aware Non-repetitive Multimodal Transformers for TextCaps
☆24Mar 29, 2023Updated 3 years ago
VDIGPKU / STR_TPSearch
View on GitHub
☆21Mar 15, 2022Updated 4 years ago
jarobyte91 / post_ocr_correction
View on GitHub
Source code for the paper "Post-OCR Document Correction with Large Ensembles of Character Sequence-to-Sequence Models"
☆39Dec 2, 2023Updated 2 years ago
dshea89 / tesseract-retraining-pipeline
View on GitHub
Intuitive interface for fine-tuning and retraining a Tesseract OCR language model
☆10Jul 4, 2025Updated last year
mxin262 / ESTextSpotter
View on GitHub
(ICCV 2023) ESTextSpotter: Towards Better Scene Text Spotting with Explicit Synergy in Transformer
☆78Apr 9, 2024Updated 2 years ago