ku21fan / CLL-STR
Cross-lingual learning in scene text recognition (ICASSP2024)
☆13Updated 5 months ago
Related projects:
- PyTorch implementation of STR models for transfer learning in Indic Languages☆15Updated 3 years ago
- Official repository accompanying the ICDAR 2023 paper☆10Updated 11 months ago
- ☆19Updated 7 months ago
- High-Performance Transformers for Table Structure Recognition Need Early Convolutions☆38Updated 5 months ago
- The largest VQA dataset for Vietnamese. Related to the text content in the image.☆16Updated 4 months ago
- Basic HTR concepts/modules to boost performance☆16Updated 2 months ago
- This repository contains source codes for SoftCTC. Original paper can be found here: https://arxiv.org/abs/2212.02135☆15Updated last year
- Code for AAAI 2023 Paper : “Alignment-Enriched Tuning for Patch-Level Pre-trained Document Image Models”☆17Updated last year
- TextAdaIN: Paying Attention to Shortcut Learning in Text Recognizers☆21Updated 2 years ago
- PyTorch implementation of BMVC2022 paper Masked Vision-Language Transformers for Scene Text Recognition☆29Updated last year
- Official repository of the paper: "A Comprehensive Gold Standard and Benchmark for Comics Text Detection and Recognition"☆24Updated last year
- Official Implementation of SCOB [ICCV 2023]☆22Updated 10 months ago
- DocReal: Robust Document Dewarping of Real-Life Images via Attention-Enhanced Control Point Prediction☆12Updated last year
- Contrast-guided Feature Adjustment Module for Visual Information Extraction☆28Updated last year
- WikiTableSet: The largest publicly available image-based table recognition dataset in three languages built from Wikipedia☆23Updated last year
- [AAAI 2024] SRFormer: Text Detection Transformer with Incorporated Segmentation and Regression☆53Updated 2 weeks ago
- arXiv 23 "Towards Improving Document Understanding: An Exploration on Text-Grounding via MLLMs"☆11Updated 7 months ago
- ☆12Updated 2 months ago
- Table detection (TD) and table structure recognition (TSR) using Yolov5/Yolov8, and you can get the same (even better) result compared w…☆35Updated 2 months ago
- Implementation of the paper: Going Full-TILT Boogie on Document Understanding with Text-Image-Layout Transformer.☆16Updated last year
- [ICDAR 2023] SelfDocSeg: A self-supervised vision-based approach towards Document Segmentation (Oral)☆36Updated 11 months ago
- Text-DIAE: A Self-Supervised Degradation Invariant Autoencoders for Text Recognition and Document Enhancement - AAAI 2023☆23Updated last year
- ☆38Updated last year
- ☆40Updated 2 years ago
- OnnxTR a docTR (Document Text Recognition) library Onnx pipeline wrapper - for seamless, high-performing & accessible OCR☆30Updated 3 weeks ago
- ☆17Updated last year
- ☆34Updated 2 months ago
- ☆34Updated 10 months ago
- Code for CVPR21 paper A Multiplexed Network for End-to-End, Multilingual OCR☆79Updated last year
- Official PyTorch implementation of `[ACMMM 2023]Relational Contrastive Learning for Scene Text Recognition`☆16Updated 11 months ago