ku21fan / CLL-STR
Cross-lingual learning in scene text recognition (ICASSP 2024)
☆15Updated last month
Related projects
Alternatives and complementary repositories for CLL-STR
- Official repository accompanying the ICDAR 2023 paper☆10Updated last year
- Code for the AAAI 2023 paper: “Alignment-Enriched Tuning for Patch-Level Pre-trained Document Image Models”☆17Updated last year
- This repository contains the source code for SoftCTC. The original paper can be found here: https://arxiv.org/abs/2212.02135 ☆19Updated last year
- PyTorch implementation of STR models for transfer learning in Indic Languages☆16Updated 3 years ago
- The largest VQA dataset for Vietnamese. Related to the text content in the image.☆16Updated 6 months ago
- Official repository of the paper: "A Comprehensive Gold Standard and Benchmark for Comics Text Detection and Recognition"☆24Updated last year
- Implementation of the paper: Going Full-TILT Boogie on Document Understanding with Text-Image-Layout Transformer.☆16Updated last year
- High-Performance Transformers for Table Structure Recognition Need Early Convolutions☆41Updated 7 months ago
- TextAdaIN: Paying Attention to Shortcut Learning in Text Recognizers☆21Updated 2 years ago
- WikiTableSet: The largest publicly available image-based table recognition dataset in three languages, built from Wikipedia☆25Updated last year
- Official PyTorch implementation of "Relational Contrastive Learning for Scene Text Recognition" (ACMMM 2023)☆17Updated last year
- DCutMix official repo☆10Updated last year
- Pixel Parsing. A reproduction of OCR-free end-to-end document understanding models with open data☆18Updated 3 months ago
- PyTorch implementation of the BMVC 2022 paper "Masked Vision-Language Transformers for Scene Text Recognition"☆29Updated 2 years ago
- Simple example of FastAPI + Celery + Triton for benchmarking☆61Updated 2 years ago
- TAT-DQA: Towards Complex Document Understanding By Discrete Reasoning☆19Updated 2 months ago
- A dashboard for exploring timm learning rate schedulers☆18Updated last year
- Searching a High Performance Feature Extractor for Text Recognition Network. TPAMI 2022☆13Updated last year
- Text-DIAE: A Self-Supervised Degradation Invariant Autoencoder for Text Recognition and Document Enhancement (AAAI 2023)☆23Updated last year
- Tool to parse wiki tables from the HTML dump of Wikipedia☆11Updated 2 years ago
- Contrast-guided Feature Adjustment Module for Visual Information Extraction☆28Updated last year
- Official Implementation of SCOB [ICCV 2023]☆22Updated last year
- MTVQA: Benchmarking Multilingual Text-Centric Visual Question Answering. A comprehensive evaluation of multimodal large model multilingua…☆45Updated last month
- Implementation of TableFormer (Robust Transformer Modeling for Table-Text Encoding) in PyTorch☆36Updated 2 years ago