ihdia / seamformer
Official repository accompanying the ICDAR 2023 paper
☆10Updated last year
Related projects
Alternatives and complementary repositories for seamformer
- PyTorch implementation of STR models for transfer learning in Indic Languages☆16Updated 3 years ago
- A Bottom-Up Instance Segmentation Strategy for segmenting document instances using Transformers☆55Updated 2 months ago
- Basic HTR concepts/modules to boost performance☆21Updated 4 months ago
- ☆25Updated 5 months ago
- ☆35Updated 4 months ago
- Handwritten Text Recognition in Few-shot Scenario☆20Updated last year
- [ICDAR 2023] (Oral) An End-to-End Unified Domain Adaptive Transformer for Document Instance Segmentation☆70Updated 2 months ago
- RoDLA: Benchmarking the Robustness of Document Layout Analysis Models☆28Updated 7 months ago
- High-Performance Transformers for Table Structure Recognition Need Early Convolutions☆41Updated 7 months ago
- ☆77Updated last year
- Code for AAAI 2023 Paper : “Alignment-Enriched Tuning for Patch-Level Pre-trained Document Image Models”☆17Updated last year
- This project aims to generate syntactic handwritten mathematical expressions. The dataset is generated from the CROHME 2014 training set.☆12Updated 2 years ago
- ☆22Updated 9 months ago
- ☆19Updated 3 years ago
- PyTorch implementation of BMVC2022 paper Masked Vision-Language Transformers for Scene Text Recognition☆29Updated 2 years ago
- Attention-based sequence-to-sequence model for handwritten word recognition☆56Updated 2 months ago
- Contrast-guided Feature Adjustment Module for Visual Information Extraction☆28Updated last year
- Text-DIAE: A Self-Supervised Degradation Invariant Autoencoders for Text Recognition and Document Enhancement - AAAI 2023☆23Updated last year
- Official implementation of UPOCR: Towards unified pixel-level OCR interface (ICML 2024)☆39Updated 5 months ago
- [ICDAR 2023] SelfDocSeg: A self-supervised vision-based approach towards Document Segmentation (Oral)☆38Updated last year
- Table detection (TD) and table structure recognition (TSR) using Yolov5/Yolov8, and you can get the same (even better) result compared w…☆41Updated 4 months ago
- TextAdaIN: Paying Attention to Shortcut Learning in Text Recognizers☆21Updated 2 years ago
- ☆35Updated last year
- The largest VQA dataset for Vietnamese. Related to the text content in the image.☆16Updated 6 months ago
- (CVPR 2024) Bridging the Gap Between End-to-End and Two-Step Text Spotting.☆50Updated 5 months ago
- Deep learning, Convolutional neural networks, Image processing, Document processing, Table detection, Page object detection, Table classi…☆65Updated 8 months ago
- ☆18Updated last year
- Code for CVPR21 paper A Multiplexed Network for End-to-End, Multilingual OCR☆79Updated last year
- ☆15Updated last year
- Code for the ICDAR2021 paper "Visual FUDGE: Form Understanding via Dynamic Graph Editing"☆33Updated 2 years ago