KhaLee2307 / StrDA
[WACV2025] source code of StrDA: https://arxiv.org/abs/2410.09913
☆11Updated 5 months ago
Alternatives and similar repositories for StrDA
Users who are interested in StrDA are comparing it to the repositories listed below
- ☆12Updated 2 months ago
- UFPR-VCR: a dataset for vehicle color recognition that includes 10,039 images of vehicles in a wide range of real-world conditions, such …☆10Updated 11 months ago
- [⭐️ WACV 2025 Oral ⭐️] PETALface: Parameter Efficient Transfer Learning for Low-resolution Face Recognition☆19Updated 3 months ago
- Context-Informed Machine Translation of Manga using Multimodal Large Language Models☆11Updated 9 months ago
- [NeurIPS 2024] IF-Font: Ideographic Description Sequence-Following Font Generation☆21Updated 6 months ago
- Deep learning sitting posture detection based on multimodal datasets (a deep-learning-based multimodal sitting posture detection system)☆17Updated 9 months ago
- Various video readers for PyTorch models training and a benchmark☆11Updated 3 weeks ago
- This is the official repository for "Can GPTs Evaluate Graphic Design Based on Design Principles?".☆12Updated 7 months ago
- ☆15Updated last year
- Official implementation for ICDAR 2024 Oral paper "ICAL: Implicit Character-Aided Learning for Enhanced Handwritten Mathematical Expressi…☆27Updated last year
- ☆10Updated 10 months ago
- Basic HTR concepts/modules to boost performance☆33Updated 9 months ago
- ☆20Updated 8 months ago
- NAF-DPM: A Nonlinear Activation-Free Diffusion Probabilistic Model for Document Enhancement☆46Updated last year
- Official PyTorch Implementation of "DiffusionPen: Towards Controlling the Style of Handwritten Text Generation" - ECCV 2024☆65Updated 11 months ago
- ☆83Updated 6 months ago
- ☆25Updated last year
- TRACE: Table Reconstruction Aligned to Corner and Edges (ICDAR 2023)☆28Updated last year
- unofficial☆11Updated 11 months ago
- ☆61Updated last year
- ☆23Updated 9 months ago
- [CVPR 2025] DocLayLLM: An Efficient Multi-modal Extension of Large Language Models for Text-rich Document Understanding☆21Updated 6 months ago
- [AAAI 2024] SRFormer: Text Detection Transformer with Incorporated Segmentation and Regression☆66Updated 7 months ago
- This is the official implementation to the EMNLP 2024 paper: Modeling Layout Reading Order as Ordering Relations for Visually-rich Docume…☆28Updated 10 months ago
- This project is mainly the assessment project for the 2025 Zhejiang University School of Software summer camp (AI camp)☆11Updated 6 months ago
- RoDLA: Benchmarking the Robustness of Document Layout Analysis Models☆37Updated 6 months ago
- Official PyTorch implementation of `[ACMMM 2023]Relational Contrastive Learning for Scene Text Recognition`☆17Updated 2 years ago
- Text-DIAE: A Self-Supervised Degradation Invariant Autoencoders for Text Recognition and Document Enhancement - AAAI 2023☆27Updated 2 years ago
- Read Ten Lines at One Glance: Line-Aware Semi-Autoregressive Transformer for Multi-Line Handwritten Mathematical Expression Recognition☆27Updated 2 years ago
- (CVPR 2024) Bridging the Gap Between End-to-End and Two-Step Text Spotting.☆68Updated last year