KhaLee2307 / StrDA
[WACV 2025] Source code of StrDA: https://arxiv.org/abs/2410.09913
☆12 Updated 9 months ago
Alternatives and similar repositories for StrDA
Users who are interested in StrDA are comparing it to the repositories listed below:
- UFPR-VCR: a dataset for vehicle color recognition that includes 10,039 images of vehicles in a wide range of real-world conditions, such … ☆10 Updated last year
- Various video readers for PyTorch models training and a benchmark ☆12 Updated last week
- [⭐️ WACV 2025 Oral ⭐️] PETALface: Parameter Efficient Transfer Learning for Low-resolution Face Recognition ☆29 Updated 8 months ago
- Official Pytorch Implementation of "Cross-Attention Head Position Patterns Can Align with Human Visual Concepts in Text-to-Image Generati… ☆11 Updated 5 months ago
- ☆12 Updated 6 months ago
- ☆10 Updated last year
- Context-Informed Machine Translation of Manga using Multimodal Large Language Models ☆17 Updated last year
- This is the official repository for "Can GPTs Evaluate Graphic Design Based on Design Principles?". ☆13 Updated last year
- ☆15 Updated last year
- PyTorch implementation of the paper: "What Do You See? Enhancing Zero-Shot Image Classification with Multimodal Vision-Language Models." … ☆10 Updated 11 months ago
- unofficial ☆12 Updated last year
- Official implementation for P2SAM (ACM MM 2024) ☆14 Updated last year
- A comprehensive list [Hi-SAM@TPAMI'24, GoMatching@NeurIPS'24, DeepSolo(++)@CVPR'23, DPText-DETR@AAAI'23, I3CL@IJCV'22] of our research w… ☆92 Updated last year
- Official PyTorch Implementation of "Rethinking HTG Evaluation: Bridging Generation and Recognition" (Oral) - 1st Workshop on Critical Eva… ☆17 Updated last year
- ☆89 Updated last year
- (CVPR 2024) Bridging the Gap Between End-to-End and Two-Step Text Spotting. ☆71 Updated last year
- [NeurIPS 2024] IF-Font: Ideographic Description Sequence-Following Font Generation ☆34 Updated 10 months ago
- "Towards Improving Document Understanding: An Exploration on Text-Grounding via MLLMs" 2023 ☆16 Updated last year
- Official PyTorch Implementation of "WordStylist: Styled Verbatim Handwritten Text Generation with Latent Diffusion Models" - ICDAR 2023 ☆82 Updated last year
- ☆25 Updated last year
- Detecting Omissions in Geographic Maps through Computer Vision (MAPR'24) ☆23 Updated last year
- 2D-TPE: Two-Dimensional Positional Encoding Enhances Table Understanding for Large Language Models (WWW 2025) ☆10 Updated 9 months ago
- Official implementation for ICDAR 2024 Oral paper "ICAL: Implicit Character-Aided Learning for Enhanced Handwritten Mathematical Expressi… ☆28 Updated last year
- The official repo for the technical report "Scalable Mask Annotation for Video Text Spotting" ☆16 Updated 2 years ago
- Official PyTorch implementation of `[ACMMM 2023]Relational Contrastive Learning for Scene Text Recognition` ☆17 Updated 2 years ago
- [AAAI 2024] SRFormer: Text Detection Transformer with Incorporated Segmentation and Regression ☆67 Updated 11 months ago
- The largest VQA dataset for Vietnamese. Related to the text content in the image. ☆19 Updated 10 months ago
- [ICASSP2024] An official implementation of the paper "EFFICIENT SCENE TEXT IMAGE SUPER-RESOLUTION WITH SEMANTIC GUIDANCE" ☆25 Updated last year
- ☆27 Updated last year
- Official PyTorch Implementation of "DiffusionPen: Towards Controlling the Style of Handwritten Text Generation" - ECCV 2024 ☆85 Updated last year