KhaLee2307 / StrDA
[WACV2025] source code of StrDA: https://arxiv.org/abs/2410.09913
☆11Updated 3 weeks ago
Alternatives and similar repositories for StrDA:
Users that are interested in StrDA are comparing it to the libraries listed below
- Official PyTorch Implementation of "DiffusionPen: Towards Controlling the Style of Handwritten Text Generation" - ECCV 2024☆45Updated 5 months ago
- The largest VQA dataset for Vietnamese. Related to the text content in the image.☆16Updated last month
- ☆24Updated last year
- ODM: A Text-Image Further Alignment Pre-training Approach for Scene Text Detection and Spotting☆33Updated 2 weeks ago
- [ICLR 2025] CAMEx: Curvature-Aware Merging of Experts☆17Updated last month
- Text-DIAE: A Self-Supervised Degradation Invariant Autoencoders for Text Recognition and Document Enhancement - AAAI 2023☆23Updated last year
- NAF-DPM: A Nonlinear Activation-Free Diffusion Probabilistic Model for Document Enhancement☆42Updated 7 months ago
- (CVPR 2024) Bridging the Gap Between End-to-End and Two-Step Text Spotting.☆59Updated 9 months ago
- Evaluating GPT-4o's image generation and editing ability in OCR tasks.☆28Updated this week
- ☆40Updated 8 months ago
- [2024-NeurIPS] TextCtrl: Diffusion-based Scene Text Editing with Prior Guidance Control☆69Updated 2 weeks ago
- ☆80Updated 3 weeks ago
- ☆14Updated 8 months ago
- Vietnamese handwritten text recognition system☆17Updated 3 years ago
- Official implementation of CVPR24 paper "Gradient Alignment for Cross-Domain Face Anti-Spoofing"☆70Updated last year
- An implementation of "CLIP4STR: A Simple Baseline for Scene Text Recognition with Pre-trained Vision-Language Model".☆136Updated 3 weeks ago
- Official PyTorch Implementation of "WordStylist: Styled Verbatim Handwritten Text Generation with Latent Diffusion Models" - ICDAR 2023☆78Updated 9 months ago
- [AAAI 2024] SRFormer: Text Detection Transformer with Incorporated Segmentation and Regression☆63Updated last month
- [ICASSP2024] An official implement of the paper "EFFICIENT SCENE TEXT IMAGE SUPER-RESOLUTION WITH SEMANTIC GUIDANCE"☆21Updated 10 months ago
- Pioneering in Vietnamese Multimodal Large Language Model☆46Updated 2 months ago
- [CVPR 2025] h-Edit: Effective and Flexible Diffusion-Based Editing via Doob’s h-Transform☆36Updated last week
- ☆37Updated last year
- Official PyTorch implementation of `[ACMMM 2023]Relational Contrastive Learning for Scene Text Recognition`☆17Updated last year
- This is the official repository for Vista dataset - A Vietnamese multimodal dataset contains more than 700,000 samples of conversations a…☆25Updated 10 months ago
- ☆92Updated 8 months ago
- The official project of paper "Visual Text Meets Low-level Vision: A Comprehensive Survey on Visual Text Processing"☆59Updated 2 months ago
- Expanding on the second contribution of "Perception Prioritized Training of Diffusion Models" (CVPR'22) with an implementation and extens…☆18Updated 2 years ago
- Official PyTorch implementation of "CBNet: A Plug-and-Play Network for Segmentation-Based Scene Text Detection"☆19Updated last year
- ☆13Updated 2 months ago
- ☆81Updated last month