Official PyTorch Implementation of "DiffusionPen: Towards Controlling the Style of Handwritten Text Generation" - ECCV 2024
☆93Oct 24, 2024Updated last year
Alternatives and similar repositories for DiffusionPen
Users that are interested in DiffusionPen are comparing it to the libraries listed below
Sorting:
- Official PyTorch Implementation of "Rethinking HTG Evaluation: Bridging Generation and Recognition" (Oral) - 1st Workshop on Critical Eva…☆17Sep 23, 2024Updated last year
- Effective caching in differentially-private databases (SOSP '23)☆13Nov 1, 2023Updated 2 years ago
- Official Code for ECCV 2024 paper — One-Shot Diffusion Mimicker for Handwritten Text Generation☆520Oct 15, 2025Updated 4 months ago
- Official implementation for AAAI 2025 paper: SSAN: A Symbol Spatial-Aware Network for Handwritten Mathematical Expression Recognition☆15Jan 21, 2025Updated last year
- Official PyTorch implementation of the WACV 2025 Oral paper "Crafting Distribution Shifts for Validation and Training in Single Source Do…☆23Aug 31, 2025Updated 6 months ago
- This repo contains the official implementation of ICLR 2022 paper "It Takes Two to Tango: Mixup for Deep Metric Learning".☆36May 15, 2024Updated last year
- Create handwritten word embeddings from a text recognition Seq2Seq system.☆11Dec 1, 2022Updated 3 years ago
- ☆27Mar 7, 2025Updated 11 months ago
- This repository contains the official implementation code of the NeurIPS 2025 paper: "Instance-Level Composed Image Retrieval".☆50Dec 22, 2025Updated 2 months ago
- Edge Augmentation for Large Scale Sketch Recognition without Sketches☆30Aug 31, 2025Updated 6 months ago
- Official PyTorch implementation and benchmark dataset for IGARSS 2024 ORAL paper: "Composed Image Retrieval for Remote Sensing"☆81Dec 21, 2024Updated last year
- Handwriting-Transformers (ICCV21)☆251Feb 23, 2024Updated 2 years ago
- ☆19Oct 1, 2021Updated 4 years ago
- A paper collection of recent diffusion models for text-image generation tasks, e,g., visual text generation, font generation, text remova…☆271Dec 19, 2024Updated last year
- The official implementation of RS-STE proposed by our paper Recognition-Synergistic Scene Text Editing (CVPR 2025).☆29Jul 15, 2025Updated 7 months ago
- Optocal Character Recognition (OCR / HTR) using Transformers☆11Aug 20, 2022Updated 3 years ago
- Repository for "CoMix: Comprehensive Benchmark for Multi-Task Comic Understanding"☆16Nov 20, 2024Updated last year
- In OLHWDB ,you can find the ptts files, this code can help you get the information of the ptts☆11Mar 8, 2022Updated 3 years ago
- resources for text detection, text recognition, and end to end text spotting☆11Apr 23, 2023Updated 2 years ago
- ODM: A Text-Image Further Alignment Pre-training Approach for Scene Text Detection and Spotting☆45Apr 11, 2025Updated 10 months ago
- This repo is the official implementation of DeepCalliFont: Few-shot Chinese Calligraphy Font Synthesis by Integrating Dual-modality Gener…☆31May 11, 2024Updated last year
- [ECCV2024] PosFormer: Recognizing Complex Handwritten Mathematical Expression with Position Forest Transformer☆84Apr 10, 2025Updated 10 months ago
- CRNN with Self-Attention☆10Apr 8, 2018Updated 7 years ago
- The official code for "OG-HFYOLO :Orientation Gradient Guidance and Heterogeneous Feature Fusion For Deformation Table Cell Instance Segm…☆13Jul 28, 2025Updated 7 months ago
- The source code repository for the paper.☆21Sep 8, 2025Updated 5 months ago
- Official PyTorch implementation for "Merging and Splitting Diffusion Paths for Semantically Coherent Panoramas", presenting the Merge-Att…☆14Jul 9, 2025Updated 7 months ago
- STIRER: A Unified Model for Low-Resolution Scene Text Image Recovery and Recognition -- ACMMM 2023☆14Dec 2, 2024Updated last year
- Let there be clock in the beach - WACV 2022☆15Nov 15, 2021Updated 4 years ago
- Basic HTR concepts/modules to boost performance☆39Nov 30, 2024Updated last year
- Vision-Language Pre-Training for Boosting Scene Text Detectors (CVPR2022)☆12Mar 21, 2022Updated 3 years ago
- Another LaTex formula OCR tool☆15Feb 15, 2023Updated 3 years ago
- V-MAGE: A Game Evaluation Framework for Assessing Visual-Centric Capabilities in MLLMs☆24Jul 31, 2025Updated 7 months ago
- ☆23Feb 23, 2025Updated last year
- SPRINT: Script-agnostic Structure Recognition in Tables☆16Mar 26, 2025Updated 11 months ago
- [ECCV-W] Official repo for the paper "ComiCap: A VLMs pipeline for dense captioning of Comic Panels"☆14Nov 20, 2024Updated last year
- [AAAI2024] FontDiffuser: One-Shot Font Generation via Denoising Diffusion with Multi-Scale Content Aggregation and Style Contrastive Lear…☆494Mar 14, 2024Updated last year
- ☆121Dec 20, 2020Updated 5 years ago
- [arXiv: 2505.12307] LogicOCR: Do Your Large Multimodal Models Excel at Logical Reasoning on Text-Rich Images?☆35Dec 1, 2025Updated 3 months ago
- [NeurIPS'24] GoMatching: A Simple Baseline for Video Text Spotting via Long and Short Term Matching☆28May 29, 2025Updated 9 months ago