Bhashini-IITJ / visualTranslationLinks
Implementation of Baseline for Scene Text-to-Scene Text Translation
☆18Updated 8 months ago
Alternatives and similar repositories for visualTranslation
Users that are interested in visualTranslation are comparing it to the libraries listed below
Sorting:
- OCR-VQGAN, a discrete image encoder (tokenizer and detokenizer) for figure images in Paper2Fig100k dataset. Implementation of OCR Percept…☆82Updated 2 years ago
- ☆86Updated 9 months ago
- Official implementation of Generative Colorization of Structured Mobile Web Pages, WACV 2023.☆22Updated 2 years ago
- [ICDAR 2023] (Oral) An End-to-End Unified Domain Adaptive Transformer for Document Instance Segmentation☆76Updated last year
- Official PyTorch Implementation of "WordStylist: Styled Verbatim Handwritten Text Generation with Latent Diffusion Models" - ICDAR 2023☆82Updated last year
- Text-DIAE: A Self-Supervised Degradation Invariant Autoencoders for Text Recognition and Document Enhancement - AAAI 2023☆28Updated 2 years ago
- ☆16Updated 11 months ago
- Official PyTorch Implementation of "DiffusionPen: Towards Controlling the Style of Handwritten Text Generation" - ECCV 2024☆81Updated last year
- This repository is the code of our paper "DiffUTE: Universal Text Editing Diffusion Model" (NeurIPS'2023).☆144Updated 8 months ago
- ☆40Updated 2 years ago
- Source code for paper: "AltDiffusion: A multilingual Text-to-Image diffusion model"☆43Updated last year
- ☆99Updated last year
- Synthetic identity documents dataset☆30Updated 9 months ago
- ☆45Updated last year
- Code for AAAI 2023 Paper : “Alignment-Enriched Tuning for Patch-Level Pre-trained Document Image Models”☆18Updated 3 years ago
- ☆69Updated 3 years ago
- Official implementation for the paper "Transferring Visual Knowledge with Pre-trained Models for Multimodal Machine Translation", publish…☆20Updated last year
- ☆26Updated last year
- Code for CVPR21 paper A Multiplexed Network for End-to-End, Multilingual OCR☆80Updated 3 years ago
- Belief Revision based Caption Re-ranker with Visual Semantic Information. COLING 2022☆11Updated 8 months ago
- (Pattern Recognition) Pytorch implementation of “HTR-VT: Handwritten Text Recognition with Vision Transformer”☆109Updated last month
- ☆17Updated last year
- [CVPR 2025] DocLayLLM: An Efficient Multi-modal Extension of Large Language Models for Text-rich Document Understanding☆22Updated 9 months ago
- Diffusion-based markup-to-image generation☆83Updated 2 years ago
- ☆87Updated last year
- Cross-lingual learning in scene text recognition (ICASSP2024)☆18Updated last year
- Official repository accompaying the ICDAR 2023 paper☆12Updated 2 years ago
- This is the official implementation to the EMNLP 2024 paper: Modeling Layout Reading Order as Ordering Relations for Visually-rich Docume…☆28Updated last year
- DocEnTr: An end-to-end document image enhancement transformer - ICPR 2022☆180Updated 11 months ago
- Official repository of the paper: "A Comprehensive Gold Standard and Benchmark for Comics Text Detection and Recognition"☆25Updated 2 years ago