Bhashini-IITJ / visualTranslation
Implementation of Baseline for Scene Text-to-Scene Text Translation
☆18 Updated 10 months ago
Alternatives and similar repositories for visualTranslation
Users who are interested in visualTranslation are comparing it to the libraries listed below
- OCR-VQGAN, a discrete image encoder (tokenizer and detokenizer) for figure images in Paper2Fig100k dataset. Implementation of OCR Percept… ☆83 Updated 3 years ago
- [ICDAR 2023] (Oral) An End-to-End Unified Domain Adaptive Transformer for Document Instance Segmentation ☆76 Updated last year
- ☆87 Updated 11 months ago
- This is the official implementation to the EMNLP 2024 paper: Modeling Layout Reading Order as Ordering Relations for Visually-rich Docume… ☆30 Updated 3 weeks ago
- Text-DIAE: A Self-Supervised Degradation Invariant Autoencoders for Text Recognition and Document Enhancement - AAAI 2023 ☆28 Updated 2 years ago
- This repository is the code of our paper "DiffUTE: Universal Text Editing Diffusion Model" (NeurIPS'2023). ☆144 Updated 10 months ago
- Code for AAAI 2023 Paper : “Alignment-Enriched Tuning for Patch-Level Pre-trained Document Image Models” ☆18 Updated 3 years ago
- Official PyTorch Implementation of "WordStylist: Styled Verbatim Handwritten Text Generation with Latent Diffusion Models" - ICDAR 2023 ☆82 Updated last year
- Official implementation of Generative Colorization of Structured Mobile Web Pages, WACV 2023. ☆22 Updated 2 years ago
- ☆101 Updated last year
- Official PyTorch Implementation of "DiffusionPen: Towards Controlling the Style of Handwritten Text Generation" - ECCV 2024 ☆85 Updated last year
- ☆16 Updated last year
- ☆42 Updated 3 years ago
- ☆27 Updated last year
- (Pattern Recognition) Pytorch implementation of “HTR-VT: Handwritten Text Recognition with Vision Transformer” ☆118 Updated 2 weeks ago
- Code for ACL 2024 findings paper "wav2vec-S: Adapting Pre-trained Speech Models for Streaming" ☆10 Updated 9 months ago
- Source code for paper: "AltDiffusion: A multilingual Text-to-Image diffusion model" ☆44 Updated last year
- Synthetic identity documents dataset ☆33 Updated 11 months ago
- Official Implementation of Web-based Visual Corpus Builder (Webvicob), ICDAR 2023 ☆109 Updated 2 years ago
- ☆16 Updated last year
- ☆44 Updated last year
- Code for CVPR21 paper A Multiplexed Network for End-to-End, Multilingual OCR ☆80 Updated 3 years ago
- Official repository of the paper: "A Comprehensive Gold Standard and Benchmark for Comics Text Detection and Recognition" ☆26 Updated 2 years ago
- ☆25 Updated 10 months ago
- ☆69 Updated 3 years ago
- ☆17 Updated last year
- A Versatile Face Encoder for Zero-Shot Diffusion Model Personalization ☆24 Updated 6 months ago
- The official PyTorch implementation for arXiv'23 paper 'LayoutDETR: Detection Transformer Is a Good Multimodal Layout Designer' ☆103 Updated 8 months ago
- ☆45 Updated last year
- [ICCV 2023] ChartReader: A Unified Framework for Chart Derendering and Comprehension without Heuristic Rules ☆28 Updated last year