omron-sinicx / scipostlayout
☆20Updated 9 months ago
Alternatives and similar repositories for scipostlayout
Users that are interested in scipostlayout are comparing it to the libraries listed below
Sorting:
- Continuous diffusion for layout generation☆42Updated 2 months ago
- OpenCOLE: Towards Reproducible Automatic Graphic Design Generation [Inoue+, CVPRW2024 (GDUG)]☆67Updated 2 months ago
- [CVPR 2023 highlight] Towards Flexible Multi-modal Document Models☆56Updated last year
- ECCV2024_Parrot Captions Teach CLIP to Spot Text☆66Updated 8 months ago
- Evaluating GPT-4o's image generation and editing ability in OCR tasks.☆44Updated last month
- ☆95Updated last year
- [NeurIPS2023] This is the official code of the paper "GlyphControl: Glyph Conditional Control for Visual Text Generation"☆232Updated 10 months ago
- The official code for “DeepEraser: Deep Iterative Context Mining for Generic Text Eraser”, TMM, 2024.☆36Updated 8 months ago
- Implementation of CanvasVAE: Learning to Generate Vector Graphic Documents, ICCV 2021☆65Updated 2 years ago
- LayoutDiT: Exploring Content-Graphic Balance in Layout Generation with Diffusion Transformer☆42Updated 4 months ago
- Code for ACM MM'23 paper: LayoutLLM-T2I: Eliciting Layout Guidance from LLM for Text-to-Image Generation☆48Updated 9 months ago
- [NeurIPS'24] GoMatching: A Simple Baseline for Video Text Spotting via Long and Short Term Matching☆23Updated 5 months ago
- LayoutFlow: Flow Matching for Layout Generation [Andrade Guerreiro et al., ECCV 2024]☆32Updated this week
- [CVPR2025] Official implementation of High Fidelity Scene Text Synthesis.☆62Updated last month
- A Large-scale Dataset for training and evaluating model's ability on Dense Text Image Generation☆68Updated 2 months ago
- This repository is the code of our paper "DiffUTE: Universal Text Editing Diffusion Model" (NeurIPS'2023).☆132Updated last month
- AnyTrans: Translate AnyText in the Image with Large Scale Models (EMNLP2024 Findings)☆16Updated 5 months ago
- ☆23Updated last year
- Text-To-Image Generation with Chinese Characters☆131Updated last year
- [2024-NeurIPS] TextCtrl: Diffusion-based Scene Text Editing with Prior Guidance Control☆77Updated 2 months ago
- OCR-VQGAN, a discrete image encoder (tokenizer and detokenizer) for figure images in Paper2Fig100k dataset. Implementation of OCR Percept…☆81Updated 2 years ago
- ☆38Updated last year
- Official code for paper: Desigen: A Pipeline for Controllable Design Template Generation [CVPR'24]☆69Updated 10 months ago
- ☆37Updated 6 months ago
- [IJCV 2025] Smaller But Better: Unifying Layout Generation with Smaller Large Language Models☆21Updated 2 months ago
- A bug-free and improved implementation of LLaVA-UHD, based on the code from the official repo☆34Updated 9 months ago
- The official project of paper "Visual Text Meets Low-level Vision: A Comprehensive Survey on Visual Text Processing"☆63Updated 2 weeks ago
- The official repo for “TextCoT: Zoom In for Enhanced Multimodal Text-Rich Image Understanding”.☆39Updated 7 months ago
- Official Implementation of ICLR'24: Kosmos-G: Generating Images in Context with Multimodal Large Language Models☆71Updated 11 months ago
- Official Repo of Graphist☆115Updated last year