[CVPR 2023 highlight] Towards Flexible Multi-modal Document Models
☆59Sep 7, 2023Updated 2 years ago
Alternatives and similar repositories for flex-dm
Users that are interested in flex-dm are comparing it to the libraries listed below
Sorting:
- Implementation of CanvasVAE: Learning to Generate Vector Graphic Documents, ICCV 2021☆70Mar 7, 2023Updated 2 years ago
- Official repository for "PosterLayout: A New Benchmark and Approach for Content-aware Visual-Textual Presentation Layout" (CVPR 2023).☆146Mar 31, 2025Updated 11 months ago
- Official implementation of Generative Colorization of Structured Mobile Web Pages, WACV 2023.☆22Dec 7, 2023Updated 2 years ago
- Code for the paper "Harmonious Textual Layout Generation over Natural Images via Deep Aesthetics Learning" (TMM 2021)☆40Aug 3, 2022Updated 3 years ago
- [CVPR 2023] LayoutDM: Discrete Diffusion Model for Controllable Layout Generation☆294Oct 24, 2023Updated 2 years ago
- ☆82Feb 14, 2023Updated 3 years ago
- Layout Generation and Baseline implementations☆163Jul 10, 2022Updated 3 years ago
- Collection of Aesthetics Assessment Papers for Graphic Designs.☆35Aug 29, 2025Updated 6 months ago
- An awesome list of layout generation papers☆269Mar 22, 2025Updated 11 months ago
- [MM2023] An official implement of the paper "One-stage Low-resolution Text Recognition with High-resolution Knowledge Transfer"☆16Nov 3, 2023Updated 2 years ago
- Official Repo of Graphist☆129Apr 23, 2024Updated last year
- [CVPR 2024 Oral] Official repository for RALF: Retrieval-Augmented Layout Transformer for Content-Aware Layout Generation☆141Jul 6, 2024Updated last year
- OCR-VQGAN, a discrete image encoder (tokenizer and detokenizer) for figure images in Paper2Fig100k dataset. Implementation of OCR Percept…☆83Jan 30, 2023Updated 3 years ago
- PyTorch implementation of "LayoutTransformer: Layout Generation and Completion with Self-attention" to appear in ICCV 2021☆165Jan 25, 2022Updated 4 years ago
- Official code for paper: Desigen: A Pipeline for Controllable Design Template Generation [CVPR'24]☆75Jul 18, 2024Updated last year
- [TOG 2025] Order Matters: Learning Element Ordering for Graphic Design Generation☆20Aug 5, 2025Updated 6 months ago
- [CVPR 2021] Rethinking Text Segmentation: A Novel Dataset and A Text-Specific Refinement Approach☆271Dec 2, 2023Updated 2 years ago
- ☆151Jan 31, 2024Updated 2 years ago
- This repository is the implementation of "Don't Forget Me: Accurate Background Recovery for Text Removal via Modeling Local-Global Contex…☆96Feb 21, 2023Updated 3 years ago
- This is a data repository for the ACL 2020 paper: "Let Me Choose: From Verbal Context to Font Selection"☆10May 5, 2020Updated 5 years ago
- Public code release for: PosterChild: Blend-Aware Artistic Posterization (EGSR 2021) [Cheng-Kang Ted Chao, Karan Singh, Yotam Gingold]☆12Apr 29, 2024Updated last year
- The official PyTorch implementation for arXiv'23 paper 'LayoutDETR: Detection Transformer Is a Good Multimodal Layout Designer'☆103May 16, 2025Updated 9 months ago
- PyTorch implementation of BMVC2022 paper Masked Vision-Language Transformers for Scene Text Recognition☆29Nov 11, 2022Updated 3 years ago
- DreamDance: Personalized Text-to-video Generation by Combining Text-to-Image Synthesis and Motion Transfer☆14Dec 16, 2022Updated 3 years ago
- This is a repository for the ACL 2020 paper: "Let Me Choose: From Verbal Context to Font Selection"☆12Nov 21, 2022Updated 3 years ago
- OpenCOLE: Towards Reproducible Automatic Graphic Design Generation [Inoue+, CVPRW2024 (GDUG)]☆85Mar 12, 2025Updated 11 months ago
- ☆26Oct 20, 2022Updated 3 years ago
- ☆156May 8, 2025Updated 9 months ago
- Official implementation of the MM'21 paper "Constrained Graphic Layout Generation via Latent Optimization" (LayoutGAN++, CLG-LO, and Layo…☆139Jul 24, 2023Updated 2 years ago
- Official repo for LayoutGPT☆401Apr 10, 2024Updated last year
- ☆27May 22, 2021Updated 4 years ago
- BTS: A Bi-lingual Benchmark for Text Segmentation in the Wild☆33Apr 16, 2024Updated last year
- Real-CE: A Benchmark for Chinese-English Scene Text Image Super-resolution (ICCV2023)☆96Nov 3, 2023Updated 2 years ago
- Source code of the TextLap model, a LLM for text-2-layout generation.☆17Oct 21, 2024Updated last year
- Demo scripts for Manga109☆12Nov 20, 2021Updated 4 years ago
- STIRER: A Unified Model for Low-Resolution Scene Text Image Recovery and Recognition -- ACMMM 2023☆14Dec 2, 2024Updated last year
- ☆152Dec 17, 2024Updated last year
- Official release of the benchmark in paper "VSP: Diagnosing the Dual Challenges of Perception and Reasoning in Spatial Planning Tasks for…☆16Aug 1, 2025Updated 7 months ago
- Official PyTorch implementation of `[ACMMM 2023]Relational Contrastive Learning for Scene Text Recognition`☆17Sep 22, 2023Updated 2 years ago