[CVPR 2023 highlight] Towards Flexible Multi-modal Document Models
☆59Sep 7, 2023Updated 2 years ago
Alternatives and similar repositories for flex-dm
Users who are interested in flex-dm are comparing it to the libraries listed below
- Implementation of CanvasVAE: Learning to Generate Vector Graphic Documents, ICCV 2021☆70Mar 7, 2023Updated 3 years ago
- [CVPR 2023] LayoutDM: Discrete Diffusion Model for Controllable Layout Generation☆293Oct 24, 2023Updated 2 years ago
- Official repository for "PosterLayout: A New Benchmark and Approach for Content-aware Visual-Textual Presentation Layout" (CVPR 2023).☆147Mar 31, 2025Updated 11 months ago
- ☆82Feb 14, 2023Updated 3 years ago
- ☆152Jan 31, 2024Updated 2 years ago
- Layout Generation and Baseline implementations☆163Jul 10, 2022Updated 3 years ago
- An awesome list of layout generation papers☆269Mar 22, 2025Updated last year
- This is a data repository for the ACL 2020 paper: "Let Me Choose: From Verbal Context to Font Selection"☆10May 5, 2020Updated 5 years ago
- Collection of Aesthetics Assessment Papers for Graphic Designs.☆35Aug 29, 2025Updated 6 months ago
- Code for the paper "Harmonious Textual Layout Generation over Natural Images via Deep Aesthetics Learning" (TMM 2021)☆40Aug 3, 2022Updated 3 years ago
- PyTorch implementation of "LayoutTransformer: Layout Generation and Completion with Self-attention" to appear in ICCV 2021☆166Jan 25, 2022Updated 4 years ago
- Official Repo of Graphist☆128Apr 23, 2024Updated last year
- OCR-VQGAN, a discrete image encoder (tokenizer and detokenizer) for figure images in Paper2Fig100k dataset. Implementation of OCR Percept…☆83Jan 30, 2023Updated 3 years ago
- [MM2023] An official implementation of the paper "One-stage Low-resolution Text Recognition with High-resolution Knowledge Transfer"☆16Nov 3, 2023Updated 2 years ago
- This is a repository for the ACL 2020 paper: "Let Me Choose: From Verbal Context to Font Selection"☆12Nov 21, 2022Updated 3 years ago
- Cheng-Fu Yang*, Wan-Cyuan Fan*, Fu-En Yang, Yu-Chiang Frank Wang, "LayoutTransformer: Scene Layout Generation with Conceptual and Spatial…☆63Apr 3, 2022Updated 3 years ago
- This is the official repository for "Can GPTs Evaluate Graphic Design Based on Design Principles?".☆13Feb 10, 2025Updated last year
- Official code for paper: Desigen: A Pipeline for Controllable Design Template Generation [CVPR'24]☆75Jul 18, 2024Updated last year
- ☆207Jan 6, 2025Updated last year
- [CVPR 2024 Oral] Official repository for RALF: Retrieval-Augmented Layout Transformer for Content-Aware Layout Generation☆143Jul 6, 2024Updated last year
- ☆157May 8, 2025Updated 10 months ago
- The official PyTorch implementation for arXiv'23 paper 'LayoutDETR: Detection Transformer Is a Good Multimodal Layout Designer'☆102May 16, 2025Updated 10 months ago
- [CVPR 2021] Rethinking Text Segmentation: A Novel Dataset and A Text-Specific Refinement Approach☆271Dec 2, 2023Updated 2 years ago
- Resources about Design + AI (papers, datasets, events, companies, etc.)☆63Mar 17, 2021Updated 5 years ago
- ☆28May 22, 2021Updated 4 years ago
- Official repo for LayoutGPT☆401Apr 10, 2024Updated last year
- Official implementation of the MM'21 paper "Constrained Graphic Layout Generation via Latent Optimization" (LayoutGAN++, CLG-LO, and Layo…☆139Jul 24, 2023Updated 2 years ago
- This repository is the implementation of "Don't Forget Me: Accurate Background Recovery for Text Removal via Modeling Local-Global Contex…☆96Feb 21, 2023Updated 3 years ago
- PyTorch implementation of BMVC2022 paper Masked Vision-Language Transformers for Scene Text Recognition☆29Nov 11, 2022Updated 3 years ago
- Text-To-Image Generation with Chinese Characters☆133Jul 20, 2023Updated 2 years ago
- BTS: A Bi-lingual Benchmark for Text Segmentation in the Wild☆33Apr 16, 2024Updated last year
- This repository is the code of our paper "DiffUTE: Universal Text Editing Diffusion Model" (NeurIPS'2023).☆144Apr 11, 2025Updated 11 months ago
- https://cyberagent.connpass.com/event/270424/☆10Mar 5, 2023Updated 3 years ago
- DreamDance: Personalized Text-to-video Generation by Combining Text-to-Image Synthesis and Motion Transfer☆14Dec 16, 2022Updated 3 years ago
- ☆29Sep 12, 2022Updated 3 years ago
- STIRER: A Unified Model for Low-Resolution Scene Text Image Recovery and Recognition -- ACMMM 2023☆14Dec 2, 2024Updated last year
- ☆43Sep 12, 2024Updated last year
- MetricEval: A framework that conceptualizes and operationalizes four main components of metric evaluation, in terms of reliability and va…☆12Nov 6, 2023Updated 2 years ago
- Official release of the benchmark in paper "VSP: Diagnosing the Dual Challenges of Perception and Reasoning in Spatial Planning Tasks for…☆16Aug 1, 2025Updated 7 months ago