emanuelevivoli / CoMixLinks
Comics Dataset Framework for Comics Understanding
☆31Updated 2 months ago
Alternatives and similar repositories for CoMix
Users that are interested in CoMix are comparing it to the libraries listed below
Sorting:
- The official repo of the Comics Survey: "A missing piece in Vision and Language: A Survey on Comics Understanding"☆127Updated 9 months ago
- ☆98Updated last year
- [NeurIPS2023] This is the official code of the paper "GlyphControl: Glyph Conditional Control for Visual Text Generation"☆237Updated last year
- [ECCV-W] Official repo for the paper "ComiCap: A VLMs pipeline for dense captioning of Comic Panels"☆14Updated 11 months ago
- Official code for paper: Desigen: A Pipeline for Controllable Design Template Generation [CVPR'24]☆71Updated last year
- Repository for "CoMix: Comprehensive Benchmark for Multi-Task Comic Understanding"☆13Updated 11 months ago
- ☆99Updated last year
- OpenCOLE: Towards Reproducible Automatic Graphic Design Generation [Inoue+, CVPRW2024 (GDUG)]☆80Updated 7 months ago
- ☆16Updated 9 months ago
- [ECCV 2024] Official repo for UDiffText: A Unified Framework for High-quality Text Synthesis in Arbitrary Images via Character-aware Diff…☆231Updated 8 months ago
- [CVPR 2024] Official PyTorch implementation of "ECLIPSE: Revisiting the Text-to-Image Prior for Efficient Image Generation"☆64Updated last year
- Diffusion Layout Transformer implementation.☆62Updated 2 years ago
- ☆87Updated last year
- OCR-VQGAN, a discrete image encoder (tokenizer and detokenizer) for figure images in Paper2Fig100k dataset. Implementation of OCR Percept…☆81Updated 2 years ago
- ECCV2024_Parrot Captions Teach CLIP to Spot Text☆65Updated last year
- ☆21Updated 2 years ago
- Official codebase for Margin-aware Preference Optimization for Aligning Diffusion Models without Reference (MaPO).☆81Updated last year
- [CVPR 2023 highlight] Towards Flexible Multi-modal Document Models☆59Updated 2 years ago
- Implementation of Muse: Text-to-Image Generation via Masked Generative Transformers, in Pytorch☆32Updated last year
- [CVPR2025] Official implementation of High Fidelity Scene Text Synthesis.☆77Updated 7 months ago
- A one-stop library to standardize the inference and evaluation of all the conditional image generation models. [ICLR 2024]☆172Updated last month
- Official implemention of "Make It Count: Text-to-Image Generation with an Accurate Number of Objects" (CVPR 2025)☆90Updated 7 months ago
- The official PyTorch implementation for arXiv'23 paper 'LayoutDETR: Detection Transformer Is a Good Multimodal Layout Designer'☆100Updated 5 months ago
- Minimal Differentiable Image Reward Functions☆97Updated 2 months ago
- Continuous diffusion for layout generation☆50Updated 8 months ago
- [ICLR 2024] Official code for the paper "LLM Blueprint: Enabling Text-to-Image Generation with Complex and Detailed Prompts"☆82Updated last year
- Implementation of MDP: A Generalized Framework for Text-Guided Image Editing by Manipulating the Diffusion Path☆68Updated 2 years ago
- This is an official repository for the paper, NoiseCollage, which is a revolutionary extension of text-to-image diffusion models for layo…☆61Updated last year
- Dreambooth (LoRA) with well-organized code structure. Naive adaptation from 🤗Diffusers.☆14Updated 2 years ago
- Code for "DreamEdit: Subject-driven Image Editing" (TMLR2023)☆108Updated last year