emanuelevivoli / CoMixLinks
Comics Dataset Framework for Comics Understanding
☆38Updated 5 months ago
Alternatives and similar repositories for CoMix
Users that are interested in CoMix are comparing it to the libraries listed below
Sorting:
- The official repo of the Comics Survey: "A missing piece in Vision and Language: A Survey on Comics Understanding"☆133Updated last year
- Repository for "CoMix: Comprehensive Benchmark for Multi-Task Comic Understanding"☆16Updated last year
- [ECCV-W] Official repo for the paper "ComiCap: A VLMs pipeline for dense captioning of Comic Panels"☆14Updated last year
- ☆101Updated last year
- Official Pytorch implementation of "CompoDiff: Versatile Composed Image Retrieval With Latent Diffusion" (TMLR 2024)☆88Updated 11 months ago
- [NeurIPS2023] This is the official code of the paper "GlyphControl: Glyph Conditional Control for Visual Text Generation"☆239Updated last year
- Official PyTorch Implementation of "DiffusionPen: Towards Controlling the Style of Handwritten Text Generation" - ECCV 2024☆83Updated last year
- OCR-VQGAN, a discrete image encoder (tokenizer and detokenizer) for figure images in Paper2Fig100k dataset. Implementation of OCR Percept…☆82Updated 3 years ago
- Dreambooth (LoRA) with well-organized code structure. Naive adaptation from 🤗Diffusers.☆15Updated 2 years ago
- [CVPR 2024 Oral] Official repository for RALF: Retrieval-Augmented Layout Transformer for Content-Aware Layout Generation☆140Updated last year
- Davidsonian Scene Graph (DSG) for Text-to-Image Evaluation (ICLR 2024)☆104Updated last year
- Continuous diffusion for layout generation☆52Updated 11 months ago
- [ICLR 2024] Official code for the paper "LLM Blueprint: Enabling Text-to-Image Generation with Complex and Detailed Prompts"☆83Updated last year
- The official PyTorch implementation for arXiv'23 paper 'LayoutDETR: Detection Transformer Is a Good Multimodal Layout Designer'☆102Updated 8 months ago
- A one-stop library to standardize the inference and evaluation of all the conditional image generation models. [ICLR 2024]☆176Updated last month
- ☆16Updated last year
- Official code for paper: Desigen: A Pipeline for Controllable Design Template Generation [CVPR'24]☆75Updated last year
- ECCV2024_Parrot Captions Teach CLIP to Spot Text☆66Updated last year
- Official PyTorch Implementation of "Rethinking HTG Evaluation: Bridging Generation and Recognition" (Oral) - 1st Workshop on Critical Eva…☆17Updated last year
- OpenCOLE: Towards Reproducible Automatic Graphic Design Generation [Inoue+, CVPRW2024 (GDUG)]☆85Updated 10 months ago
- Text-DIAE: A Self-Supervised Degradation Invariant Autoencoders for Text Recognition and Document Enhancement - AAAI 2023☆28Updated 2 years ago
- [ECCV 2024] Official repo for UDiffText: A Unified Framework for High-quality Text Synthesis in Arbitrary Images via Character-aware Diff…☆234Updated 11 months ago
- ☆24Updated last year
- [CVPR 2024] Official PyTorch implementation of "ECLIPSE: Revisiting the Text-to-Image Prior for Efficient Image Generation"☆65Updated last year
- ☆100Updated 2 years ago
- Official PyTorch Implementation of "WordStylist: Styled Verbatim Handwritten Text Generation with Latent Diffusion Models" - ICDAR 2023☆82Updated last year
- ☆119Updated last year
- [ICML 2025] This is the official repository of our paper "What If We Recaption Billions of Web Images with LLaMA-3 ?"☆149Updated last year
- ☆17Updated last year
- [CVPR 2024] Official repo for "InteractDiffusion: Interaction-Control for Text-to-Image Diffusion Model".☆126Updated 7 months ago