emanuelevivoli / CoMix
Comics Dataset Framework for Comics Understanding
☆33 · Updated 3 months ago
Alternatives and similar repositories for CoMix
Users who are interested in CoMix compare it to the libraries listed below.
- The official repo of the Comics Survey: "A missing piece in Vision and Language: A Survey on Comics Understanding" ☆131 · Updated 11 months ago
- Repository for "CoMix: Comprehensive Benchmark for Multi-Task Comic Understanding" ☆15 · Updated last year
- [ECCV-W] Official repo for the paper "ComiCap: A VLMs pipeline for dense captioning of Comic Panels" ☆14 · Updated last year
- ☆99 · Updated last year
- Official Pytorch implementation of "CompoDiff: Versatile Composed Image Retrieval With Latent Diffusion" (TMLR 2024) ☆88 · Updated 10 months ago
- [NeurIPS2023] This is the official code of the paper "GlyphControl: Glyph Conditional Control for Visual Text Generation" ☆237 · Updated last year
- Official PyTorch Implementation of "DiffusionPen: Towards Controlling the Style of Handwritten Text Generation" - ECCV 2024 ☆82 · Updated last year
- [ICLR 2024] Official code for the paper "LLM Blueprint: Enabling Text-to-Image Generation with Complex and Detailed Prompts" ☆82 · Updated last year
- ☆16 · Updated 11 months ago
- OCR-VQGAN, a discrete image encoder (tokenizer and detokenizer) for figure images in Paper2Fig100k dataset. Implementation of OCR Percept… ☆82 · Updated 2 years ago
- [CVPR 2024] Official PyTorch implementation of "ECLIPSE: Revisiting the Text-to-Image Prior for Efficient Image Generation" ☆65 · Updated last year
- Official PyTorch Implementation of "Rethinking HTG Evaluation: Bridging Generation and Recognition" (Oral) - 1st Workshop on Critical Eva… ☆17 · Updated last year
- Davidsonian Scene Graph (DSG) for Text-to-Image Evaluation (ICLR 2024) ☆102 · Updated last year
- [CVPR 2023] SketchXAI: A First Look at Explainability for Human Sketches ☆25 · Updated last year
- Diffusion Layout Transformer implementation. ☆63 · Updated 2 years ago
- [CVPR 2024 Oral] Official repository for RALF: Retrieval-Augmented Layout Transformer for Content-Aware Layout Generation ☆139 · Updated last year
- [ECCV 2024] Official repo for UDiffText: A Unified Framework for High-quality Text Synthesis in Arbitrary Images via Character-aware Diff… ☆233 · Updated 10 months ago
- ECCV2024_Parrot Captions Teach CLIP to Spot Text ☆66 · Updated last year
- 🤗 Unofficial huggingface/diffusers-based implementation of the paper "Training-Free Structured Diffusion Guidance for Compositional Text… ☆120 · Updated 2 years ago
- Evaluating Data Attribution for Text-to-Image Models: a visual data attribution benchmark for evaluating and learning training image inf… ☆78 · Updated last year
- ☆99 · Updated last year
- ☆86 · Updated 9 months ago
- Unofficial implementation of paper NULL-text Inversion for Editing Real Images using Guided Diffusion Models ( https://arxiv.org/abs/2211… ☆87 · Updated 3 years ago
- The official PyTorch implementation for arXiv'23 paper 'LayoutDETR: Detection Transformer Is a Good Multimodal Layout Designer' ☆101 · Updated 7 months ago
- The official repository of "Energy-Based Cross Attention for Bayesian Context Update in Text-to-Image Diffusion Models". ☆51 · Updated last year
- [WACV 2026 Round 1] Beyond Single Object Text-to-SVG Synthesis with Comprehensive Canvas Layout ☆21 · Updated 2 months ago
- Official code repo for "Editing Implicit Assumptions in Text-to-Image Diffusion Models" ☆87 · Updated 2 years ago
- [WACV 2024] Official Implementation of TIAM - A Metric for Evaluating Alignment in Text-to-Image Generation ☆19 · Updated 10 months ago
- ☆23 · Updated last year
- Visual Instruction-guided Explainable Metric. Code for "Towards Explainable Metrics for Conditional Image Synthesis Evaluation" (ACL 2024… ☆57 · Updated last year