emanuelevivoli / awesome-comics-understanding
The official repo of the Comics Survey: "A missing piece in Vision and Language: A Survey on Comics Understanding"
☆127 · Updated 9 months ago
Alternatives and similar repositories for awesome-comics-understanding
Users who are interested in awesome-comics-understanding are comparing it to the repositories listed below.
- Repository for "CoMix: Comprehensive Benchmark for Multi-Task Comic Understanding" ☆13 · Updated 11 months ago
- Official Implementation for "MyVLM: Personalizing VLMs for User-Specific Queries" (ECCV 2024) ☆179 · Updated last year
- Official PyTorch implementation of "CompoDiff: Versatile Composed Image Retrieval With Latent Diffusion" (TMLR 2024) ☆88 · Updated 8 months ago
- ☆25 · Updated 7 months ago
- [ICLR 2024] Official code for the paper "LLM Blueprint: Enabling Text-to-Image Generation with Complex and Detailed Prompts" ☆82 · Updated last year
- [ACM TOMM 2023] - Composed Image Retrieval using Contrastive Learning and Task-oriented CLIP-based Features ☆187 · Updated 2 years ago
- A one-stop library to standardize the inference and evaluation of all the conditional image generation models. [ICLR 2024] ☆172 · Updated last month
- TIFA: Accurate and Interpretable Text-to-Image Faithfulness Evaluation with Question Answering ☆178 · Updated last year
- (CVPR 2024) 🧩 TokenCompose: Text-to-Image Diffusion with Token-level Supervision ☆133 · Updated 10 months ago
- [CVPR 2024] Official repo for "InteractDiffusion: Interaction-Control for Text-to-Image Diffusion Model". ☆122 · Updated 4 months ago
- Densely Captioned Images (DCI) dataset repository. ☆191 · Updated last year
- Data release for the ImageInWords (IIW) paper. ☆220 · Updated 11 months ago
- ☆114 · Updated 9 months ago
- Dreambooth (LoRA) with well-organized code structure. Naive adaptation from 🤗Diffusers. ☆14 · Updated 2 years ago
- 👀 Visual Instruction Inversion: Image Editing via Visual Prompting (NeurIPS 2023) ☆94 · Updated last year
- [NeurIPS2023] This is the official code of the paper "GlyphControl: Glyph Conditional Control for Visual Text Generation" ☆237 · Updated last year
- MAG-Edit: Localized Image Editing in Complex Scenarios via Mask-Based Attention-Adjusted Guidance (ACM MM2024) ☆136 · Updated 6 months ago
- [ECCV 2024] Official repo for UDiffText: A Unified Framework for High-quality Text Synthesis in Arbitrary Images via Character-aware Diff… ☆231 · Updated 8 months ago
- ECCV2024_Parrot Captions Teach CLIP to Spot Text ☆65 · Updated last year
- Evaluating Data Attribution for Text-to-Image Models: a visual data attribution benchmark for evaluating and learning training image inf… ☆76 · Updated last year
- 🌋👵🏻 Yo'LLaVA: Your Personalized Language and Vision Assistant ☆117 · Updated 7 months ago
- Comics Dataset Framework for Comics Understanding ☆31 · Updated 2 months ago
- [ICCV 2023] - Zero-shot Composed Image Retrieval with Textual Inversion ☆191 · Updated 3 months ago
- [WACV 2026 Round 1] Beyond Single Object Text-to-SVG Synthesis with Comprehensive Canvas Layout ☆21 · Updated 2 weeks ago
- Davidsonian Scene Graph (DSG) for Text-to-Image Evaluation (ICLR 2024) ☆97 · Updated 10 months ago
- ☆99 · Updated last year
- Official implementation of "Make It Count: Text-to-Image Generation with an Accurate Number of Objects" (CVPR 2025) ☆90 · Updated 7 months ago
- A PyTorch implementation of the paper Key-Locked Rank One Editing for Text-to-Image Personalization ☆85 · Updated 2 years ago
- Continuous diffusion for layout generation ☆50 · Updated 8 months ago
- [ICLR 2025] HQ-Edit: A High-Quality and High-Coverage Dataset for General Image Editing ☆111 · Updated last year