emanuelevivoli / awesome-comics-understanding
The official repo of the Comics Survey: "A missing piece in Vision and Language: A Survey on Comics Understanding"
☆100Updated last month
Alternatives and similar repositories for awesome-comics-understanding:
Users that are interested in awesome-comics-understanding are comparing it to the libraries listed below
- Beyond Single Object Text-to-SVG Synthesis with Comprehensive Canvas Layout☆18Updated 2 weeks ago
- Official Pytorch implementation of "CompoDiff: Versatile Composed Image Retrieval With Latent Diffusion" (TMLR 2024)☆83Updated 2 weeks ago
- [CVPR 2024] Official repo for "InteractDiffusion: Interaction-Control for Text-to-Image Diffusion Model".☆117Updated last month
- Dynamic Prompt Learning: Addressing Cross-Attention Leakage for Text-Based Image Editing (NeurIPS 2023)☆98Updated 9 months ago
- Official Implementation for "MyVLM: Personalizing VLMs for User-Specific Queries" (ECCV 2024)☆162Updated 7 months ago
- ☆102Updated 3 weeks ago
- ☆22Updated last week
- 👀 Visual Instruction Inversion: Image Editing via Visual Prompting (NeurIPS 2023)☆88Updated last year
- (CVPR 2024) 🧩 TokenCompose: Text-to-Image Diffusion with Token-level Supervision☆120Updated 2 months ago
- [ICLR 2024] Official code for the paper "LLM Blueprint: Enabling Text-to-Image Generation with Complex and Detailed Prompts"☆73Updated 9 months ago
- [ECCV 2024 Oral] ConceptExpress: Harnessing Diffusion Models for Single-image Unsupervised Concept Extraction☆57Updated 6 months ago
- Davidsonian Scene Graph (DSG) for Text-to-Image Evaluation (ICLR 2024)☆84Updated 2 months ago
- [CVPR 2024] Official PyTorch implementation of "ECLIPSE: Revisiting the Text-to-Image Prior for Efficient Image Generation"☆62Updated 9 months ago
- Evaluating Data Attribution for Text-to-Image Models: a visual data attribution benchmark for evaluating and learning training image inf…☆70Updated 7 months ago
- Densely Captioned Images (DCI) dataset repository.☆168Updated 7 months ago
- [CVPR`2024, Oral] Attention Calibration for Disentangled Text-to-Image Personalization☆93Updated 10 months ago
- ☆75Updated 2 months ago
- [ICCV 2023] - Zero-shot Composed Image Retrieval with Textual Inversion☆166Updated 9 months ago
- Code for ACM MM'23 paper: LayoutLLM-T2I: Eliciting Layout Guidance from LLM for Text-to-Image Generation☆44Updated 6 months ago
- [ICLR 2025] Source code for paper "A Spark of Vision-Language Intelligence: 2-Dimensional Autoregressive Transformer for Efficient Finegr…☆67Updated 2 months ago
- Official implementation of "Describing Differences in Image Sets with Natural Language" (CVPR 2024 Oral)☆113Updated 10 months ago
- A one-stop library to standardize the inference and evaluation of all the conditional image generation models. (ICLR 2024)☆157Updated 4 months ago
- ReCo: Region-Controlled Text-to-Image Generation, CVPR 2023☆124Updated last year
- [CVPR 2024 Highlight] - Stationary Representations: Optimally Approximating Compatibility and Implications for Improved Model Replacement…☆13Updated 4 months ago
- Official implemention of "Make It Count: Text-to-Image Generation with an Accurate Number of Objects"☆66Updated 8 months ago
- [NeurIPS 2023] Customize spatial layouts for conditional image synthesis models, e.g., ControlNet, using GPT☆136Updated 9 months ago
- ☆97Updated last year
- Official code for paper: Desigen: A Pipeline for Controllable Design Template Generation [CVPR'24]☆66Updated 7 months ago