emanuelevivoli / awesome-comics-understandingLinks
The official repo of the Comics Survey: "A missing piece in Vision and Language: A Survey on Comics Understanding"
☆114Updated 5 months ago
Alternatives and similar repositories for awesome-comics-understanding
Users that are interested in awesome-comics-understanding are comparing it to the libraries listed below
Sorting:
- Beyond Single Object Text-to-SVG Synthesis with Comprehensive Canvas Layout☆20Updated 4 months ago
- Comics Dataset Framework for Comics Understanding☆23Updated 3 months ago
- Official Pytorch implementation of "CompoDiff: Versatile Composed Image Retrieval With Latent Diffusion" (TMLR 2024)☆85Updated 4 months ago
- ☆23Updated 3 months ago
- ☆31Updated last week
- Text-DIAE: A Self-Supervised Degradation Invariant Autoencoders for Text Recognition and Document Enhancement - AAAI 2023☆25Updated last year
- [ICLR 2024] Official code for the paper "LLM Blueprint: Enabling Text-to-Image Generation with Complex and Detailed Prompts"☆77Updated last year
- Official Implementation for "MyVLM: Personalizing VLMs for User-Specific Queries" (ECCV 2024)☆170Updated 11 months ago
- Official PyTorch Implementation of "DiffusionPen: Towards Controlling the Style of Handwritten Text Generation" - ECCV 2024☆49Updated 7 months ago
- Official code for paper: Desigen: A Pipeline for Controllable Design Template Generation [CVPR'24]☆70Updated 10 months ago
- [CVPR 2024 Highlight] - Stationary Representations: Optimally Approximating Compatibility and Implications for Improved Model Replacement…☆13Updated 7 months ago
- [ICDAR 2024] (Best Student Paper🏆) Exploring Knowledge Distillation Towards Document Object Detection with Structured Graph Creation☆14Updated 9 months ago
- [ICCV 2023] - Composed Image Retrieval on Common Objects in context (CIRCO) dataset☆66Updated 9 months ago
- Official implemention of "Make It Count: Text-to-Image Generation with an Accurate Number of Objects" (CVPR 2025)☆74Updated 2 months ago
- Official implementation of "Describing Differences in Image Sets with Natural Language" (CVPR 2024 Oral)☆119Updated last year
- ☆93Updated 10 months ago
- 👀 Visual Instruction Inversion: Image Editing via Visual Prompting (NeurIPS 2023)☆91Updated last year
- [CVPR 2024] Official repo for "InteractDiffusion: Interaction-Control for Text-to-Image Diffusion Model".☆119Updated 4 months ago
- [ECCV-W] Official repo for the paper "ComiCap: A VLMs pipeline for dense captioning of Comic Panels"☆12Updated 6 months ago
- Evaluating Data Attribution for Text-to-Image Models: a visual data attribution benchmark for evaluating and learning training image inf…☆72Updated 11 months ago
- ☆95Updated last year
- [ECCV 2024 Oral] ConceptExpress: Harnessing Diffusion Models for Single-image Unsupervised Concept Extraction☆68Updated 9 months ago
- ☆81Updated 3 months ago
- Safe-CLIP: Removing NSFW Concepts from Vision-and-Language Models. ECCV 2024☆60Updated 9 months ago
- [CVPR 2024] Official PyTorch implementation of "ECLIPSE: Revisiting the Text-to-Image Prior for Efficient Image Generation"☆62Updated last year
- Davidsonian Scene Graph (DSG) for Text-to-Image Evaluation (ICLR 2024)☆88Updated 5 months ago
- LLaVA-MORE: A Comparative Study of LLMs and Visual Backbones for Enhanced Visual Instruction Tuning☆138Updated last month
- [2024-NeurIPS] TextCtrl: Diffusion-based Scene Text Editing with Prior Guidance Control☆79Updated 2 months ago
- Dynamic Prompt Learning: Addressing Cross-Attention Leakage for Text-Based Image Editing (NeurIPS 2023)☆105Updated last year
- [ACM TOMM 2023] - Composed Image Retrieval using Contrastive Learning and Task-oriented CLIP-based Features☆178Updated last year