emanuelevivoli / awesome-comics-understandingLinks
The official repo of the Comics Survey: "A missing piece in Vision and Language: A Survey on Comics Understanding"
☆117Updated 5 months ago
Alternatives and similar repositories for awesome-comics-understanding
Users that are interested in awesome-comics-understanding are comparing it to the libraries listed below
Sorting:
- ☆24Updated 3 months ago
- Official Pytorch implementation of "CompoDiff: Versatile Composed Image Retrieval With Latent Diffusion" (TMLR 2024)☆84Updated 4 months ago
- [ICLR 2024] Official code for the paper "LLM Blueprint: Enabling Text-to-Image Generation with Complex and Detailed Prompts"☆79Updated last year
- [ICCV 2023] - Zero-shot Composed Image Retrieval with Textual Inversion☆175Updated last year
- Beyond Single Object Text-to-SVG Synthesis with Comprehensive Canvas Layout☆20Updated 4 months ago
- [ICCV 2023] - Composed Image Retrieval on Common Objects in context (CIRCO) dataset☆69Updated 10 months ago
- [CVPR 2024] Official repo for "InteractDiffusion: Interaction-Control for Text-to-Image Diffusion Model".☆121Updated last week
- Text-DIAE: A Self-Supervised Degradation Invariant Autoencoders for Text Recognition and Document Enhancement - AAAI 2023☆25Updated last year
- Official implementation of "Describing Differences in Image Sets with Natural Language" (CVPR 2024 Oral)☆119Updated last year
- [CVPR 2024 Highlight] - Stationary Representations: Optimally Approximating Compatibility and Implications for Improved Model Replacement…☆13Updated 8 months ago
- 👀 Visual Instruction Inversion: Image Editing via Visual Prompting (NeurIPS 2023)☆91Updated last year
- Official Implementation for "MyVLM: Personalizing VLMs for User-Specific Queries" (ECCV 2024)☆172Updated 11 months ago
- Official Implementation of ICLR'24: Kosmos-G: Generating Images in Context with Multimodal Large Language Models☆71Updated last year
- Davidsonian Scene Graph (DSG) for Text-to-Image Evaluation (ICLR 2024)☆89Updated 6 months ago
- [ECCV 2024] Official repo for UDiffText: A Unified Framework for High-quality Text Synthesis in Arbitrary Images via Character-aware Diff…☆229Updated 4 months ago
- Densely Captioned Images (DCI) dataset repository.☆185Updated 11 months ago
- ☆125Updated 11 months ago
- Dynamic Prompt Learning: Addressing Cross-Attention Leakage for Text-Based Image Editing (NeurIPS 2023)☆105Updated last year
- [WACV 2024] Training-Free Layout Control with Cross-Attention Guidance☆257Updated last year
- Comics Dataset Framework for Comics Understanding☆23Updated 3 months ago
- [ECCV-W] Official repo for the paper "ComiCap: A VLMs pipeline for dense captioning of Comic Panels"☆12Updated 7 months ago
- ☆110Updated 5 months ago
- [ICCV 2023] Unsupervised Compositional Concepts Discovery with Text-to-Image Generative Models☆84Updated last year
- FuseCap: Leveraging Large Language Models for Enriched Fused Image Captions☆55Updated last year
- Official PyTorch Implementation of "DiffusionPen: Towards Controlling the Style of Handwritten Text Generation" - ECCV 2024☆49Updated 8 months ago
- Code base of SynthCLIP: CLIP training with purely synthetic text-image pairs from LLMs and TTIs.☆100Updated 3 months ago
- ☆80Updated 7 months ago
- ☆97Updated last year
- [NeurIPS 2023] Customize spatial layouts for conditional image synthesis models, e.g., ControlNet, using GPT☆136Updated last year
- Code for the paper Open-Vocabulary Attention Maps with Token Optimization for Semantic Segmentation in Diffusion Models @ CVPR 2024☆66Updated last year