emanuelevivoli/ComiCap

[ECCV-W] Official repo for the paper "ComiCap: A VLMs pipeline for dense captioning of Comic Panels"

☆15

Alternatives and similar repositories for ComiCap

Users who are interested in ComiCap are comparing it to the repositories listed below.

emanuelevivoli / CoMix-dataset
Repository for "CoMix: Comprehensive Benchmark for Multi-Task Comic Understanding"
☆18 · Nov 20, 2024 · Updated last year
emanuelevivoli / CoMix
Comics Dataset Framework for Comics Understanding
☆43 · Sep 1, 2025 · Updated 10 months ago
aimagelab / DICE
[ICCV 2025] What Changed? Detecting and Evaluating Instruction-Guided Image Edits with Multimodal Large Language Models
☆15 · Nov 3, 2025 · Updated 8 months ago
koninik / HTG_evaluation
Official PyTorch Implementation of "Rethinking HTG Evaluation: Bridging Generation and Recognition" (Oral) - 1st Workshop on Critical Eva…
☆17 · Sep 23, 2024 · Updated last year
IVRL / ComicsDepth
(WACV 2022) Estimating Image Depth in the Comics Domain
☆20 · Apr 25, 2024 · Updated 2 years ago
georgeretsi / Seq2Emb
Create handwritten word embeddings from a text recognition Seq2Seq system.
☆11 · Dec 1, 2022 · Updated 3 years ago
Ruggero1912 / Patch-ioner
[CVPR 2026] Official Repository of the Paper "One Patch to Caption Them All: A Unified Zero-Shot Captioning Framework"
☆15 · Jun 4, 2026 · Updated last month
emanuelevivoli / awesome-comics-understanding
The official repo of the Comics Survey: "A missing piece in Vision and Language: A Survey on Comics Understanding"
☆139 · Jan 2, 2025 · Updated last year
simomagi / IsoCLIP
[CVPR 2026] - IsoCLIP: Decomposing CLIP Projectors for Efficient Intra-modal Alignment
☆28 · May 29, 2026 · Updated last month
aai-institute / tfl-training-practical-anomaly-detection
Repository of the TransferLab Practical Anomaly Detection workshop
☆14 · Jun 14, 2024 · Updated 2 years ago
manga109 / public-annotations
Various annotations of Manga109 dataset
☆13 · Apr 23, 2025 · Updated last year
moskomule / hypergrad
Simple and extensible hypergradient for PyTorch
☆18 · Feb 23, 2023 · Updated 3 years ago
ciampluca / PrACo
☆16 · May 19, 2026 · Updated 2 months ago
MattAlexMiracle / SmartPatch
☆18 · Oct 1, 2021 · Updated 4 years ago
NikosEfth / freedom
Official PyTorch implementation of the WACV 2025 Oral paper "Composed Image Retrieval for Training-FREE DOMain Conversion".
☆46 · Aug 31, 2025 · Updated 10 months ago
fastlabel / fastlabel-python-sdk
The official Python SDK for FastLabel API, the Data Platform for AI
☆16 · Jul 17, 2026 · Updated last week
tomiock / macrograd
Deep learning Framework from scratch.
☆11 · Jul 23, 2025 · Updated last year
Marchetz / MANTRA-CVPR20
Official PyTorch code for MANTRA - Memory Augmented Neural Trajectory Predictor (CVPR2020)
☆78 · Aug 24, 2022 · Updated 3 years ago
crowsonkb / glide-text2im
GLIDE: a diffusion-based text-conditional image synthesis model
☆20 · Feb 16, 2022 · Updated 4 years ago
aimagelab / pin
☆20 · Oct 18, 2024 · Updated last year
miccunifi / KDPL
[ECCV 2024] - Improving Zero-shot Generalization of Learned Prompts via Unsupervised Knowledge Distillation
☆62 · Feb 20, 2026 · Updated 5 months ago
nuvolos-cloud / PyMesh
Geometry Processing Library for Python
☆23 · Aug 18, 2023 · Updated 2 years ago
AnesBenmerzoug / FreeCAD-Assembly2MuJoCo
FreeCAD workbench for exporting an Assembly to MuJoCo.
☆29 · May 5, 2026 · Updated 2 months ago
NikosEfth / crafting-shifts
Official PyTorch implementation of the WACV 2025 Oral paper "Crafting Distribution Shifts for Validation and Training in Single Source Do…
☆23 · Aug 31, 2025 · Updated 10 months ago
koninik / awesome-handwritten-text-generation
This repo contains a curated list of research papers and resources focusing on Handwritten Text Generation (HTG)
☆24 · Jan 20, 2026 · Updated 6 months ago
christinec-dev / DustyTrails_RPG_Parts
Codes for each section of the Dusty Trails RPG series.
☆32 · Aug 24, 2024 · Updated last year
francescortu / comp-mech
Competition of Mechanisms: Tracing How Language Models Handle Facts and Counterfactuals; ACL 2024
☆13 · May 24, 2024 · Updated 2 years ago
koninik / WordStylist
Official PyTorch Implementation of "WordStylist: Styled Verbatim Handwritten Text Generation with Latent Diffusion Models" - ICDAR 2023
☆82 · Jun 25, 2024 · Updated 2 years ago
cyd3r / notify-free-gpu
A Telegram bot that sends you a message when the GPU is in use
☆11 · May 27, 2024 · Updated 2 years ago
georgeretsi / HTR-best-practices
Basic HTR concepts/modules to boost performance
☆41 · Nov 30, 2024 · Updated last year
aai-institute / continuiti
Learning function operators with neural networks.
☆36 · Aug 22, 2024 · Updated last year
NikosEfth / im2rbte
Edge Augmentation for Large Scale Sketch Recognition without Sketches
☆30 · Aug 31, 2025 · Updated 10 months ago
furkanbiten / stvqa_amazon_ocr
STVQA and TextVQA OCR results from Amazon Text in Image pipeline
☆12 · Jul 18, 2022 · Updated 4 years ago
gdal4al / gdal-examples
This is a repository for the Geospatial Data Abstraction Library (GDAL) and its applications, examples and discussions in the world of s…
☆10 · May 28, 2023 · Updated 3 years ago
WildVision-AI / WildVision-Bench
☆17 · Oct 21, 2024 · Updated last year
billpsomas / metrix
[ICLR 2022] Official implementation of "It Takes Two to Tango: Mixup for Deep Metric Learning".
☆36 · May 15, 2024 · Updated 2 years ago
legraphista / localplexity
LocalPlexity is a lite version of Perplexity aimed at 100% privacy and openness. Everything is done locally, in your browser, from search…
☆20 · Aug 12, 2024 · Updated last year
jaisidhsingh / LoRA-CLIP
Easy wrapper for inserting LoRA layers in CLIP.
☆40 · Jun 16, 2024 · Updated 2 years ago
dali92002 / OCR-TR
Optical Character Recognition (OCR / HTR) using Transformers
☆11 · Aug 20, 2022 · Updated 3 years ago