emanuelevivoli/CoMix

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/emanuelevivoli/CoMix)

emanuelevivoli / CoMix

Comics Dataset Framework for Comics Understanding

☆43

Alternatives and similar repositories for CoMix

Users that are interested in CoMix are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

emanuelevivoli / CoMix-dataset
View on GitHub
Repository for "CoMix: Comprehensive Benchmark for Multi-Task Comic Understanding"
☆18Nov 20, 2024Updated last year
emanuelevivoli / ComiCap
View on GitHub
[ECCV-W] Official repo for the paper "ComiCap: A VLMs pipeline for dense captioning of Comic Panels"
☆15Nov 20, 2024Updated last year
emanuelevivoli / awesome-comics-understanding
View on GitHub
The official repo of the Comics Survey: "A missing piece in Vision and Language: A Survey on Comics Understanding"
☆139Jan 2, 2025Updated last year
ragavsachdeva / magi
View on GitHub
Generate a transcript for your favourite Manga: Detect manga characters, text blocks and panels. Order panels. Cluster characters. Match …
☆459Jun 27, 2025Updated last year
barisbatuhan / DASS_Detector
View on GitHub
Original Full Repository of the Paper: "Domain-Adaptive Self-Supervised Pre-training for Face & Body Detection in Drawings"
☆20Oct 14, 2025Updated 9 months ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
manga109 / MangaLMM
View on GitHub
MangaLMM – Try the official demo below
☆47Nov 9, 2025Updated 8 months ago
koninik / HTG_evaluation
View on GitHub
Official PyTorch Implementation of "Rethinking HTG Evaluation: Bridging Generation and Recognition" (Oral) - 1st Workshop on Critical Eva…
☆17Sep 23, 2024Updated last year
AndresPMD / Pytorch-yolo-phoc
View on GitHub
Implementation on pytorch of the code from the ECCV 2018 paper - Single Shot Scene Text Retrieval
☆13Dec 15, 2021Updated 4 years ago
aimagelab / MAD
View on GitHub
Official PyTorch implementation for "Merging and Splitting Diffusion Paths for Semantically Coherent Panoramas", presenting the Merge-Att…
☆15Jul 9, 2025Updated last year
amazon-science / textadain-robust-recognition
View on GitHub
TextAdaIN: Paying Attention to Shortcut Learning in Text Recognizers
☆21Jul 26, 2022Updated 3 years ago
sangoi-exe / das-EzBooruTagEditor
View on GitHub
Python app created with the purpose of speeding up and greatly facilitating the task of cleaning and adjusting Booru-style tags, aimed at…
☆12Dec 2, 2023Updated 2 years ago
aimagelab / HWD
View on GitHub
☆27Mar 7, 2025Updated last year
georgeretsi / HTR-best-practices
View on GitHub
Basic HTR concepts/modules to boost performance
☆41Nov 30, 2024Updated last year
nijaru / sy
View on GitHub
Modern file sync tool with delta transfers, 40-79% faster than rsync
☆30Jul 1, 2026Updated 3 weeks ago
Virtual machines for every use case on DigitalOcean • Ad
Get dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
moatifbutt / awesome-diffusion-ECCV-2024
View on GitHub
List of diffusion papers accepted in ECCV 2024.
☆15Oct 17, 2024Updated last year
shalebark / anime-styled-face-alignment
View on GitHub
Facial Alignment for Anime Styled Faces
☆10Mar 26, 2021Updated 5 years ago
VDIGPKU / STR_TPSearch
View on GitHub
☆21Mar 15, 2022Updated 4 years ago
LorenzoAgnolucci / Keyframes-GAN
View on GitHub
[IEEE TMM 2023] This is the official repo of the paper "Perceptual Quality Improvement in Videoconferencing using Keyframes-based GAN".
☆17Dec 10, 2024Updated last year
aimagelab / DICE
View on GitHub
[ICCV 2025] What Changed? Detecting and Evaluating Instruction-Guided Image Edits with Multimodal Large Language Models
☆15Nov 3, 2025Updated 8 months ago
aimagelab / VATr
View on GitHub
☆89Mar 7, 2025Updated last year
discus0434 / evaluate-images-to-feed-diffusion
View on GitHub
Small notebook to preprocess and evaluate images.
☆14Nov 11, 2022Updated 3 years ago
merekat / children-stories
View on GitHub
OhanashiGPT is an application that generates personalized children's stories based on parameters like age and preferences. It narrates th…
☆13Sep 24, 2024Updated last year
miyyer / comics
View on GitHub
COMICS data / code / annotations
☆126Feb 20, 2019Updated 7 years ago
End-to-end encrypted cloud storage - Proton Drive • Ad
Special offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
yohan-pg / robust-unsupervised
View on GitHub
☆12Aug 20, 2024Updated last year
colinlaganier / FederatedDiffusionModels
View on GitHub
Federated Learning of Diffusion Models
☆12Aug 30, 2023Updated 2 years ago
sxfduter / ATSA
View on GitHub
☆15Jul 31, 2020Updated 5 years ago
onealwj / MVLT
View on GitHub
PyTorch implementation of BMVC2022 paper Masked Vision-Language Transformers for Scene Text Recognition
☆28Nov 11, 2022Updated 3 years ago
reSecureIt / negativa-ml
View on GitHub
A tool analyzing unused GPU code by machine learning workloads
☆15Oct 6, 2025Updated 9 months ago
overfitting-ai-community / basic-course
View on GitHub
온라인 강의를 수강하고 토이 프로젝트를 진행 합니다.
☆12Aug 7, 2022Updated 3 years ago
kyegomez / BRAVE-ViT-Swarm
View on GitHub
Implementation of the paper: "BRAVE : Broadening the visual encoding of vision-language models"
☆26Jun 22, 2026Updated last month
georgeretsi / HTR-ctc
View on GitHub
Pytorch implementation of HTR on IAM dataset (word or line level + CTC loss)
☆21Jul 28, 2022Updated 3 years ago
Eyeline-Labs / LiMo
View on GitHub
The official implementation of CVPR'26 paper "Lighting in Motion: Spatiotemporal HDR Lighting Estimation"
☆19May 24, 2026Updated 2 months ago
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
dali92002 / SSL-OCR
View on GitHub
Text-DIAE: A Self-Supervised Degradation Invariant Autoencoders for Text Recognition and Document Enhancement - AAAI 2023
☆30Jul 12, 2023Updated 3 years ago
NikosEfth / crafting-shifts
View on GitHub
Official PyTorch implementation of the WACV 2025 Oral paper "Crafting Distribution Shifts for Validation and Training in Single Source Do…
☆23Aug 31, 2025Updated 10 months ago
recursal / minmodmon
View on GitHub
Mini Model Daemon
☆13Nov 9, 2024Updated last year
microsoft / CompHRDoc
View on GitHub
Datasets and Evaluation Scripts for CompHRDoc
☆59Feb 25, 2025Updated last year
wangmengsd / writtingskills
View on GitHub
Some skills of English research paper writing
☆17Aug 4, 2020Updated 5 years ago
CyrilSterling / LPV
View on GitHub
The official code of Linguistic More: Taking a Further Step toward Efficient and Accurate Scene Text Recognition (IJCAI2023)
☆26Sep 3, 2023Updated 2 years ago
Caiyuan-Zheng / Consistency_Regularization_STR
View on GitHub
It's the code for the paper Pushing the Performance Limit of Scene Text Recognizer without Human Annotation, CVPR 2022.
☆28Jul 6, 2022Updated 4 years ago