Comics Dataset Framework for Comics Understanding
☆41Sep 1, 2025Updated 7 months ago
Alternatives and similar repositories for CoMix
Users that are interested in CoMix are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Repository for "CoMix: Comprehensive Benchmark for Multi-Task Comic Understanding"☆17Nov 20, 2024Updated last year
- [ECCV-W] Official repo for the paper "ComiCap: A VLMs pipeline for dense captioning of Comic Panels"☆14Nov 20, 2024Updated last year
- Various annotations of Manga109 dataset☆13Apr 23, 2025Updated 11 months ago
- Create handwritten word embeddings from a text recognition Seq2Seq system.☆11Dec 1, 2022Updated 3 years ago
- Official repository of Manga109Dialog (ICME 2024)☆28Aug 3, 2024Updated last year
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Official PyTorch implementation for "Merging and Splitting Diffusion Paths for Semantically Coherent Panoramas", presenting the Merge-Att…☆14Jul 9, 2025Updated 9 months ago
- Generate a transcript for your favourite Manga: Detect manga characters, text blocks and panels. Order panels. Cluster characters. Match …☆432Jun 27, 2025Updated 9 months ago
- Effective caching in differentially-private databases (SOSP '23)☆13Nov 1, 2023Updated 2 years ago
- (Unstructured) Weight Pruning via Adaptive Sparsity Loss☆15Sep 28, 2022Updated 3 years ago
- ☆27Mar 7, 2025Updated last year
- Basic HTR concepts/modules to boost performance☆40Nov 30, 2024Updated last year
- OCR-VQGAN, a discrete image encoder (tokenizer and detokenizer) for figure images in Paper2Fig100k dataset. Implementation of OCR Percept…☆83Jan 30, 2023Updated 3 years ago
- [AAAI 2025 (Oral)] SAIL: Sample-Centric In-Context Learning for Document Information Extraction☆19Dec 24, 2024Updated last year
- [CVPR 2024 Highlight] - Stationary Representations: Optimally Approximating Compatibility and Implications for Improved Model Replacement…☆13Oct 21, 2024Updated last year
- Deploy open-source AI quickly and easily - Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- [ICDAR 2024] (Best Student Paper🏆) Exploring Knowledge Distillation Towards Document Object Detection with Structured Graph Creation☆15Sep 6, 2024Updated last year
- Official repository of the paper: "A Comprehensive Gold Standard and Benchmark for Comics Text Detection and Recognition"☆26Jul 10, 2023Updated 2 years ago
- Create RP training data from a VN, using GPT-4☆18Nov 2, 2023Updated 2 years ago
- The official Python SDK for FastLabel API, the Data Platform for AI☆16Updated this week
- ☆27Feb 20, 2024Updated 2 years ago
- COMICS data / code / annotations☆125Feb 20, 2019Updated 7 years ago
- Small notebook to preprocess and evaluate images.☆14Nov 11, 2022Updated 3 years ago
- ☆19Oct 1, 2021Updated 4 years ago
- ☆12Aug 20, 2024Updated last year
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- ☆34Jun 22, 2023Updated 2 years ago
- [CVPR 2025] VISCO: Benchmarking Fine-Grained Critique and Correction Towards Self-Improvement in Visual Reasoning☆13Jun 7, 2025Updated 10 months ago
- Poorly Applied and Misunderstood Proppian Narratological Generator☆22Nov 1, 2022Updated 3 years ago
- PyTorch implementation of BMVC2022 paper Masked Vision-Language Transformers for Scene Text Recognition☆29Nov 11, 2022Updated 3 years ago
- Pytorch implementation of HTR on IAM dataset (word or line level + CTC loss)☆21Jul 28, 2022Updated 3 years ago
- ☆17Dec 8, 2025Updated 4 months ago
- Implementation of the paper: "BRAVE : Broadening the visual encoding of vision-language models"☆26Mar 20, 2026Updated 3 weeks ago
- Magnification Prior: A Self-Supervised Method for Learning Representations on Breast Cancer Histopathological Images (WACV 2023)☆15Mar 13, 2023Updated 3 years ago
- Mini Model Daemon☆13Nov 9, 2024Updated last year
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- It's the code for the paper Pushing the Performance Limit of Scene Text Recognizer without Human Annotation, CVPR 2022.☆28Jul 6, 2022Updated 3 years ago
- [CVPR 2025] SAM-I2V☆36Jan 2, 2026Updated 3 months ago
- [IEEE TMM '25] Relighting from a Single Image: Datasets and Deep Intrinsic-based Architecture☆23May 30, 2025Updated 10 months ago
- ☆42Sep 2, 2023Updated 2 years ago
- Fluent CLI is an advanced command-line interface designed to interact seamlessly with multiple workflow systems like FlowiseAI, Langflow,…☆32Jan 23, 2026Updated 2 months ago
- ☆32Feb 8, 2024Updated 2 years ago
- This repo contains the code of "Contrastive Supervised Distillation for Continual Representation Learning", Tommaso Barletti, Niccolò Bio…☆20Jul 5, 2022Updated 3 years ago