Comics Dataset Framework for Comics Understanding
☆41Sep 1, 2025Updated 8 months ago
Alternatives and similar repositories for CoMix
Users that are interested in CoMix are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Repository for "CoMix: Comprehensive Benchmark for Multi-Task Comic Understanding"☆17Nov 20, 2024Updated last year
- [ECCV-W] Official repo for the paper "ComiCap: A VLMs pipeline for dense captioning of Comic Panels"☆14Nov 20, 2024Updated last year
- Create handwritten word embeddings from a text recognition Seq2Seq system.☆11Dec 1, 2022Updated 3 years ago
- Official PyTorch Implementation of "Rethinking HTG Evaluation: Bridging Generation and Recognition" (Oral) - 1st Workshop on Critical Eva…☆17Sep 23, 2024Updated last year
- Searching a High Performance Feature Extractor for Text Recognition Network. TPAMI 2022☆13Nov 25, 2022Updated 3 years ago
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- Official repository of Manga109Dialog (ICME 2024)☆28Aug 3, 2024Updated last year
- Official PyTorch implementation for "Merging and Splitting Diffusion Paths for Semantically Coherent Panoramas", presenting the Merge-Att…☆14Jul 9, 2025Updated 9 months ago
- The official PyTorch implementation for the paper "Lightweight and Fast Real-time Image Enhancement via Decomposition of the Spatial-awar…☆31Jan 16, 2026Updated 3 months ago
- TextAdaIN: Paying Attention to Shortcut Learning in Text Recognizers☆21Jul 26, 2022Updated 3 years ago
- Effective caching in differentially-private databases (SOSP '23)☆13Nov 1, 2023Updated 2 years ago
- Python app created with the purpose of speeding up and greatly facilitating the task of cleaning and adjusting Booru-style tags, aimed at…☆12Dec 2, 2023Updated 2 years ago
- (Unstructured) Weight Pruning via Adaptive Sparsity Loss☆15Sep 28, 2022Updated 3 years ago
- ☆27Mar 7, 2025Updated last year
- Basic HTR concepts/modules to boost performance☆40Nov 30, 2024Updated last year
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- OCR-VQGAN, a discrete image encoder (tokenizer and detokenizer) for figure images in Paper2Fig100k dataset. Implementation of OCR Percept…☆83Jan 30, 2023Updated 3 years ago
- [AAAI 2025 (Oral)] SAIL: Sample-Centric In-Context Learning for Document Information Extraction☆20Dec 24, 2024Updated last year
- Datasets and Evaluation Scripts for CompHRDoc☆58Feb 25, 2025Updated last year
- Official repository of the paper: "A Comprehensive Gold Standard and Benchmark for Comics Text Detection and Recognition"☆26Jul 10, 2023Updated 2 years ago
- ☆89Mar 7, 2025Updated last year
- ☆27Feb 20, 2024Updated 2 years ago
- Small notebook to preprocess and evaluate images.☆14Nov 11, 2022Updated 3 years ago
- ☆19Oct 1, 2021Updated 4 years ago
- ☆12Aug 20, 2024Updated last year
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- ☆15Jul 31, 2020Updated 5 years ago
- ☆34Jan 13, 2025Updated last year
- Pytorch implementation of HTR on IAM dataset (word or line level + CTC loss)☆21Jul 28, 2022Updated 3 years ago
- ☆12Sep 15, 2024Updated last year
- Open source data for data visualization enthusiasts.☆22Dec 20, 2021Updated 4 years ago
- We introduce OpenStory++, a large-scale open-domain dataset focusing on enabling MLLMs to perform storytelling generation tasks.☆18Aug 30, 2024Updated last year
- Magnification Prior: A Self-Supervised Method for Learning Representations on Breast Cancer Histopathological Images (WACV 2023)☆15Mar 13, 2023Updated 3 years ago
- Text-DIAE: A Self-Supervised Degradation Invariant Autoencoders for Text Recognition and Document Enhancement - AAAI 2023☆30Jul 12, 2023Updated 2 years ago
- A collection of AWESOME things about domain adaptive object detection.☆14Apr 15, 2021Updated 5 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Some skills of English research paper writing☆16Aug 4, 2020Updated 5 years ago
- Official PyTorch implementation of the WACV 2025 Oral paper "Crafting Distribution Shifts for Validation and Training in Single Source Do…☆23Aug 31, 2025Updated 8 months ago
- Co-Reinforcement Learning for Unified Multimodal Understanding and Generation☆46Jul 22, 2025Updated 9 months ago
- This repository aims to collect the articles and codes for the Visual Storytelling (VIST) task. VIST is a vision-and-language task. It ai…☆25Mar 3, 2021Updated 5 years ago
- [IEEE TMM '25] Relighting from a Single Image: Datasets and Deep Intrinsic-based Architecture☆23May 30, 2025Updated 11 months ago
- ☆42Sep 2, 2023Updated 2 years ago
- This is the official impletations of the EMNLP Findings paper, VideoINSTA: Zero-shot Long-Form Video Understanding via Informative Spatia…☆25Apr 7, 2026Updated 3 weeks ago