The official repo of the Comics Survey: "A missing piece in Vision and Language: A Survey on Comics Understanding"
★135 · Jan 2, 2025 · Updated last year
Alternatives and similar repositories for awesome-comics-understanding
Users that are interested in awesome-comics-understanding are comparing it to the libraries listed below.
- Repository for "CoMix: Comprehensive Benchmark for Multi-Task Comic Understanding" · ★17 · Nov 20, 2024 · Updated last year
- [ICDAR 2024] (Best Student Paper) Exploring Knowledge Distillation Towards Document Object Detection with Structured Graph Creation · ★15 · Sep 6, 2024 · Updated last year
- [COG24] - Official repository of "OfflineMania: A Benchmark Environment for Offline Reinforcement Learning in Racing Games" · ★12 · Jul 15, 2024 · Updated last year
- [CVPR 2024 Highlight] - Stationary Representations: Optimally Approximating Compatibility and Implications for Improved Model Replacement… · ★13 · Oct 21, 2024 · Updated last year
- [ECCV 2024] - Improving Zero-shot Generalization of Learned Prompts via Unsupervised Knowledge Distillation · ★63 · Feb 20, 2026 · Updated 2 months ago
- Comics Dataset Framework for Comics Understanding · ★41 · Sep 1, 2025 · Updated 8 months ago
- Let there be clock in the beach - WACV 2022 · ★15 · Nov 15, 2021 · Updated 4 years ago
- Handwritten Text Recognition in Few-shot Scenario · ★22 · Mar 25, 2023 · Updated 3 years ago
- [WACV 2026 Round 1] Beyond Single Object Text-to-SVG Synthesis with Comprehensive Canvas Layout · ★23 · Oct 11, 2025 · Updated 6 months ago
- [WACV 2024] - Reference-based Restoration of Digitized Analog Videotapes · ★59 · Feb 11, 2024 · Updated 2 years ago
- WACV 2022 Paper - Is An Image Worth Five Sentences? A New Look into Semantics for Image-Text Matching · ★16 · Dec 10, 2021 · Updated 4 years ago
- Doc2Graph transforms documents into graphs and exploits a GNN to solve several tasks. · ★137 · Oct 18, 2025 · Updated 6 months ago
- [ICCVW 2023] - Mapping Memes to Words for Multimodal Hateful Meme Classification · ★27 · Apr 17, 2025 · Updated last year
- [ICCV 2023] - Composed Image Retrieval on Common Objects in context (CIRCO) dataset · ★87 · Aug 6, 2025 · Updated 9 months ago
- Optical Character Recognition (OCR / HTR) using Transformers · ★11 · Aug 20, 2022 · Updated 3 years ago
- [ICLR 2024] - Elastic Feature Consolidation for Cold Start Exemplar-Free Incremental Learning · ★34 · May 26, 2025 · Updated 11 months ago
- [IEEE TMM 2023] This is the official repo of the paper "Perceptual Quality Improvement in Videoconferencing using Keyframes-based GAN". · ★17 · Dec 10, 2024 · Updated last year
- ★27 · Mar 7, 2025 · Updated last year
- [ECCV'24] NamedCurves: Learned Image Enhancement via Color Naming · ★34 · Sep 8, 2025 · Updated 8 months ago
- Official PyTorch code for MANTRA - Memory Augmented Neural Trajectory Predictor (CVPR2020) · ★78 · Aug 24, 2022 · Updated 3 years ago
- Official PyTorch Implementation of DocSynth: A Layout Guided Approach for Controllable Document Image Synthesis - ICDAR 2021 · ★93 · Jul 16, 2021 · Updated 4 years ago
- [ECCV-W] Official repo for the paper "ComiCap: A VLMs pipeline for dense captioning of Comic Panels" · ★14 · Nov 20, 2024 · Updated last year
- Text-DIAE: A Self-Supervised Degradation Invariant Autoencoders for Text Recognition and Document Enhancement - AAAI 2023 · ★30 · Jul 12, 2023 · Updated 2 years ago
- [ICCV 2023] - Zero-shot Composed Image Retrieval with Textual Inversion · ★197 · Jul 31, 2025 · Updated 9 months ago
- [ICIAP 2023] Learning Landmarks Motion from Speech for Speaker-Agnostic 3D Talking Heads Generation · ★62 · Dec 12, 2023 · Updated 2 years ago
- ICDAR 2019 · ★25 · Aug 2, 2019 · Updated 6 years ago
- Attempt at reproducing the metric from NeurIPS 2023 Unlearning Challenge on Kaggle. Code for training checkpoints on retain set and unlea… · ★12 · Nov 8, 2023 · Updated 2 years ago
- Based on the WACV 2020 paper - Fine Grained Classification and Retrieval by Combining Visual and Locally Pooled Textual Features · ★25 · Nov 15, 2021 · Updated 4 years ago
- Quality-Aware Image-Text Alignment for Opinion-Unaware Image Quality Assessment · ★130 · Mar 10, 2025 · Updated last year
- ★18 · Sep 14, 2024 · Updated last year
- TextAdaIN: Paying Attention to Shortcut Learning in Text Recognizers · ★21 · Jul 26, 2022 · Updated 3 years ago
- Official repository of the paper: "A Comprehensive Gold Standard and Benchmark for Comics Text Detection and Recognition" · ★26 · Jul 10, 2023 · Updated 2 years ago
- Implementation in PyTorch of the code from the ECCV 2018 paper - Single Shot Scene Text Retrieval · ★13 · Dec 15, 2021 · Updated 4 years ago
- Code for "A Comprehensive Empirical Evaluation on Online Continual Learning" ICCVW 2023 VCL Workshop · ★44 · Apr 8, 2024 · Updated 2 years ago
- DocEnTr: An end-to-end document image enhancement transformer - ICPR 2022 · ★187 · Jan 17, 2025 · Updated last year
- Official repository for our paper on "Attribution-aware Weight Transfer: A Warm-Start Initialization for Class-Incremental Semantic Segme… · ★12 · Jan 3, 2023 · Updated 3 years ago
- [CVPR 2023] NEFER a Dataset for Neuromorphic Event-based Facial Expression Recognition · ★30 · Dec 7, 2023 · Updated 2 years ago
- [WACV 2024 Oral] - ARNIQA: Learning Distortion Manifold for Image Quality Assessment · ★152 · Jul 31, 2025 · Updated 9 months ago
- Official PyTorch implementation for "Merging and Splitting Diffusion Paths for Semantically Coherent Panoramas", presenting the Merge-Att… · ★14 · Jul 9, 2025 · Updated 10 months ago