The official repo of the Comics Survey: "A missing piece in Vision and Language: A Survey on Comics Understanding"
☆134Jan 2, 2025Updated last year
Alternatives and similar repositories for awesome-comics-understanding
Users that are interested in awesome-comics-understanding are comparing it to the libraries listed below.
- ☆32May 28, 2025Updated 10 months ago
- Repository for "CoMix: Comprehensive Benchmark for Multi-Task Comic Understanding"☆17Nov 20, 2024Updated last year
- [COG24] - Official repository of "OfflineMania: A Benchmark Environment for Offline Reinforcement Learning in Racing Games"☆12Jul 15, 2024Updated last year
- [CVPR 2024 Highlight] - Stationary Representations: Optimally Approximating Compatibility and Implications for Improved Model Replacement…☆13Oct 21, 2024Updated last year
- [ECCV 2024] - Improving Zero-shot Generalization of Learned Prompts via Unsupervised Knowledge Distillation☆64Feb 20, 2026Updated last month
- Comics Dataset Framework for Comics Understanding☆41Sep 1, 2025Updated 6 months ago
- Handwritten Text Recognition in Few-shot Scenario☆22Mar 25, 2023Updated 3 years ago
- [WACV 2024] - Reference-based Restoration of Digitized Analog Videotapes☆58Feb 11, 2024Updated 2 years ago
- WACV 2022 Paper - Is An Image Worth Five Sentences? A New Look into Semantics for Image-Text Matching☆16Dec 10, 2021Updated 4 years ago
- [ICCVW 2023] - Mapping Memes to Words for Multimodal Hateful Meme Classification☆27Apr 17, 2025Updated 11 months ago
- [ICLR 2024] - Elastic Feature Consolidation for Cold Start Exemplar-Free Incremental Learning☆32May 26, 2025Updated 10 months ago
- [ICCV 2023] - Composed Image Retrieval on Common Objects in context (CIRCO) dataset☆86Aug 6, 2025Updated 7 months ago
- Optical Character Recognition (OCR / HTR) using Transformers☆11Aug 20, 2022Updated 3 years ago
- [IEEE TMM 2023] This is the official repo of the paper "Perceptual Quality Improvement in Videoconferencing using Keyframes-based GAN".☆17Dec 10, 2024Updated last year
- ☆27Mar 7, 2025Updated last year
- Generate a transcript for your favourite Manga: Detect manga characters, text blocks and panels. Order panels. Cluster characters. Match …☆430Jun 27, 2025Updated 9 months ago
- [ECCV'24] NamedCurves: Learned Image Enhancement via Color Naming☆33Sep 8, 2025Updated 6 months ago
- Official Pytorch code for MANTRA - Memory Augmented Neural Trajectory Predictor (CVPR2020)☆79Aug 24, 2022Updated 3 years ago
- Official PyTorch Implementation of DocSynth: A Layout Guided Approach for Controllable Document Image Synthesis - ICDAR 2021☆92Jul 16, 2021Updated 4 years ago
- [ECCV-W] Official repo for the paper "ComiCap: A VLMs pipeline for dense captioning of Comic Panels"☆14Nov 20, 2024Updated last year
- Text-DIAE: A Self-Supervised Degradation Invariant Autoencoders for Text Recognition and Document Enhancement - AAAI 2023☆29Jul 12, 2023Updated 2 years ago
- A Bottom-Up Instance Segmentation Strategy for segmenting document instances using Transformers☆59Sep 9, 2024Updated last year
- [ICCV 2023] - Zero-shot Composed Image Retrieval with Textual Inversion☆196Jul 31, 2025Updated 7 months ago
- [ICIAP 2023] Learning Landmarks Motion from Speech for Speaker-Agnostic 3D Talking Heads Generation☆62Dec 12, 2023Updated 2 years ago
- ICDAR 2019☆25Aug 2, 2019Updated 6 years ago
- Attempt at reproducing the metric from Neurips 2023 Unlearning Challenge on Kaggle. Code for training checkpoints on retain set and unlea…☆12Nov 8, 2023Updated 2 years ago
- Based on the WACV 2020 paper - Fine Grained Classification and Retrieval by Combining Visual and Locally Pooled Textual Features☆25Nov 15, 2021Updated 4 years ago
- Quality-Aware Image-Text Alignment for Opinion-Unaware Image Quality Assessment☆128Mar 10, 2025Updated last year
- OCR Annotations from Amazon Textract for Industry Documents Library☆103Aug 20, 2022Updated 3 years ago
- TextAdaIN: Paying Attention to Shortcut Learning in Text Recognizers☆21Jul 26, 2022Updated 3 years ago
- Implementation on pytorch of the code from the ECCV 2018 paper - Single Shot Scene Text Retrieval☆13Dec 15, 2021Updated 4 years ago
- Code for "A Comprehensive Empirical Evaluation on Online Continual Learning" ICCVW 2023 VCL Workshop☆44Apr 8, 2024Updated last year
- Official repository for our paper on "Attribution-aware Weight Transfer: A Warm-Start Initialization for Class-Incremental Semantic Segme…☆12Jan 3, 2023Updated 3 years ago
- [WACV 2024 Oral] - ARNIQA: Learning Distortion Manifold for Image Quality Assessment☆151Jul 31, 2025Updated 7 months ago
- [CVPR 2023] NEFER a Dataset for Neuromorphic Event-based Facial Expression Recognition☆28Dec 7, 2023Updated 2 years ago
- Official PyTorch implementation for "Merging and Splitting Diffusion Paths for Semantically Coherent Panoramas", presenting the Merge-Att…☆14Jul 9, 2025Updated 8 months ago
- Create handwritten word embeddings from a text recognition Seq2Seq system.☆11Dec 1, 2022Updated 3 years ago
- Official PyTorch Implementation of "Rethinking HTG Evaluation: Bridging Generation and Recognition" (Oral) - 1st Workshop on Critical Eva…☆17Sep 23, 2024Updated last year
- The Land-Diffuser is a novel application of the Denoising Diffusion Probabilistic Model (DDPM) in the realm of 3D Talking Head generation…☆13Dec 23, 2023Updated 2 years ago