[ECCV-W] Official repo for the paper "ComiCap: A VLMs pipeline for dense captioning of Comic Panels"
☆14 · Nov 20, 2024 · Updated last year
Alternatives and similar repositories for ComiCap
Users that are interested in ComiCap are comparing it to the libraries listed below.
- Official PyTorch Implementation of "Rethinking HTG Evaluation: Bridging Generation and Recognition" (Oral) - 1st Workshop on Critical Eva… ☆17 · Sep 23, 2024 · Updated last year
- Official PyTorch implementation for "Merging and Splitting Diffusion Paths for Semantically Coherent Panoramas", presenting the Merge-Att… ☆15 · Jul 9, 2025 · Updated 11 months ago
- Effective caching in differentially-private databases (SOSP '23) ☆13 · Nov 1, 2023 · Updated 2 years ago
- The official repo of the Comics Survey: "A missing piece in Vision and Language: A Survey on Comics Understanding" ☆139 · Jan 2, 2025 · Updated last year
- Various annotations of Manga109 dataset ☆13 · Apr 23, 2025 · Updated last year
- Official PyTorch implementation of the WACV 2025 Oral paper "Composed Image Retrieval for Training-FREE DOMain Conversion". ☆46 · Aug 31, 2025 · Updated 10 months ago
- The official Python SDK for FastLabel API, the Data Platform for AI ☆16 · Updated this week
- Learning to Count without Annotations ☆23 · May 24, 2024 · Updated 2 years ago
- ☆18 · Oct 1, 2021 · Updated 4 years ago
- Official PyTorch code for MANTRA - Memory Augmented Neural Trajectory Predictor (CVPR2020) ☆78 · Aug 24, 2022 · Updated 3 years ago
- PyTorch implementation of HTR on IAM dataset (word or line level + CTC loss) ☆21 · Jul 28, 2022 · Updated 3 years ago
- Implementation of the paper: "BRAVE : Broadening the visual encoding of vision-language models" ☆26 · Jun 22, 2026 · Updated last week
- Official repository of Manga109Dialog (ICME 2024) ☆29 · Aug 3, 2024 · Updated last year
- Generate a transcript for your favourite Manga: Detect manga characters, text blocks and panels. Order panels. Cluster characters. Match … ☆454 · Jun 27, 2025 · Updated last year
- Deep learning Framework from scratch. ☆11 · Jul 23, 2025 · Updated 11 months ago
- Official PyTorch implementation of the WACV 2025 Oral paper "Crafting Distribution Shifts for Validation and Training in Single Source Do… ☆23 · Aug 31, 2025 · Updated 10 months ago
- This repository aims to collect the articles and codes for the Visual Storytelling (VIST) task. VIST is a vision-and-language task. It ai… ☆26 · Mar 3, 2021 · Updated 5 years ago
- Official PyTorch Implementation of "DiffusionPen: Towards Controlling the Style of Handwritten Text Generation" - ECCV 2024 ☆103 · Oct 24, 2024 · Updated last year
- Benchmark gene representations from different model families ☆16 · May 29, 2025 · Updated last year
- A telegram bot that sends you a message when the GPU is in use ☆11 · May 27, 2024 · Updated 2 years ago
- Basic HTR concepts/modules to boost performance ☆41 · Nov 30, 2024 · Updated last year
- [2024-NeurIPS] TextCtrl: Diffusion-based Scene Text Editing with Prior Guidance Control ☆104 · Mar 16, 2025 · Updated last year
- AlignCLIP: Improving Cross-Modal Alignment in CLIP (ICLR 2025) ☆66 · Mar 1, 2025 · Updated last year
- Optical Character Recognition (OCR / HTR) using Transformers ☆11 · Aug 20, 2022 · Updated 3 years ago
- Competition of Mechanisms: Tracing How Language Models Handle Facts and Counterfactuals; ACL 2024 ☆13 · May 24, 2024 · Updated 2 years ago
- [ICLR 2022] Official implementation of "It Takes Two to Tango: Mixup for Deep Metric Learning". ☆36 · May 15, 2024 · Updated 2 years ago
- Let there be clock in the beach - WACV 2022 ☆15 · Nov 15, 2021 · Updated 4 years ago
- STVQA and TextVQA OCR results from Amazon Text in Image pipeline ☆12 · Jul 18, 2022 · Updated 3 years ago
- This is a repository for the Geospatial Data Abstraction Library (GDAL) and its applications, examples and discussions in the world of s… ☆10 · May 28, 2023 · Updated 3 years ago
- [CVPR 2024 Highlight] - Stationary Representations: Optimally Approximating Compatibility and Implications for Improved Model Replacement… ☆13 · Oct 21, 2024 · Updated last year
- ☆16 · Jun 14, 2024 · Updated 2 years ago
- LocalPlexity is a lite version of Perplexity aimed at 100% privacy and openness. Everything is done locally, in your browser, from search… ☆20 · Aug 12, 2024 · Updated last year
- don't materialize, just rasterize! ☆17 · May 11, 2026 · Updated last month
- Code for our NeurIPS'24 paper ☆38 · Oct 28, 2024 · Updated last year
- FreeDA: Training-Free Open-Vocabulary Segmentation with Offline Diffusion-Augmented Prototype Generation (CVPR 2024) ☆51 · Aug 28, 2024 · Updated last year
- Statistician is a framework of tools for generating statistical summaries of large collections of EO data managed in an ODC instance. ☆13 · Jun 10, 2026 · Updated 3 weeks ago
- Python code to help read ASEG GDF2 packages ☆11 · Feb 17, 2026 · Updated 4 months ago
- Data access library for the MeerKAT radio telescope ☆14 · Jun 9, 2026 · Updated 3 weeks ago
- TiTiler demo APP with Sentinel and Landsat AWS Public Datasets ☆15 · Jan 29, 2024 · Updated 2 years ago