emanuelevivoli / CoMix-datasetView external linksLinks
Repository for "CoMix: Comprehensive Benchmark for Multi-Task Comic Understanding"
☆16Nov 20, 2024Updated last year
Alternatives and similar repositories for CoMix-dataset
Users that are interested in CoMix-dataset are comparing it to the libraries listed below
Sorting:
- [CVPR 2024 Highlight] - Stationary Representations: Optimally Approximating Compatibility and Implications for Improved Model Replacement…☆13Oct 21, 2024Updated last year
- Comics Dataset Framework for Comics Understanding☆39Sep 1, 2025Updated 5 months ago
- [ECCV-W] Official repo for the paper "ComiCap: A VLMs pipeline for dense captioning of Comic Panels"☆14Nov 20, 2024Updated last year
- The official repo of the Comics Survey: "A missing piece in Vision and Language: A Survey on Comics Understanding"☆134Jan 2, 2025Updated last year
- [COG24] - Official repository of "OfflineMania: A Benchmark Environment for Offline Reinforcement Learning in Racing Games"☆12Jul 15, 2024Updated last year
- [IEEE TMM 2023] This is the official repo of the paper "Perceptual Quality Improvement in Videoconferencing using Keyframes-based GAN".☆17Dec 10, 2024Updated last year
- [ICCVW 2023] - Mapping Memes to Words for Multimodal Hateful Meme Classification☆27Apr 17, 2025Updated 9 months ago
- Official PyTorch Implementation of "Rethinking HTG Evaluation: Bridging Generation and Recognition" (Oral) - 1st Workshop on Critical Eva…☆17Sep 23, 2024Updated last year
- MangaLMM – Try the official demo below☆33Nov 9, 2025Updated 3 months ago
- A simple tool to estimate the reading order of comic panels☆19Nov 14, 2022Updated 3 years ago
- ☆32May 28, 2025Updated 8 months ago
- [WACV 2024] - Reference-based Restoration of Digitized Analog Videotapes☆55Feb 11, 2024Updated 2 years ago
- This repo contains the code of "Contrastive Supervised Distillation for Continual Representation Learning", Tommaso Barletti, Niccolò Bio…☆20Jul 5, 2022Updated 3 years ago
- Official repository of Manga109Dialog (ICME 2024)☆26Aug 3, 2024Updated last year
- [ECCV 2024] - Improving Zero-shot Generalization of Learned Prompts via Unsupervised Knowledge Distillation☆64Jul 10, 2025Updated 7 months ago
- Official repository of the paper: "A Comprehensive Gold Standard and Benchmark for Comics Text Detection and Recognition"☆26Jul 10, 2023Updated 2 years ago
- Text-DIAE: A Self-Supervised Degradation Invariant Autoencoders for Text Recognition and Document Enhancement - AAAI 2023☆28Jul 12, 2023Updated 2 years ago
- The Best of Both Worlds: Integrating Language Models and Diffusion Models for Video Generation☆39May 4, 2025Updated 9 months ago
- [AAAI'25 Oral] NightReID: A Large-Scale Nighttime Person Re-Identification Benchmark☆10Jun 10, 2025Updated 8 months ago
- Local text-to-speech in your browser with Piper TTS☆16Aug 13, 2025Updated 6 months ago
- [TPAMI2025] BackMix: Regularizing Open Set Recognition by Removing Underlying Fore-Background Priors☆14Apr 23, 2025Updated 9 months ago
- Official PyTorch implementation of the WACV 2025 Oral paper "Composed Image Retrieval for Training-FREE DOMain Conversion".☆46Aug 31, 2025Updated 5 months ago
- Data Programming for Text Detection in Documents using SPEAR☆12Mar 26, 2025Updated 10 months ago
- ☆11Sep 30, 2024Updated last year
- ☆13Feb 2, 2025Updated last year
- Effective caching in differentially-private databases (SOSP '23)☆12Nov 1, 2023Updated 2 years ago
- Competition of Mechanisms: Tracing How Language Models Handle Facts and Counterfactuals☆12May 24, 2024Updated last year
- ☆11Jul 26, 2024Updated last year
- Awesome Multimodal Fusion in Speech Emotion Recognition☆13Nov 11, 2025Updated 3 months ago
- A Pytorch Lightning WGAN-gp to generate faces☆11Jan 26, 2021Updated 5 years ago
- Synthesizing Efficient Data with Diffusion Models for Person Re-Identification Pre-Training☆11Jan 23, 2024Updated 2 years ago
- This is the official implementation for our TGRS 2024 paper "Text-Guided Diverse Image Synthesis for Long-Tailed Remote Sensing Object Cl…☆17Jul 3, 2024Updated last year
- Optocal Character Recognition (OCR / HTR) using Transformers☆11Aug 20, 2022Updated 3 years ago
- ☆14Apr 23, 2025Updated 9 months ago
- Federated Learning of Diffusion Models☆12Aug 30, 2023Updated 2 years ago
- STVQA and TextVQA OCR results from Amazon Text in Image pipeline☆11Jul 18, 2022Updated 3 years ago
- Variational Bayesian GP-LVM model in MATLAB.☆12Jun 8, 2015Updated 10 years ago
- Accompanying code for "Analyzing Vision Tranformers in Class Embedding Space" (NeurIPS '23)☆15Jun 10, 2024Updated last year
- This repository contains the implementation of the method described in our paper, "Divide and Conquer: Isolating Normal-Abnormal Attribut…☆10Apr 9, 2024Updated last year