Generate a transcript for your favourite Manga: Detect manga characters, text blocks and panels. Order panels. Cluster characters. Match texts to their speakers. Perform OCR.
☆430 · Jun 27, 2025 · Updated 8 months ago
Alternatives and similar repositories for magi
Users that are interested in magi are comparing it to the libraries listed below.
- The official repo of the Comics Survey: "A missing piece in Vision and Language: A Survey on Comics Understanding" ☆134 · Jan 2, 2025 · Updated last year
- Official repository of Manga109Dialog (ICME 2024) ☆28 · Aug 3, 2024 · Updated last year
- Various annotations of Manga109 dataset ☆13 · Apr 23, 2025 · Updated 11 months ago
- [ECCV-W] Official repo for the paper "ComiCap: A VLMs pipeline for dense captioning of Comic Panels" ☆14 · Nov 20, 2024 · Updated last year
- Comics Dataset Framework for Comics Understanding ☆41 · Sep 1, 2025 · Updated 6 months ago
- Official repository of the paper: "A Comprehensive Gold Standard and Benchmark for Comics Text Detection and Recognition" ☆26 · Jul 10, 2023 · Updated 2 years ago
- Repository for "CoMix: Comprehensive Benchmark for Multi-Task Comic Understanding" ☆16 · Nov 20, 2024 · Updated last year
- A simple tool to estimate the reading order of comic panels ☆19 · Nov 14, 2022 · Updated 3 years ago
- The (Official) PyTorch Implementation of the paper "Deep Extraction of Manga Structural Lines" ☆185 · Jan 10, 2025 · Updated last year
- 🤖 Manga Layout Analysis via Deep Learning ☆21 · Feb 27, 2024 · Updated 2 years ago
- Instance segmentation for cartoon/anime characters and some visual techniques building around it. ☆188 · Apr 29, 2025 · Updated 10 months ago
- COO: Comic onomatopoeia dataset (ECCV 2022) ☆89 · Feb 18, 2025 · Updated last year
- Bubble Blaster removes text from speech bubbles in mangas/manhwas, made for Scanlations groups. ☆132 · Aug 28, 2024 · Updated last year
- Context-Informed Machine Translation of Manga using Multimodal Large Language Models ☆17 · Dec 3, 2024 · Updated last year
- Simple python API to read annotation data of Manga109 ☆129 · Mar 4, 2022 · Updated 4 years ago
- [NeurIPS 2024] Official Implementation of Attention Interpolation of Text-to-Image Diffusion ☆108 · Nov 20, 2024 · Updated last year
- Kumiko, the Comics Cutter ☆181 · Dec 22, 2024 · Updated last year
- Papers, repository and other data about anime or manga research. Please let me know if you have information that the list does not includ… ☆1,231 · Dec 31, 2025 · Updated 2 months ago
- finetune script for SDXL adapted from waifu-diffusion trainer ☆11 · Aug 21, 2023 · Updated 2 years ago
- Manga&Comic text detection ☆328 · Aug 13, 2023 · Updated 2 years ago
- A tool for segmentation of digital images composed of multiple panels, improving performance of downstream tasks such as OCR, Object det… ☆20 · May 7, 2023 · Updated 2 years ago
- YOLOv8 / YOLOv11 models and code for CG / art image processing ☆20 · Aug 25, 2025 · Updated 6 months ago
- Original Full Repository of the Paper: "Domain-Adaptive Self-Supervised Pre-training for Face & Body Detection in Drawings" ☆18 · Oct 14, 2025 · Updated 5 months ago
- AnimeDiffusion: Anime Diffusion Colorization ☆77 · Jan 7, 2024 · Updated 2 years ago
- Segmentation of text in manga images ☆139 · Feb 6, 2021 · Updated 5 years ago
- Line thickness normalization network used in the SIGGRAPH 2018 paper "Real-Time Data-Driven Interactive Rough Sketch Inking". ☆29 · May 19, 2019 · Updated 6 years ago
- Translate manga/image: one-click translation of text in all kinds of images https://cotrans.touhou.ai/ (no longer working) ☆9,561 · Mar 11, 2026 · Updated last week
- The repository provides code for training the SegmentAnything Model (SAM) for predicting frame polygons in comic books ☆57 · Mar 14, 2024 · Updated 2 years ago
- Create RP training data from a VN, using GPT-4 ☆18 · Nov 2, 2023 · Updated 2 years ago
- Official Code for MotionCtrl [SIGGRAPH 2024] ☆1,492 · Feb 19, 2025 · Updated last year
- API for AnimeRun dataset ☆92 · Aug 2, 2023 · Updated 2 years ago
- This repository contains the official PyTorch implementation of inkn'hue: Enhancing Manga Colorization from Multiple Priors with Alignmen… ☆35 · Jan 23, 2024 · Updated 2 years ago
- Code and datasets for "Text encoders are performance bottlenecks in contrastive vision-language models". Coming soon! ☆11 · May 24, 2023 · Updated 2 years ago
- Scalable group inference for generating high quality and diverse images with diffusion models. ☆42 · Aug 31, 2025 · Updated 6 months ago
- Papers, datasets, and resources related to 2D cartoon video research. Contributions welcome. ☆193 · Dec 8, 2025 · Updated 3 months ago
- Front end ComfyUI nodes for CartoonSegmentation ☆17 · May 22, 2024 · Updated last year