Generate a transcript for your favourite Manga: Detect manga characters, text blocks and panels. Order panels. Cluster characters. Match texts to their speakers. Perform OCR.
☆426Jun 27, 2025Updated 8 months ago
Alternatives and similar repositories for magi
Users that are interested in magi are comparing it to the libraries listed below
Sorting:
- The official repo of the Comics Survey: "A missing piece in Vision and Language: A Survey on Comics Understanding"☆133Jan 2, 2025Updated last year
- Official repository of Manga109Dialog (ICME 2024)☆27Aug 3, 2024Updated last year
- Comics Dataset Framework for Comics Understanding☆39Sep 1, 2025Updated 6 months ago
- Finding a panel inside a comic page is the hardest thing I've ever done in computer science!☆162Jan 16, 2026Updated last month
- Official repository of the paper: "A Comprehensive Gold Standard and Benchmark for Comics Text Detection and Recognition"☆26Jul 10, 2023Updated 2 years ago
- Official Implementation of "Learning Inclusion Matching for Animation Paint Bucket Colorization"☆301Jun 26, 2025Updated 8 months ago
- [ECCV-W] Official repo for the paper "ComiCap: A VLMs pipeline for dense captioning of Comic Panels"☆14Nov 20, 2024Updated last year
- COO: Comic onomatopoeia dataset (ECCV 2022)☆89Feb 18, 2025Updated last year
- Repository for "CoMix: Comprehensive Benchmark for Multi-Task Comic Understanding"☆16Nov 20, 2024Updated last year
- A simple tool to estimate the reading order of comic panels☆19Nov 14, 2022Updated 3 years ago
- Instance segmentation for cartoon/anime characters and some visual techniques building around it.☆186Apr 29, 2025Updated 10 months ago
- [NeurIPS 2024] Official Implementation of Attention Interpolation of Text-to-Image Diffusion☆107Nov 20, 2024Updated last year
- Various annotations of Manga109 dataset☆13Apr 23, 2025Updated 10 months ago
- The (Official) PyTorch Implementation of the paper "Deep Extraction of Manga Structural Lines"☆183Jan 10, 2025Updated last year
- A tool for segmentation of digital images composing of multiple panels, improving performance of downstream tasks such as OCR, Object det…☆20May 7, 2023Updated 2 years ago
- Kumiko, the Comics Cutter☆180Dec 22, 2024Updated last year
- Papers, repository and other data about anime or manga research. Please let me know if you have information that the list does not includ…☆1,224Dec 31, 2025Updated 2 months ago
- C# console-app to split a comic-page into it's separate panels☆13Jun 22, 2020Updated 5 years ago
- Modified Beam Search with periodical restart☆12Sep 12, 2024Updated last year
- A quick test using a Stable Diffusion server and Godot 4☆11Mar 17, 2023Updated 2 years ago
- finetune script for SDXL adapted from waifu-diffusion trainer☆11Aug 21, 2023Updated 2 years ago
- 🤖 Manga Layout Analysis via Deep Learning☆21Feb 27, 2024Updated 2 years ago
- Official Code for MotionCtrl [SIGGRAPH 2024]☆1,493Feb 19, 2025Updated last year
- Experimental method to use reference video to drive motion in generations without training in ComfyUI.☆37Apr 9, 2024Updated last year
- Simple python API to read annotation data of Manga109☆129Mar 4, 2022Updated 3 years ago
- Scalable group inference for generating high quality and diverse images with diffusion models.☆42Aug 31, 2025Updated 6 months ago
- Manga&Comic text detection☆322Aug 13, 2023Updated 2 years ago
- A ComfyUI custom node that simply integrates the [animate-anyone-reproduction](https://github.com/bendanzzc/AnimateAnyone-reproduction) f…☆37Jun 14, 2024Updated last year
- Segmentation of text in manga images☆137Feb 6, 2021Updated 5 years ago
- Generate a video recap of any manga volume PDF with GPT Vision and Elevenlabs narration. Discord: https://discord.gg/MMqcuDe2WZ☆60May 22, 2025Updated 9 months ago
- Official Implementation of "Lumina-mGPT: Illuminate Flexible Photorealistic Text-to-Image Generation with Multimodal Generative Pretraini…☆638Oct 16, 2025Updated 4 months ago
- Searching a High Performance Feature Extractor for Text Recognition Network. TPAMI 2022☆13Nov 25, 2022Updated 3 years ago
- Code and datasets for "Text encoders are performance bottlenecks in contrastive vision-language models". Coming soon!☆11May 24, 2023Updated 2 years ago
- Original Full Repository of the Paper: "Domain-Adaptive Self-Supervised Pre-training for Face & Body Detection in Drawings"☆17Oct 14, 2025Updated 4 months ago
- This repository contains the official PyTorch implementation of inkn'hue: Enhancing Manga Colorization from Multiple Priors with Alignmen…☆35Jan 23, 2024Updated 2 years ago
- AnimeDiffusion: Anime Diffusion Colorization☆78Jan 7, 2024Updated 2 years ago
- ☆198Mar 18, 2023Updated 2 years ago
- [ICML 2024] Mastering Text-to-Image Diffusion: Recaptioning, Planning, and Generating with Multimodal LLMs (RPG)☆1,844Feb 1, 2025Updated last year
- ☆724Feb 9, 2024Updated 2 years ago