ragavsachdeva / magi
Generate a transcript for your favourite Manga: Detect manga characters, text blocks and panels. Order panels. Cluster characters. Match texts to their speakers. Perform OCR.
☆289Updated 2 months ago
Related projects ⓘ
Alternatives and complementary repositories for magi
- Instance segmentation for cartoon/anime characters and some visual techniques building around it.☆146Updated 7 months ago
- Official Implementation of "Learning Inclusion Matching for Animation Paint Bucket Colorization"☆254Updated last week
- The official code of paper "LVCD: Reference-based Lineart Video Colorization with Diffusion Models"☆149Updated last week
- [ECCV 2024] Be-Your-Outpainter https://arxiv.org/abs/2403.13745☆214Updated last week
- Apply screentone to line drawings or colored illustrations with diffusion models.☆35Updated 4 months ago
- Cinemo: Consistent and Controllable Image Animation with Motion Diffusion Models☆231Updated 2 weeks ago
- Official implement of ID-Aligner☆119Updated 6 months ago
- InteractiveVideo: User-Centric Controllable Video Generation with Synergistic Multimodal Instructions☆126Updated 9 months ago
- Put Your Face Everywhere in Seconds.☆309Updated 11 months ago
- [ECCV 2024] OMG: Occlusion-friendly Personalized Multi-concept Generation In Diffusion Models☆635Updated 4 months ago
- SCEPTER is an open-source framework used for training, fine-tuning, and inference with generative models.☆424Updated this week
- [ECCV 2024] DragAnything: Motion Control for Anything using Entity Representation☆423Updated 4 months ago
- Official Pytorch Implementation of "FineControlNet: Fine-level Text Control for Image Generation with Spatially Aligned Text Control Inje…☆178Updated 9 months ago
- Code and data for "AnyV2V: A Tuning-Free Framework For Any Video-to-Video Editing Tasks" (TMLR 2024)☆501Updated last week
- Official PyTorch implementation for the paper "AnimateZero: Video Diffusion Models are Zero-Shot Image Animators"☆350Updated 11 months ago
- ☆103Updated 8 months ago
- ☆122Updated this week
- [SIGGRAPH Asia 2024 (Journal Track)]StyleCrafter: Enhancing Stylized Text-to-Video Generation with Style Adapter☆193Updated 3 months ago
- This repository contains the official PyTorch implementation of inkn'hue: Enhancing Manga Colorization from Multiple Priors with Alignmen…☆21Updated 9 months ago
- Official repository of In-Context LoRA for Diffusion Transformers☆362Updated this week
- ☆196Updated 9 months ago
- ☆113Updated last week
- Code for [CVPR 2024] VideoSwap: Customized Video Subject Swapping with Interactive Semantic Point Correspondence☆349Updated 7 months ago
- Training-free Regional Prompting for Diffusion Transformers 🔥☆267Updated this week
- SigLIP-based Aesthetic Score Predictor☆135Updated 2 weeks ago
- Code repository for T2V-Turbo and T2V-Turbo-v2☆247Updated 2 weeks ago
- Official implementation of CVPR 2024 paper: "FreeControl: Training-Free Spatial Control of Any Text-to-Image Diffusion Model with Any Con…☆441Updated 2 weeks ago
- I2V-Adapter: A General Image-to-Video Adapter for Video Diffusion Models☆200Updated 10 months ago