ragavsachdeva / magi
Generate a transcript for your favourite Manga: Detect manga characters, text blocks and panels. Order panels. Cluster characters. Match texts to their speakers. Perform OCR.
☆293Updated 2 months ago
Related projects ⓘ
Alternatives and complementary repositories for magi
- Official code for "RB-Modulation: Training-Free Personalization of Diffusion Models using Stochastic Optimal Control"☆341Updated 2 months ago
- Official implementation of CVPR 2024 paper: "FreeControl: Training-Free Spatial Control of Any Text-to-Image Diffusion Model with Any Con…☆444Updated last month
- ☆397Updated 8 months ago
- [ECCV 2024] DragAnything: Motion Control for Anything using Entity Representation☆433Updated 4 months ago
- Training-free Regional Prompting for Diffusion Transformers 🔥☆362Updated last week
- Instance segmentation for cartoon/anime characters and some visual techniques building around it.☆148Updated 7 months ago
- Implicit Style-Content Separation using B-LoRA☆302Updated last week
- ☆441Updated this week
- Official implementation of the ECCV paper "SwapAnything: Enabling Arbitrary Object Swapping in Personalized Visual Editing"☆232Updated last month
- Put Your Face Everywhere in Seconds.☆310Updated 11 months ago
- CSGO: Content-Style Composition in Text-to-Image Generation 🔥☆259Updated 2 months ago
- Official implementation of Ctrl-Adapter: An Efficient and Versatile Framework for Adapting Diverse Controls to Any Diffusion Model☆391Updated 5 months ago
- [CVPR 2024] X-Adapter: Adding Universal Compatibility of Plugins for Upgraded Diffusion Model☆743Updated 3 months ago
- [NeurIPS2023] This is the official code of the paper "GlyphControl: Glyph Conditional Control for Visual Text Generation"☆209Updated 4 months ago
- [ECCV 2024] FreeInit: Bridging Initialization Gap in Video Diffusion Models☆492Updated 10 months ago
- Cinemo: Consistent and Controllable Image Animation with Motion Diffusion Models☆237Updated last month
- Code for DesignEdit☆309Updated 4 months ago
- Memory optimized finetuning scripts for CogVideoX using TorchAO and DeepSpeed☆421Updated this week
- The official code of paper "LVCD: Reference-based Lineart Video Colorization with Diffusion Models"☆156Updated 3 weeks ago
- Official Implementation of "Learning Inclusion Matching for Animation Paint Bucket Colorization"☆260Updated last week
- NeurIPS 2024☆330Updated last month
- [NeurIPS 2024] Boosting the performance of consistency models with PCM!☆367Updated 3 weeks ago
- Code and data for "AnyV2V: A Tuning-Free Framework For Any Video-to-Video Editing Tasks" (TMLR 2024)☆512Updated 3 weeks ago
- UniPortrait: A Unified Framework for Identity-Preserving Single- and Multi-Human Image Personalization☆204Updated last month
- [SIGGRAPH Asia 2024 (Journal Track)]StyleCrafter: Enhancing Stylized Text-to-Video Generation with Style Adapter☆196Updated 4 months ago
- [ECCV 2024] Be-Your-Outpainter https://arxiv.org/abs/2403.13745☆219Updated 3 weeks ago
- a CLI utility/library for AnimateDiff stable diffusion generation☆262Updated this week
- Official implementation of "Ctrl-X: Controlling Structure and Appearance for Text-To-Image Generation Without Guidance" (NeurIPS 2024)☆255Updated this week