1230young / bizgen
[CVPR 2025] This is the official inference code for the paper "BizGen: Advancing Article-level Visual Text Rendering for Infographics Generation". Project page: https://bizgen-msra.github.io/
☆298 Apr 5, 2025 Updated 10 months ago
Alternatives and similar repositories for bizgen
Users who are interested in bizgen are comparing it to the libraries listed below
- [CVPR 2025] Official repo for ART: Anonymous Region Transformer for Variable Multi-Layer Transparent Image Generation ☆364 Feb 5, 2026 Updated last week
- ☆152 Dec 17, 2024 Updated last year
- Official code for AccVideo: Accelerating Video Diffusion Model with Synthetic Dataset ☆272 Jun 10, 2025 Updated 8 months ago
- [ICCV 2025] AnimeGamer: Infinite Anime Life Simulation with Next Game State Prediction ☆346 Apr 9, 2025 Updated 10 months ago
- Official Implementation of "LeX-Art: Rethinking Text Generation via Scalable High-Quality Data Synthesis" ☆78 Aug 25, 2025 Updated 5 months ago
- [ICCV 2025] Video-T1: Test-Time Scaling for Video Generation ☆306 Jun 29, 2025 Updated 7 months ago
- OpenCOLE: Towards Reproducible Automatic Graphic Design Generation [Inoue+, CVPRW2024 (GDUG)] ☆85 Mar 12, 2025 Updated 11 months ago
- ☆1,048 May 14, 2025 Updated 9 months ago
- ControlText: Unlocking Controllable Fonts in Multilingual Text Rendering without Font Annotations ☆33 Apr 3, 2025 Updated 10 months ago
- [ECCV2024] This is an official inference code of the paper "Glyph-ByT5: A Customized Text Encoder for Accurate Visual Text Rendering" and… ☆621 Sep 5, 2025 Updated 5 months ago
- A unified model that seamlessly integrates multimodal understanding, text-to-image generation, and image editing within a single powerfu… ☆449 Dec 2, 2025 Updated 2 months ago
- [ICCV 2025] 🔥🔥 UNO: A Universal Customization Method for Both Single and Multi-Subject Conditioning ☆1,350 Sep 12, 2025 Updated 5 months ago
- GlyphDraw2: Automatic Generation of Complex Glyph Posters with Diffusion Models and Large Language Models ☆86 Jul 11, 2024 Updated last year
- ☆2,500 Jul 16, 2025 Updated 7 months ago
- [CVPR 2025] Code for Segment Any Motion in Videos ☆459 Jun 10, 2025 Updated 8 months ago
- SkyReels-A2: Compose anything in video diffusion transformers ☆701 Jun 3, 2025 Updated 8 months ago
- In-context subject-driven image generation while preserving foreground fidelity ☆350 Jun 11, 2025 Updated 8 months ago
- ☆151 Jan 31, 2024 Updated 2 years ago
- Phantom: Subject-Consistent Video Generation via Cross-Modal Alignment ☆1,480 Sep 11, 2025 Updated 5 months ago
- Official repository of "GoT: Unleashing Reasoning Capability of Multimodal Large Language Model for Visual Generation and Editing" ☆308 Sep 28, 2025 Updated 4 months ago
- Implementation of "EasyControl: Adding Efficient and Flexible Control for Diffusion Transformer" (ICCV2025) ☆1,715 Jul 25, 2025 Updated 6 months ago
- Lumina-mGPT 2.0: Stand-Alone AutoRegressive Image Modeling ☆1,078 Nov 3, 2025 Updated 3 months ago
- 🔥 [ICCV 2025 Highlight] InfiniteYou: Flexible Photo Recrafting While Preserving Your Identity ☆2,663 Aug 22, 2025 Updated 5 months ago
- [NeurIPS 2025] OmniSVG is the first family of end-to-end multimodal SVG generators that leverage pre-trained Vision-Language Models (VLMs… ☆2,354 Feb 2, 2026 Updated 2 weeks ago
- Repository of AudioX ☆1,135 Updated this week
- Official PyTorch Implementation of Ctrl-Crash 💥 ☆48 Jun 3, 2025 Updated 8 months ago
- [ACM MM 2025] FantasyTalking: Realistic Talking Portrait Generation via Coherent Motion Synthesis ☆1,620 Jan 26, 2026 Updated 3 weeks ago
- Animate Any Character in Any World ☆88 Jan 9, 2026 Updated last month
- [ICCV 2025] Official implementations for paper: VACE: All-in-One Video Creation and Editing ☆3,638 Oct 17, 2025 Updated 4 months ago
- [CVPR 2025] MIDI: Multi-Instance Diffusion for Single Image to 3D Scene Generation ☆880 Jun 12, 2025 Updated 8 months ago
- DMM: Building a Versatile Image Generation Model via Distillation-Based Model Merging ☆47 Apr 27, 2025 Updated 9 months ago
- PosterMaker [CVPR 2025] https://poster-maker.github.io/ ☆143 Nov 12, 2025 Updated 3 months ago
- [NeurIPS 2025] T2I-R1: Reinforcing Image Generation with Collaborative Semantic-level and Token-level CoT ☆430 Sep 18, 2025 Updated 4 months ago
- A SOTA open-source image editing model, which aims to provide comparable performance against the closed-source models like GPT-4o and Gem… ☆2,137 Dec 29, 2025 Updated last month
- [SIGGRAPH Asia 2025] DreamO: A Unified Framework for Image Customization ☆1,745 Aug 14, 2025 Updated 6 months ago
- Enhance-A-Video: Better Generated Video for Free ☆593 Mar 17, 2025 Updated 11 months ago
- SynCD: Generating Multi-Image Synthetic Data for Text-to-Image Customization (ICCV 2025) ☆151 Oct 16, 2025 Updated 4 months ago
- [ICCV 2025] Light-A-Video: Training-free Video Relighting via Progressive Light Fusion ☆502 Oct 25, 2025 Updated 3 months ago
- ☆787 Jul 17, 2025 Updated 7 months ago