[CVPR 2025] This is an official inference code of the paper "BizGen: Advancing Article-level Visual Text Rendering for Infographics Generation" . Project page: https://bizgen-msra.github.io/
☆301Apr 5, 2025Updated 11 months ago
Alternatives and similar repositories for bizgen
Users that are interested in bizgen are comparing it to the libraries listed below
Sorting:
- Official code for AccVideo: Accelerating Video Diffusion Model with Synthetic Dataset☆284Jun 10, 2025Updated 9 months ago
- [CVPR 2025] Official repo for ART:Anonymous Region Transformer for Variable Multi-Layer Transparent Image Generation☆367Feb 5, 2026Updated last month
- ☆152Dec 17, 2024Updated last year
- [ICCV 2025] AnimeGamer: Infinite Anime Life Simulation with Next Game State Prediction☆345Apr 9, 2025Updated 11 months ago
- [ICCV 2025] Video-T1: Test-Time Scaling for Video Generation☆307Mar 7, 2026Updated 2 weeks ago
- OpenCOLE: Towards Reproducible Automatic Graphic Design Generation [Inoue+, CVPRW2024 (GDUG)]☆84Mar 12, 2025Updated last year
- ControlText: Unlocking Controllable Fonts in Multilingual Text Rendering without Font Annotations☆34Apr 3, 2025Updated 11 months ago
- ☆1,053May 14, 2025Updated 10 months ago
- [ICCV 2025] 🔥🔥 UNO: A Universal Customization Method for Both Single and Multi-Subject Conditioning☆1,354Sep 12, 2025Updated 6 months ago
- [ECCV2024] This is an official inference code of the paper "Glyph-ByT5: A Customized Text Encoder for Accurate Visual Text Rendering" and…☆623Sep 5, 2025Updated 6 months ago
- Official Implementation of "LeX-Art: Rethinking Text Generation via Scalable High-Quality Data Synthesis"☆82Aug 25, 2025Updated 6 months ago
- 🔥 [ICCV 2025 Highlight] InfiniteYou: Flexible Photo Recrafting While Preserving Your Identity☆2,670Aug 22, 2025Updated 6 months ago
- ☆2,500Jul 16, 2025Updated 8 months ago
- Lumina-mGPT 2.0: Stand-Alone AutoRegressive Image Modeling☆1,081Nov 3, 2025Updated 4 months ago
- Implementation of "EasyControl: Adding Efficient and Flexible Control for Diffusion Transformer"(ICCV2025)☆1,722Jul 25, 2025Updated 7 months ago
- SkyReels-A2: Compose anything in video diffusion transformers☆706Jun 3, 2025Updated 9 months ago
- Official repository of "GoT: Unleashing Reasoning Capability of Multimodal Large Language Model for Visual Generation and Editing"☆312Sep 28, 2025Updated 5 months ago
- GlyphDraw2: Automatic Generation of Complex Glyph Posters with Diffusion Models and Large Language Models☆88Jul 11, 2024Updated last year
- An unified model that seamlessly integrates multimodal understanding, text-to-image generation, and image editing within a single powerfu…☆452Dec 2, 2025Updated 3 months ago
- [CVPR 2025] Code for Segment Any Motion in Videos☆468Jun 10, 2025Updated 9 months ago
- [NeurIPS 2025] OmniSVG is the first family of end-to-end multimodal SVG generators that leverage pre-trained Vision-Language Models (VLMs…☆2,411Mar 1, 2026Updated 3 weeks ago
- In-context subject-driven image generation while preserving foreground fidelity☆352Jun 11, 2025Updated 9 months ago
- Repository of AudioX☆1,381Mar 10, 2026Updated last week
- Phantom: Subject-Consistent Video Generation via Cross-Modal Alignment☆1,498Sep 11, 2025Updated 6 months ago
- ☆151Jan 31, 2024Updated 2 years ago
- Animate Any Character in Any World☆96Mar 10, 2026Updated last week
- ☆786Jul 17, 2025Updated 8 months ago
- SynCD: Generating Multi-Image Synthetic Data for Text-to-Image Customization (ICCV 2025)☆154Oct 16, 2025Updated 5 months ago
- [ICCV 2025] Official implementations for paper: VACE: All-in-One Video Creation and Editing☆3,699Oct 17, 2025Updated 5 months ago
- [ACM MM 2025] FantasyTalking: Realistic Talking Portrait Generation via Coherent Motion Synthesis☆1,623Jan 26, 2026Updated last month
- PosterMaker [CVPR 2025] https://poster-maker.github.io/☆148Nov 12, 2025Updated 4 months ago
- [ACM MM 2024 (Oral)] Official PyTorch Implementation of Paper "MovingColor: Seamless Fusion of Fine-grained Video Color Enhancement"☆11Dec 30, 2024Updated last year
- [CVPR 2025] MIDI: Multi-Instance Diffusion for Single Image to 3D Scene Generation☆901Jun 12, 2025Updated 9 months ago
- Official Repo for "TheoremExplainAgent: Towards Video-based Multimodal Explanations for LLM Theorem Understanding" [ACL 2025 oral]☆1,471Jul 27, 2025Updated 7 months ago
- [NeurIPS 2025] T2I-R1: Reinforcing Image Generation with Collaborative Semantic-level and Token-level CoT☆432Sep 18, 2025Updated 6 months ago
- Enhance-A-Video: Better Generated Video for Free☆593Mar 17, 2025Updated last year
- [SIGGRAPH 2025] Official code of the paper "FlexiAct: Towards Flexible Action Control in Heterogeneous Scenarios"☆344Oct 30, 2025Updated 4 months ago
- [CVPR 2026] 🔥🔥 Official Repo of UMO: Scaling Multi-Identity Consistency for Image Customization via Matching Reward☆182Sep 15, 2025Updated 6 months ago
- [ICCV 2025] Light-A-Video: Training-free Video Relighting via Progressive Light Fusion☆507Oct 25, 2025Updated 4 months ago