[WWW 2025] Official PyTorch Code for "CTR-Driven Advertising Image Generation with Multimodal Large Language Models"
☆62Aug 3, 2025Updated 7 months ago
Alternatives and similar repositories for CAIG
Users that are interested in CAIG are comparing it to the libraries listed below
Sorting:
- [AAAI-26] Are We on the Right Way for Assessing Document Retrieval-Augmented Generation?☆26Dec 14, 2025Updated 2 months ago
- Anomaly Detection for time-series using Multilevel Wavelet Decomposition Networks.☆10Dec 11, 2019Updated 6 years ago
- ☆18Jun 14, 2025Updated 8 months ago
- [ECCV2024] Towards Reliable Advertising Image Generation Using Human Feedback☆59Nov 8, 2024Updated last year
- Official PyTorch implementation of paper MAVIN: Multi-Action Video Generation with Diffusion Models via Transition Video Infilling☆13Oct 5, 2024Updated last year
- Official code repository for the paper "ToMAP: Training Opponent-Aware LLM Persuaders with Theory of Mind"☆22Sep 25, 2025Updated 5 months ago
- [ICML 2025] Playmate: Flexible Control of Portrait Animation via 3D-Implicit Space Guided Diffusion☆34Nov 10, 2025Updated 3 months ago
- official code for paper: Exploring Domain Incremental Video Highlights Detection with the LiveFood Benchmark☆41Jan 9, 2024Updated 2 years ago
- Blending Custom Photos with Video Diffusion Transformers☆48Jan 21, 2025Updated last year
- [ACM MM 2025] MLLMs for Aesthetics Reasoning☆23Jan 5, 2026Updated 2 months ago
- Official implementation of the paper "Bind-Your-Avatar: Multi-Talking-Character Video Generation with Dynamic 3D-mask-based Embedding Rou…☆34Sep 25, 2025Updated 5 months ago
- Frame Guidance: Training-Free Guidance for Frame-Level Control in Video Diffusion Models (ICLR 2026)☆42Updated this week
- [CVPR2024] Official code for Drag Your Noise: Interactive Point-based Editing via Diffusion Semantic Propagation☆87Apr 16, 2024Updated last year
- GlyphDraw2: Automatic Generation of Complex Glyph Posters with Diffusion Models and Large Language Models☆87Jul 11, 2024Updated last year
- ☆28Feb 11, 2025Updated last year
- ☆26Oct 30, 2024Updated last year
- ☆43Jul 28, 2025Updated 7 months ago
- Implementation code of the paper MIGE: A Unified Framework for Multimodal Instruction-Based Image Generation and Editing☆72Jul 13, 2025Updated 7 months ago
- [ICCV 2025] VisualCloze: A universal image generation framework that can support a wide range of in-domain tasks and generalize to unseen…☆277Jan 7, 2026Updated last month
- RepText: Rendering Visual Text via Replicating 🔥☆141Jun 7, 2025Updated 8 months ago
- [ICCV 2025] Enhancing spatial understanding in text-to-Image diffusion models☆91Sep 11, 2025Updated 5 months ago
- Official code for paper: Desigen: A Pipeline for Controllable Design Template Generation [CVPR'24]☆75Jul 18, 2024Updated last year
- This is the PyTorch implementation of the Siggraph 2023 paper "Efficient Video Portrait Reenactment via Grid-based Codebook"☆39Aug 28, 2023Updated 2 years ago
- [CVPR 2023] Source code for NoisyTwins: Class-consistent and Diverse Image Generation Through StyleGANs☆36Apr 8, 2024Updated last year
- Official code for ICCV 2025 paper, X2I: Seamless Integration of Multimodal Understanding into Diffusion Transformer via Attention Distill…☆90Jun 26, 2025Updated 8 months ago
- Gambot is an open-source trading bot that identifies profitable sports betting opportunities and executes them on Polymarket. By pulling …☆23Apr 20, 2025Updated 10 months ago
- Creates CMM script that can directly executed on Kaggle from easy merge script☆14Jan 12, 2026Updated last month
- ☆95Mar 3, 2025Updated last year
- [ECCV2024] This is an official inference code of the paper "Glyph-ByT5: A Customized Text Encoder for Accurate Visual Text Rendering" and…☆622Sep 5, 2025Updated 6 months ago
- [CVPR 2024] Dynamic Prompt Optimizing for Text-to-Image Generation☆86Jul 13, 2024Updated last year
- MoMA: Multimodal LLM Adapter for Fast Personalized Image Generation☆234Jul 11, 2024Updated last year
- 基于docker,docker-compose部署目前微服务所需主流基础服务,包括 日志收集组件elk(elasticsearch & logstash & kibana & filebeat), 链路追踪skywalking, 项目可视化组件【收集/监控/告警】(gra…☆10Mar 28, 2022Updated 3 years ago
- [ICCV 2025] VisRL: Intention-Driven Visual Perception via Reinforced Reasoning☆45Nov 8, 2025Updated 3 months ago
- 第二届广州·琶洲算法大赛-智能交通CV模型赛题第4名方案☆11Aug 9, 2023Updated 2 years ago
- DMM: Building a Versatile Image Generation Model via Distillation-Based Model Merging☆47Apr 27, 2025Updated 10 months ago
- ☆11Oct 22, 2023Updated 2 years ago
- ☆18Feb 16, 2025Updated last year
- ☆11Oct 31, 2024Updated last year
- A large scale inpainting & t2i anime image dataset☆14Oct 18, 2025Updated 4 months ago