HaoYang0123 / Creative_Generation_PipelineLinks
☆29Updated 2 years ago
Alternatives and similar repositories for Creative_Generation_Pipeline
Users that are interested in Creative_Generation_Pipeline are comparing it to the libraries listed below
Sorting:
- Official Code for the ICCV23 Paper: "LexLIP: Lexicon-Bottlenecked Language-Image Pre-Training for Large-Scale Image-Text Sparse Retrieval…☆40Updated 2 years ago
- Product1M☆89Updated 3 years ago
- ☆21Updated last month
- Evaluation code and datasets for the ACL 2024 paper, VISTA: Visualized Text Embedding for Universal Multi-Modal Retrieval. The original c…☆43Updated last year
- [ACM MM 2024] Improving Composed Image Retrieval via Contrastive Learning with Scaling Positives and Negatives☆39Updated 2 months ago
- [WWW 2025] Official PyTorch Code for "CTR-Driven Advertising Image Generation with Multimodal Large Language Models"☆58Updated 3 months ago
- ☆60Updated 5 months ago
- mPLUG: Effective and Efficient Vision-Language Learning by Cross-modal Skip-connections. (EMNLP 2022)☆97Updated 2 years ago
- The official code for paper "EasyGen: Easing Multimodal Generation with a Bidirectional Conditional Diffusion Model and LLMs"☆74Updated last year
- Offical PyTorch implementation of Clover: Towards A Unified Video-Language Alignment and Fusion Model (CVPR2023)☆40Updated 2 years ago
- [CVPR 2023] VoP: Text-Video Co-operative Prompt Tuning for Cross-Modal Retrieval☆38Updated 2 years ago
- The repository of paper Personalized Multimodal Response Generation with Large Language Models☆17Updated last year
- ACM MM'23 (oral), SUR-adapter for pre-trained diffusion models can acquire the powerful semantic understanding and reasoning capabilities…☆121Updated 2 months ago
- ☆14Updated 11 months ago
- official code for "Modality Curation: Building Universal Embeddings for Advanced Multimodal Information Retrieval"☆38Updated 4 months ago
- Research Code for Multimodal-Cognition Team in Ant Group☆169Updated last month
- ☆30Updated last month
- Narrative movie understanding benchmark☆77Updated 5 months ago
- LayoutDiT: Exploring Content-Graphic Balance in Layout Generation with Diffusion Transformer☆49Updated 11 months ago
- Dataset pruning for ImageNet and LAION-2B.☆79Updated last year
- ☆37Updated last year
- [ECCV2024] Towards Reliable Advertising Image Generation Using Human Feedback☆59Updated last year
- [ICLR 2024] Towards Unified Multi-Modal Personalization: Large Vision-Language Models for Generative Recommendation and Beyond☆20Updated last year
- Diffusion Models for Generative Outfit Recommendation☆36Updated last year
- EditScore: Unlocking Online RL for Image Editing via High-Fidelity Reward Modeling☆171Updated last week
- ☆45Updated 2 years ago
- ☆46Updated 3 years ago
- ChatSD is designed to make image generation tasks easily☆21Updated 2 years ago
- TaiSu(太素)--a large-scale Chinese multimodal dataset(亿级大规模中文视觉语言预训练数据集)☆190Updated 2 years ago
- M5Product: Self-harmonized Contrastive Learning for E-commercial Multi-modal Pretraining CVPR 2022☆34Updated 3 years ago