HaoYang0123 / Creative_Generation_Pipeline
☆26Updated last year
Alternatives and similar repositories for Creative_Generation_Pipeline:
Users who are interested in Creative_Generation_Pipeline are comparing it to the libraries listed below
- Diffusion Models for Generative Outfit Recommendation☆26Updated 6 months ago
- Official PyTorch implementation of Clover: Towards A Unified Video-Language Alignment and Fusion Model (CVPR2023)☆41Updated 2 years ago
- [ICLR 2024] Towards Unified Multi-Modal Personalization: Large Vision-Language Models for Generative Recommendation and Beyond☆18Updated 11 months ago
- The official code for paper "EasyGen: Easing Multimodal Generation with a Bidirectional Conditional Diffusion Model and LLMs"☆73Updated 4 months ago
- [ACM MM 2024] Improving Composed Image Retrieval via Contrastive Learning with Scaling Positives and Negatives☆30Updated 5 months ago
- [WWW 2025] Official PyTorch Code for "CTR-Driven Advertising Image Generation with Multimodal Large Language Models"☆24Updated 2 weeks ago
- Product1M☆87Updated 2 years ago
- Official Code for the ICCV23 Paper: "LexLIP: Lexicon-Bottlenecked Language-Image Pre-Training for Large-Scale Image-Text Sparse Retrieval…☆41Updated last year
- The simple demo of `Unified Vision-Language Representation Modeling for E-Commerce Same-Style Products Retrieval`☆13Updated 3 months ago
- [CVPR 2023] VoP: Text-Video Co-operative Prompt Tuning for Cross-Modal Retrieval☆38Updated 2 years ago
- Narrative movie understanding benchmark☆69Updated 10 months ago
- Towards Efficient and Effective Text-to-Video Retrieval with Coarse-to-Fine Visual Representation Learning☆16Updated last month
- Research Code for Multimodal-Cognition Team in Ant Group☆138Updated 8 months ago
- ☆14Updated 9 months ago
- Official repository of MMDU dataset☆86Updated 6 months ago
- ☆155Updated 8 months ago
- ☆17Updated 8 months ago
- [CVPR 2024] Dynamic Prompt Optimizing for Text-to-Image Generation☆68Updated 8 months ago
- ACM MM'23 (oral), SUR-adapter for pre-trained diffusion models can acquire the powerful semantic understanding and reasoning capabilities…☆119Updated 11 months ago
- A Large-scale Multimodal Dataset for recommender System☆136Updated last week
- A Hierarchical Attention Model for Social Contextual Image Recommendation, TKDE2019☆13Updated 4 years ago
- Multi-domain Recommendation with Adapter Tuning☆28Updated last year
- [CVPR2025] Precise, Fast, and Low-cost Concept Erasure in Value Space: Orthogonal Complement Matters☆25Updated 3 weeks ago
- A collection of visual instruction tuning datasets.☆76Updated last year
- ☆38Updated last year
- Evaluation code and datasets for the ACL 2024 paper, VISTA: Visualized Text Embedding for Universal Multi-Modal Retrieval. The original c…☆35Updated 4 months ago
- ☆61Updated last year
- ☆44Updated last year
- SIGIR paper Conversational Fashion Image Retrieval via Multiturn Natural Language Feedback☆14Updated 2 years ago
- ☆36Updated 8 months ago