adxcreative / COPELinks
☆15Updated last year
Alternatives and similar repositories for COPE
Users that are interested in COPE are comparing it to the libraries listed below
Sorting:
- Official Code for the ICCV23 Paper: "LexLIP: Lexicon-Bottlenecked Language-Image Pre-Training for Large-Scale Image-Text Sparse Retrieval…☆40Updated 2 years ago
- ☆16Updated last year
- Research Code for Multimodal-Cognition Team in Ant Group☆172Updated 3 months ago
- Towards Efficient and Effective Text-to-Video Retrieval with Coarse-to-Fine Visual Representation Learning☆21Updated 11 months ago
- ☆30Updated 2 years ago
- Efficient Multimodal Foundation Model Adaptation for Recommendation☆46Updated 2 weeks ago
- Multi-domain Recommendation with Adapter Tuning☆34Updated last year
- Source code for NoteLLM and NoteLLM-2☆143Updated 9 months ago
- ☆45Updated 10 months ago
- The repository of paper Personalized Multimodal Response Generation with Large Language Models☆17Updated last year
- Evaluation code and datasets for the ACL 2024 paper, VISTA: Visualized Text Embedding for Universal Multi-Modal Retrieval. The original c…☆46Updated last year
- [ACM MM 2024] Improving Composed Image Retrieval via Contrastive Learning with Scaling Positives and Negatives☆39Updated 4 months ago
- [NeurIPS 2025] Reasoning MLLM, Share-GRPO, advantage vanishing, sparse reward☆35Updated 4 months ago
- [ICLR 2024] Towards Unified Multi-Modal Personalization: Large Vision-Language Models for Generative Recommendation and Beyond☆22Updated last year
- official repo for `thinking with images through-self-calling`☆20Updated last month
- This is the official implementation of the paper "Generative Retrieval with Semantic Tree-Structured Item Identifiers via Contrastive Lea…☆25Updated last year
- [Paper][AAAI2024]Structure-CLIP: Towards Scene Graph Knowledge to Enhance Multi-modal Structured Representations☆153Updated last year
- A Large-scale Multimodal Dataset for recommender System☆180Updated 10 months ago
- [ICLR 2023] This is the code repo for our ICLR‘23 paper "Universal Vision-Language Dense Retrieval: Learning A Unified Representation Spa…☆53Updated last year
- A collection of visual instruction tuning datasets.☆76Updated last year
- Official PyTorch Implementation of MLLM Is a Strong Reranker: Advancing Multimodal Retrieval-augmented Generation via Knowledge-enhanced …☆91Updated last year
- Product1M☆90Updated 3 years ago
- ☆18Updated 2 years ago
- The dataset for paper "Why Do We Click: Visual Impression-aware News Recommendation", ACM MM 2021☆15Updated 3 years ago
- The website of CIKM 2023 resource paper "KuaiSAR: A Unified Search And Recommendation Dataset"☆29Updated 2 years ago
- All-In-One VLM: Image + Video + Transfer to Other Languages / Domains (TPAMI 2023)☆167Updated last year
- This project aims to collect and collate various datasets for multimodal large model training, including but not limited to pre-training …☆68Updated 9 months ago
- mPLUG: Effective and Efficient Vision-Language Learning by Cross-modal Skip-connections. (EMNLP 2022)☆98Updated 2 years ago
- The official code for paper "EasyGen: Easing Multimodal Generation with a Bidirectional Conditional Diffusion Model and LLMs"☆73Updated last year
- [SIGIR 2024] - Simple but Effective Raw-Data Level Multimodal Fusion for Composed Image Retrieval☆44Updated last year