adxcreative / COPELinks
☆15Updated last year
Alternatives and similar repositories for COPE
Users that are interested in COPE are comparing it to the libraries listed below
Sorting:
- ☆30Updated 2 years ago
- Official Code for the ICCV23 Paper: "LexLIP: Lexicon-Bottlenecked Language-Image Pre-Training for Large-Scale Image-Text Sparse Retrieval…☆40Updated 2 years ago
- ☆16Updated last year
- Towards Efficient and Effective Text-to-Video Retrieval with Coarse-to-Fine Visual Representation Learning☆21Updated 11 months ago
- Research Code for Multimodal-Cognition Team in Ant Group☆172Updated 3 months ago
- Source code for NoteLLM and NoteLLM-2☆143Updated 9 months ago
- Multi-domain Recommendation with Adapter Tuning☆34Updated last year
- [ACM MM 2024] Improving Composed Image Retrieval via Contrastive Learning with Scaling Positives and Negatives☆39Updated 4 months ago
- Efficient Multimodal Foundation Model Adaptation for Recommendation☆46Updated 2 weeks ago
- Evaluation code and datasets for the ACL 2024 paper, VISTA: Visualized Text Embedding for Universal Multi-Modal Retrieval. The original c…☆46Updated last year
- ☆65Updated 7 months ago
- [ACM MM 2025] The official code of "Breaking the Modality Barrier: Universal Embedding Learning with Multimodal LLMs"☆103Updated 2 months ago
- official code for "Modality Curation: Building Universal Embeddings for Advanced Multimodal Information Retrieval"☆42Updated 7 months ago
- ☆38Updated last month
- The repository of paper Personalized Multimodal Response Generation with Large Language Models☆17Updated last year
- All-In-One VLM: Image + Video + Transfer to Other Languages / Domains (TPAMI 2023)☆167Updated last year
- Product1M☆90Updated 3 years ago
- ☆18Updated 2 years ago
- [SIGIR 2024] - Simple but Effective Raw-Data Level Multimodal Fusion for Composed Image Retrieval☆44Updated last year
- Lion: Kindling Vision Intelligence within Large Language Models☆51Updated 2 years ago
- ☆88Updated last year
- A collection of visual instruction tuning datasets.☆76Updated last year
- [CVPR 2025] LamRA: Large Multimodal Model as Your Advanced Retrieval Assistant☆176Updated 7 months ago
- pytorch open-source library for the paper "AdaTT Adaptive Task-to-Task Fusion Network for Multitask Learning in Recommendations"☆56Updated last year
- mPLUG: Effective and Efficient Vision-Language Learning by Cross-modal Skip-connections. (EMNLP 2022)☆98Updated 2 years ago
- The official code for paper "EasyGen: Easing Multimodal Generation with a Bidirectional Conditional Diffusion Model and LLMs"☆73Updated last year
- [ICCV 2023] ALIP: Adaptive Language-Image Pre-training with Synthetic Caption☆104Updated 2 years ago
- This is the official implementation of the paper "Generative Retrieval with Semantic Tree-Structured Item Identifiers via Contrastive Lea…☆25Updated last year
- The dataset for paper "Why Do We Click: Visual Impression-aware News Recommendation", ACM MM 2021☆15Updated 3 years ago
- This repository is for our survey paper: "A Comprehensive Survey on Multimodal RAG: All Combinations of Modalities as Input and Output"☆44Updated 2 months ago