YiyanXu / PigeonLinks
Personalized Image Generation with Large Multimodal Models
☆14Updated 7 months ago
Alternatives and similar repositories for Pigeon
Users that are interested in Pigeon are comparing it to the libraries listed below
Sorting:
- Diffusion Models for Generative Outfit Recommendation☆36Updated last year
- code for GaVaMoE: Gaussian-Variational Gated Mixture of Experts for Explainable Recommendation☆18Updated last year
- The repository of paper Personalized Multimodal Response Generation with Large Language Models☆17Updated last year
- Efficient Multimodal Foundation Model Adaptation for Recommendation☆45Updated 2 months ago
- Towards Modality Generalization: A Benchmark and Prospective Analysis☆28Updated 6 months ago
- [WWW 2025] Following Clues, Approaching the Truth: Explainable Micro-Video Rumor Detection via Chain-of-Thought Reasoning☆22Updated 9 months ago
- AAAI '25. Retrieval-Augmented Multimodal Social Media Popularity Prediction☆22Updated 7 months ago
- [AAAI'2025] The official implementation code of SIGMA☆37Updated 2 months ago
- [NeurIPS 2024] The implementation of paper "On Softmax Direct Preference Optimization for Recommendation"☆89Updated last year
- [ACM MM'2024]"DiffMM: Multi-Modal Diffusion Model for Recommendation"☆90Updated last year
- ☆60Updated last year
- A Large Short-video Recommendation Dataset with Raw Text/Audio/Image/Videos (Talk Invited by DeepMind).☆237Updated 10 months ago
- [NeurIPS 2024] Official Code for the Paper "Multimodal Task Vectors Enable Many-Shot Multimodal In-Context Learning"☆25Updated 8 months ago
- [CVPR2025] Hybrid-Level Instruction Injection for Video Token Compression in Multi-modal Large Language Models☆16Updated 7 months ago
- OOD Generalization相关文章的阅读笔记☆35Updated last year
- [ACL 2024] PyTorch implementation for "Stealthy Attack on Large Language Model based Recommendation"☆18Updated last year
- A curated list of papers, tools, and resources on Multi-Token Prediction (MTP) and related techniques in Large Language Models (LLMs), Sp…☆28Updated last month
- Agentic MLLMs☆111Updated last month
- [CVPR' 25] Interleaved-Modal Chain-of-Thought☆97Updated 2 weeks ago
- ☆48Updated last year
- [CVPR2025] Precise, Fast, and Low-cost Concept Erasure in Value Space: Orthogonal Complement Matters☆42Updated 9 months ago
- The code for the paper "MISSRec: Pre-training and Transferring Multi-modal Interest-aware Sequence Representation for Recommendation" (AC…☆59Updated last year
- ☆55Updated last year
- The code repo for our AAAI-25 paper 'Harnessing Multimodal Large Language Models for Multimodal Sequential Recommendation'☆41Updated 8 months ago
- ☆29Updated 2 years ago
- ☆41Updated 8 months ago
- [ICLR 2025 Oral 🏆] The implementation of paper "Language Representations Can be What Recommenders Need: Findings and Potentials"☆95Updated 7 months ago
- Code for Reducing Hallucinations in Vision-Language Models via Latent Space Steering☆96Updated last year
- [ICML 2025] Official implementation of paper 'Look Twice Before You Answer: Memory-Space Visual Retracing for Hallucination Mitigation in…☆177Updated 2 months ago
- A Large-scale Multimodal Dataset for recommender System☆172Updated 9 months ago