YiyanXu / PigeonLinks
Personalized Image Generation with Large Multimodal Models
☆13Updated 5 months ago
Alternatives and similar repositories for Pigeon
Users that are interested in Pigeon are comparing it to the libraries listed below
Sorting:
- AAAI '25. Retrieval-Augmented Multimodal Social Media Popularity Prediction☆21Updated 5 months ago
- Towards Modality Generalization: A Benchmark and Prospective Analysis☆26Updated 5 months ago
- [AAAI'2025] The official implementation code of SIGMA☆34Updated 2 weeks ago
- [NeurIPS 2024] Official Code for the Paper "Multimodal Task Vectors Enable Many-Shot Multimodal In-Context Learning"☆24Updated 6 months ago
- Diffusion Models for Generative Outfit Recommendation☆34Updated last year
- A preview-version of one novel multimodal reasoning benchmark CharmBench.☆23Updated 2 months ago
- A curated list of papers, tools, and resources on Multi-Token Prediction (MTP) and related techniques in Large Language Models (LLMs), Sp…☆22Updated last week
- ☆39Updated 7 months ago
- OOD Generalization相关文章的阅读笔记☆32Updated 10 months ago
- [NeurIPS'24 Oral] HydraLoRA: An Asymmetric LoRA Architecture for Efficient Fine-Tuning☆227Updated 10 months ago
- State-of-the-art Parameter-Efficient MoE Fine-tuning Method☆193Updated last year
- [CVPR2025] Hybrid-Level Instruction Injection for Video Token Compression in Multi-modal Large Language Models☆16Updated 6 months ago
- AlphaEdit: Null-Space Constrained Knowledge Editing for Language Models, ICLR 2025 (Outstanding Paper)☆354Updated 2 weeks ago
- [AAAI2025] Unlearning Concepts in Diffusion Model via Concept Domain Correction and Concept Preserving Gradient☆37Updated 6 months ago
- ☆58Updated last year
- Unified the Anonymous and Camera Ready Version, hope everyone can get an ACCEPT☆251Updated 3 months ago
- 📖 This is a repository for organizing papers, codes, and other resources related to unified multimodal models.☆323Updated 2 weeks ago
- code for GaVaMoE: Gaussian-Variational Gated Mixture of Experts for Explainable Recommendation☆17Updated 10 months ago
- [CVPR2025] Precise, Fast, and Low-cost Concept Erasure in Value Space: Orthogonal Complement Matters☆41Updated 7 months ago
- Large language model review prompts☆242Updated last week
- ☆51Updated 11 months ago
- This repo is the official implementation of "Retrieval-Augmented Dynamic Prompt Tuning for Incomplete Multimodal Learning" accepted by AA…☆53Updated last month
- [arXiv 25] Aesthetics is Cheap, Show me the Text: An Empirical Evaluation of State-of-the-Art Generative Models for OCR☆235Updated 2 months ago
- ICML 2025 Oral: ABKD: Pursuing a Proper Allocation of the Probability Mass in Knowledge Distillation via α-β-Divergence☆39Updated 2 months ago
- [ICML 2025] Official implementation of paper 'Look Twice Before You Answer: Memory-Space Visual Retracing for Hallucination Mitigation in…☆163Updated last month
- [CVPR' 25] Interleaved-Modal Chain-of-Thought☆90Updated last week
- Code for Reducing Hallucinations in Vision-Language Models via Latent Space Steering☆85Updated 11 months ago
- [ICLR 2025 Spotlight] The official implementation of the paper “LOKI:A Comprehensive Synthetic Data Detection Benchmark using Large Multi…☆167Updated 7 months ago
- IJCAI Review & MetaReview Monitor☆107Updated 6 months ago
- [ICCV 2025] FonTS: Text Rendering with Typography and Style Controls☆32Updated last week