Xiaohao-Liu / ModalBed
Towards Modality Generalization: A Benchmark and Prospective Analysis
☆24 Updated 3 months ago
Alternatives and similar repositories for ModalBed
Users who are interested in ModalBed are comparing it to the libraries listed below.
- ☆11 Updated 3 months ago
- [ICLR 2024] Towards Unified Multi-Modal Personalization: Large Vision-Language Models for Generative Recommendation and Beyond ☆20 Updated last year
- Official implementation for "ALI-Agent: Assessing LLMs' Alignment with Human Values via Agent-based Evaluation" ☆16 Updated this week
- Official Code of our AAAI-24 Paper: "Generative Multi-modal Knowledge Retrieval with Large Language Models". ☆26 Updated 4 months ago
- [NeurIPS 2024] The implementation of paper "On Softmax Direct Preference Optimization for Recommendation" ☆72 Updated 5 months ago
- Awesome-Efficient-Inference-for-LRMs is a collection of state-of-the-art, novel, exciting, token-efficient methods for Large Reasoning Mo… ☆63 Updated 3 weeks ago
- ☆14 Updated 8 months ago
- The implementation of paper "EliMRec: Eliminating single-modal bias in multimedia recommendation", MM'22. ☆19 Updated last year
- The code for the paper "MISSRec: Pre-training and Transferring Multi-modal Interest-aware Sequence Representation for Recommendation" (AC… ☆45 Updated last year
- The implementation of paper "Leveraging Multimodal Features and Item-level User Feedback for Bundle Construction", WSDM'24. ☆15 Updated 8 months ago
- Code for GaVaMoE: Gaussian-Variational Gated Mixture of Experts for Explainable Recommendation ☆16 Updated 5 months ago
- Diffusion Models for Generative Outfit Recommendation ☆26 Updated 8 months ago
- (ICLR2025 Spotlight) DEEM: Official implementation of Diffusion models serve as the eyes of large language models for image perception. ☆34 Updated 2 months ago
- ☆24 Updated last year
- This repository will continuously update the latest papers, technical reports, and benchmarks on multimodal reasoning! ☆37 Updated last month
- The official implementation of InvRL ☆13 Updated 2 years ago
- Official implementation of GUI-R1: A Generalist R1-Style Vision-Language Action Model For GUI Agents ☆72 Updated last week
- ☆36 Updated 5 months ago
- IISAN: Efficiently Adapting Multimodal Representation for Sequential Recommendation with Decoupled PEFT ☆31 Updated 5 months ago
- Official code and data for the ACL 2024 Findings paper "An Empirical Study on Parameter-Efficient Fine-Tuning for MultiModal Large Language Models" ☆19 Updated 6 months ago
- [CVPR'25] Interleaved-Modal Chain-of-Thought ☆39 Updated 2 weeks ago
- ☆41 Updated last year
- ☆32 Updated 2 weeks ago
- ☆47 Updated 5 months ago
- [ICML 2025] Official implementation of paper 'Look Twice Before You Answer: Memory-Space Visual Retracing for Hallucination Mitigation in… ☆50 Updated this week
- V1: Toward Multimodal Reasoning by Designing Auxiliary Task ☆34 Updated 3 weeks ago
- MoCLE (First MLLM with MoE for instruction customization and generalization!) (https://arxiv.org/abs/2312.12379) ☆37 Updated last year
- ☆117 Updated 3 months ago
- ☆19 Updated 8 months ago
- ☆12 Updated 11 months ago