Xiaohao-Liu / ModalBed
Towards Modality Generalization: A Benchmark and Prospective Analysis
☆26 · Updated 4 months ago
Alternatives and similar repositories for ModalBed
Users who are interested in ModalBed are comparing it to the repositories listed below.
- A curated list of papers, tools, and resources on Multi-Token Prediction (MTP) and related techniques in Large Language Models (LLMs), Sp… ☆22 · Updated 3 weeks ago
- [ICLR 2024] Towards Unified Multi-Modal Personalization: Large Vision-Language Models for Generative Recommendation and Beyond ☆20 · Updated last year
- Official Code of our AAAI-24 Paper: "Generative Multi-modal Knowledge Retrieval with Large Language Models". ☆29 · Updated 3 weeks ago
- [ICML 2025] Official implementation of paper 'Look Twice Before You Answer: Memory-Space Visual Retracing for Hallucination Mitigation in… ☆157 · Updated 2 weeks ago
- ☆141 · Updated 8 months ago
- ☆50 · Updated 10 months ago
- ☆123 · Updated 4 months ago
- MoCLE (First MLLM with MoE for instruction customization and generalization!) (https://arxiv.org/abs/2312.12379) ☆44 · Updated 3 months ago
- Imagine While Reasoning in Space: Multimodal Visualization-of-Thought (ICML 2025) ☆51 · Updated 6 months ago
- [ICLR 2025] MLLM can see? Dynamic Correction Decoding for Hallucination Mitigation ☆115 · Updated last month
- This repository will continuously update the latest papers, technical reports, benchmarks about multimodal reasoning! ☆52 · Updated 6 months ago
- Code for Reducing Hallucinations in Vision-Language Models via Latent Space Steering ☆82 · Updated 10 months ago
- ☆25 · Updated last year
- [SIGIR 2024] - Simple but Effective Raw-Data Level Multimodal Fusion for Composed Image Retrieval ☆43 · Updated last year
- [ICML 2024] Official implementation for "HALC: Object Hallucination Reduction via Adaptive Focal-Contrast Decoding" ☆99 · Updated 10 months ago
- [ICML 2025] Official code of "DAMA: Data- and Model-aware Alignment of Multi-modal LLMs" ☆13 · Updated 4 months ago
- ☆109 · Updated 2 weeks ago
- Official code and data for the ACL 2024 Findings paper "An Empirical Study on Parameter-Efficient Fine-Tuning for MultiModal Large Language Models" ☆22 · Updated 11 months ago
- [ICLR 2025] Mitigating Modality Prior-Induced Hallucinations in Multimodal Large Language Models via Deciphering Attention Causality ☆45 · Updated 3 months ago
- Official implementation of GUI-R1: A Generalist R1-Style Vision-Language Action Model For GUI Agents ☆190 · Updated 5 months ago
- [CVPR'25] Interleaved-Modal Chain-of-Thought ☆88 · Updated this week
- [ICLR 2025] Data-Augmented Phrase-Level Alignment for Mitigating Object Hallucination ☆16 · Updated 8 months ago
- [EMNLP 2024] mDPO: Conditional Preference Optimization for Multimodal Large Language Models. ☆81 · Updated 11 months ago
- [NeurIPS 2024] The implementation of paper "On Softmax Direct Preference Optimization for Recommendation" ☆84 · Updated 10 months ago
- [ICLR 2024] Analyzing and Mitigating Object Hallucination in Large Vision-Language Models ☆149 · Updated last year
- Interleaving Reasoning: Next-Generation Reasoning Systems for AGI ☆182 · Updated last month
- Code for our ICML'24 paper on multimodal dataset distillation ☆40 · Updated last year
- [NeurIPS 2025] More Thinking, Less Seeing? Assessing Amplified Hallucination in Multimodal Reasoning Models ☆60 · Updated 4 months ago
- Efficient Multimodal Foundation Model Adaptation for Recommendation ☆39 · Updated 3 weeks ago
- Less is More: Mitigating Multimodal Hallucination from an EOS Decision Perspective (ACL 2024) ☆55 · Updated 11 months ago