Xiaohao-Liu / ModalBedLinks
Towards Modality Generalization: A Benchmark and Prospective Analysis
☆28Updated 7 months ago
Alternatives and similar repositories for ModalBed
Users that are interested in ModalBed are comparing it to the libraries listed below
Sorting:
- A curated list of papers, tools, and resources on Multi-Token Prediction (MTP) and related techniques in Large Language Models (LLMs), Sp…☆29Updated 2 months ago
- Official Code of our AAAI-24 Paper: "Generative Multi-modal Knowledge Retrieval with Large Language Models".☆29Updated 3 months ago
- [ICLR 2024] Towards Unified Multi-Modal Personalization: Large Vision-Language Models for Generative Recommendation and Beyond☆21Updated last year
- [CVPR' 25] Interleaved-Modal Chain-of-Thought☆103Updated last week
- [ICML 2025] Official code of "DAMA: Data- and Model-aware Alignment of Multi-modal LLMs"☆15Updated 7 months ago
- ☆153Updated 7 months ago
- ☆43Updated 9 months ago
- [NeurIPS 2024] The implementation of paper "On Softmax Direct Preference Optimization for Recommendation"☆94Updated last year
- The latest progress of Personalized Large Language Models (LLMs).☆33Updated 2 months ago
- ☆55Updated last year
- ACL'2025: SoftCoT: Soft Chain-of-Thought for Efficient Reasoning with LLMs. and preprint: SoftCoT++: Test-Time Scaling with Soft Chain-of…☆73Updated 7 months ago
- Quantile Advantage Estimation for Entropy-Safe Reasoning☆23Updated 2 months ago
- [ICML 2025] Official implementation of paper 'Look Twice Before You Answer: Memory-Space Visual Retracing for Hallucination Mitigation in…☆180Updated 3 months ago
- Official code for our paper "Model Composition for Multimodal Large Language Models" (ACL 2024)☆31Updated 11 months ago
- Code for Reducing Hallucinations in Vision-Language Models via Latent Space Steering☆99Updated last year
- ☆129Updated last month
- [ICLR 2025] Mitigating Modality Prior-Induced Hallucinations in Multimodal Large Language Models via Deciphering Attention Causality☆60Updated 6 months ago
- The implementation of paper "EliMRec: Eliminating single-modal bias in multimedia recommendation", MM'22.☆22Updated 2 years ago
- Less is More: Mitigating Multimodal Hallucination from an EOS Decision Perspective (ACL 2024)☆57Updated last year
- ☆154Updated 10 months ago
- [SIGIR 2024] - Simple but Effective Raw-Data Level Multimodal Fusion for Composed Image Retrieval☆44Updated last year
- [ICLR 2025] Data-Augmented Phrase-Level Alignment for Mitigating Object Hallucination☆17Updated 11 months ago
- [ICML 2024] Official implementation for "HALC: Object Hallucination Reduction via Adaptive Focal-Contrast Decoding"☆107Updated last year
- ☆27Updated last year
- Survey on Data-centric Large Language Models☆88Updated last year
- Imagine While Reasoning in Space: Multimodal Visualization-of-Thought (ICML 2025)☆61Updated 8 months ago
- (ICLR 2025 Spotlight) DEEM: Official implementation of Diffusion models serve as the eyes of large language models for image perception.☆44Updated 6 months ago
- This repository will continuously update the latest papers, technical reports, benchmarks about multimodal reasoning!☆53Updated 9 months ago
- [NeurIPS 2025] More Thinking, Less Seeing? Assessing Amplified Hallucination in Multimodal Reasoning Models☆73Updated 7 months ago
- [ICLR 2025] MLLM can see? Dynamic Correction Decoding for Hallucination Mitigation☆127Updated 3 months ago