Xiaohao-Liu / ModalBed
Towards Modality Generalization: A Benchmark and Prospective Analysis
☆28 Updated 6 months ago
Alternatives and similar repositories for ModalBed
Users who are interested in ModalBed are comparing it to the libraries listed below
- A curated list of papers, tools, and resources on Multi-Token Prediction (MTP) and related techniques in Large Language Models (LLMs), Sp… ☆25 Updated 2 weeks ago
- Code for Reducing Hallucinations in Vision-Language Models via Latent Space Steering ☆88 Updated last year
- The latest progress of Personalized Large Language Models (LLMs). ☆29 Updated 3 weeks ago
- Official Code of our AAAI-24 Paper: "Generative Multi-modal Knowledge Retrieval with Large Language Models". ☆29 Updated 2 months ago
- [ICLR 2024] Towards Unified Multi-Modal Personalization: Large Vision-Language Models for Generative Recommendation and Beyond ☆20 Updated last year
- Official codebase for the paper Latent Visual Reasoning ☆37 Updated last month
- [ICML 2025] Official code of "DAMA: Data- and Model-aware Alignment of Multi-modal LLMs" ☆13 Updated 6 months ago
- [ICML 2025] Official implementation of paper 'Look Twice Before You Answer: Memory-Space Visual Retracing for Hallucination Mitigation in… ☆168 Updated last month
- [NeurIPS 2024] The implementation of paper "On Softmax Direct Preference Optimization for Recommendation" ☆88 Updated 11 months ago
- [CVPR' 25] Interleaved-Modal Chain-of-Thought ☆92 Updated 2 weeks ago
- [ICML 2024 Oral] Official code repository for MLLM-as-a-Judge. ☆86 Updated 9 months ago
- [SIGIR 2024] - Simple but Effective Raw-Data Level Multimodal Fusion for Composed Image Retrieval ☆43 Updated last year
- [NeurIPS 2025] More Thinking, Less Seeing? Assessing Amplified Hallucination in Multimodal Reasoning Models ☆69 Updated 5 months ago
- [ICLR 2025] MLLM can see? Dynamic Correction Decoding for Hallucination Mitigation ☆120 Updated 2 months ago
- Reading notes on papers related to OOD Generalization ☆33 Updated 11 months ago
- Survey on Data-centric Large Language Models ☆88 Updated last year
- MoCLE (First MLLM with MoE for instruction customization and generalization!) (https://arxiv.org/abs/2312.12379) ☆44 Updated 4 months ago
- [EMNLP 2024] mDPO: Conditional Preference Optimization for Multimodal Large Language Models. ☆83 Updated last year
- ACL'2025: SoftCoT: Soft Chain-of-Thought for Efficient Reasoning with LLMs. and preprint: SoftCoT++: Test-Time Scaling with Soft Chain-of… ☆61 Updated 5 months ago
- [ICML 2024] Official implementation for "HALC: Object Hallucination Reduction via Adaptive Focal-Contrast Decoding" ☆101 Updated 11 months ago
- This repository will continuously update the latest papers, technical reports, benchmarks about multimodal reasoning! ☆52 Updated 8 months ago
- [ICLR 2025] Data-Augmented Phrase-Level Alignment for Mitigating Object Hallucination ☆17 Updated 9 months ago
- AdaMoLE: Adaptive Mixture of LoRA Experts ☆37 Updated last year
- [NeurIPS 2025 Spotlight] Think or Not Think: A Study of Explicit Thinking in Rule-Based Visual Reinforcement Fine-Tuning ☆73 Updated 2 months ago
- (ICLR 2025 Spotlight) DEEM: Official implementation of Diffusion models serve as the eyes of large language models for image perception. ☆44 Updated 4 months ago
- Less is More: Mitigating Multimodal Hallucination from an EOS Decision Perspective (ACL 2024) ☆55 Updated last year