Xiaohao-Liu / ModalBedLinks
Towards Modality Generalization: A Benchmark and Prospective Analysis
☆26Updated 5 months ago
Alternatives and similar repositories for ModalBed
Users that are interested in ModalBed are comparing it to the libraries listed below
Sorting:
- A curated list of papers, tools, and resources on Multi-Token Prediction (MTP) and related techniques in Large Language Models (LLMs), Sp…☆23Updated last week
- Regularly Truncated M-estimators for Learning with Noisy Labels☆11Updated last year
- [ICLR 2024] Towards Unified Multi-Modal Personalization: Large Vision-Language Models for Generative Recommendation and Beyond☆20Updated last year
- Official Code of our AAAI-24 Paper: "Generative Multi-modal Knowledge Retrieval with Large Language Models".☆29Updated last month
- [ICML 2025] Official code of "DAMA: Data- and Model-aware Alignment of Multi-modal LLMs"☆13Updated 5 months ago
- code for Learning the Unlearned: Mitigating Feature Suppression in Contrastive Learning☆18Updated last year
- The implementation of paper "EliMRec: Eliminating single-modal bias in multimedia recommendation", MM'22.☆23Updated last year
- [NeurIPS 2024] The implementation of paper "On Softmax Direct Preference Optimization for Recommendation"☆84Updated 11 months ago
- [CVPR' 25] Interleaved-Modal Chain-of-Thought☆90Updated last week
- MoCLE (First MLLM with MoE for instruction customization and generalization!) (https://arxiv.org/abs/2312.12379)☆44Updated 4 months ago
- Code for Reducing Hallucinations in Vision-Language Models via Latent Space Steering☆85Updated 11 months ago
- ☆144Updated 8 months ago
- [ICLR 2025] Data-Augmented Phrase-Level Alignment for Mitigating Object Hallucination☆16Updated 9 months ago
- ☆51Updated 11 months ago
- V1: Toward Multimodal Reasoning by Designing Auxiliary Task☆36Updated 6 months ago
- AdaMoLE: Adaptive Mixture of LoRA Experts☆37Updated last year
- (ICLR 2025 Spotlight) DEEM: Official implementation of Diffusion models serve as the eyes of large language models for image perception.☆42Updated 4 months ago
- [SIGIR 2024] - Simple but Effective Raw-Data Level Multimodal Fusion for Composed Image Retrieval☆43Updated last year
- Imagine While Reasoning in Space: Multimodal Visualization-of-Thought (ICML 2025)☆57Updated 6 months ago
- [CVPR25 Highlight] A ChatGPT-Prompted Visual hallucination Evaluation Dataset, featuring over 100,000 data samples and four advanced eval…☆26Updated 6 months ago
- Video Chain of Thought, Codes for ICML 2024 paper: "Video-of-Thought: Step-by-Step Video Reasoning from Perception to Cognition"☆169Updated 8 months ago
- [EMNLP 2024] mDPO: Conditional Preference Optimization for Multimodal Large Language Models.☆83Updated 11 months ago
- A Task of Fictitious Unlearning for VLMs☆23Updated 6 months ago
- ☆98Updated last month
- Official Code and data for ACL 2024 finding, "An Empirical Study on Parameter-Efficient Fine-Tuning for MultiModal Large Language Models"☆23Updated 11 months ago
- ☆29Updated 2 years ago
- OOD Generalization相关文章的阅读笔记☆32Updated 10 months ago
- ☆129Updated 5 months ago
- [ICML 2025] Official implementation of paper 'Look Twice Before You Answer: Memory-Space Visual Retracing for Hallucination Mitigation in…☆163Updated last month
- ✨A curated list of papers on the uncertainty in multi-modal large language model (MLLM).☆54Updated 7 months ago