Xiaohao-Liu / ModalBed
Towards Modality Generalization: A Benchmark and Prospective Analysis
☆28Updated 6 months ago
Alternatives and similar repositories for ModalBed
Users who are interested in ModalBed are comparing it to the libraries listed below
- A curated list of papers, tools, and resources on Multi-Token Prediction (MTP) and related techniques in Large Language Models (LLMs), Sp…☆28Updated last month
- Official Code of our AAAI-24 Paper: "Generative Multi-modal Knowledge Retrieval with Large Language Models".☆29Updated 2 months ago
- The latest progress of Personalized Large Language Models (LLMs).☆32Updated last month
- [NeurIPS 2024] The implementation of paper "On Softmax Direct Preference Optimization for Recommendation"☆88Updated last year
- [ICLR 2024] Towards Unified Multi-Modal Personalization: Large Vision-Language Models for Generative Recommendation and Beyond☆20Updated last year
- [ICML 2025] Official code of "DAMA: Data- and Model-aware Alignment of Multi-modal LLMs"☆15Updated 6 months ago
- Official codebase for the paper Latent Visual Reasoning☆54Updated last month
- [ICML 2025] Official implementation of paper 'Look Twice Before You Answer: Memory-Space Visual Retracing for Hallucination Mitigation in…☆175Updated 2 months ago
- [ICLR 2025] Data-Augmented Phrase-Level Alignment for Mitigating Object Hallucination☆17Updated 10 months ago
- (ICLR 2025 Spotlight) DEEM: Official implementation of Diffusion models serve as the eyes of large language models for image perception.☆44Updated 5 months ago
- A paper list of Awesome Latent Space.☆190Updated this week
- [SIGIR 2024] - Simple but Effective Raw-Data Level Multimodal Fusion for Composed Image Retrieval☆43Updated last year
- Official code and data for the ACL 2024 Findings paper "An Empirical Study on Parameter-Efficient Fine-Tuning for MultiModal Large Language Models"☆24Updated last year
- The implementation of paper "EliMRec: Eliminating single-modal bias in multimedia recommendation", MM'22.☆23Updated 2 years ago
- Code for Reducing Hallucinations in Vision-Language Models via Latent Space Steering☆94Updated last year
- This repository will continuously update the latest papers, technical reports, benchmarks about multimodal reasoning!☆53Updated 8 months ago
- Regularly Truncated M-estimators for Learning with Noisy Labels☆11Updated last year
- Official implementation of GUI-R1 : A Generalist R1-Style Vision-Language Action Model For GUI Agents☆206Updated 7 months ago
- [CVPR' 25] Interleaved-Modal Chain-of-Thought☆97Updated last week
- Reading notes on papers related to OOD Generalization☆34Updated last year
- code for GaVaMoE: Gaussian-Variational Gated Mixture of Experts for Explainable Recommendation☆18Updated last year
- ACL'2025: SoftCoT: Soft Chain-of-Thought for Efficient Reasoning with LLMs. and preprint: SoftCoT++: Test-Time Scaling with Soft Chain-of…☆70Updated 6 months ago
- [NeurIPS 2025 Spotlight] Think or Not Think: A Study of Explicit Thinking in Rule-Based Visual Reinforcement Fine-Tuning☆76Updated 2 months ago
- [EMNLP 2024] mDPO: Conditional Preference Optimization for Multimodal Large Language Models.☆83Updated last year
- [ICML 2024] Official implementation for "HALC: Object Hallucination Reduction via Adaptive Focal-Contrast Decoding"☆103Updated last year