lwaekfjlk / mmoe
MMoE: Multimodal Mixture-of-Experts (EMNLP 2024)
☆11 Updated 5 months ago
Alternatives and similar repositories for mmoe:
Users who are interested in mmoe are comparing it to the repositories listed below.
- In-Context Sharpness as Alerts: An Inner Representation Perspective for Hallucination Mitigation (ICML 2024) ☆57 Updated last year
- This repository contains a regularly updated paper list for LLMs-reasoning-in-latent-space. ☆70 Updated this week
- FeatureAlignment = Alignment + Mechanistic Interpretability ☆28 Updated last month
- ☆34 Updated last month
- A Survey on the Honesty of Large Language Models ☆57 Updated 4 months ago
- ☆51 Updated last week
- ☆89 Updated this week
- Official implementation of "MMNeuron: Discovering Neuron-Level Domain-Specific Interpretation in Multimodal Large Language Model". Our co… ☆16 Updated 4 months ago
- [ICLR 2025] Code and Data Repo for Paper "Latent Space Chain-of-Embedding Enables Output-free LLM Self-Evaluation" ☆41 Updated 4 months ago
- ☆31 Updated 6 months ago
- Implementation for the research paper "Enhancing LLM Reasoning via Critique Models with Test-Time and Training-Time Supervision". ☆52 Updated 4 months ago
- Code associated with Tuning Language Models by Proxy (Liu et al., 2024) ☆108 Updated last year
- [ICML 2024] Unveiling and Harnessing Hidden Attention Sinks: Enhancing Large Language Models without Training through Attention Calibrati… ☆36 Updated 9 months ago
- ☆44 Updated 10 months ago
- ☆71 Updated 10 months ago
- MoCLE (First MLLM with MoE for instruction customization and generalization!) (https://arxiv.org/abs/2312.12379) ☆35 Updated last year
- Generative AI Act II: Test Time Scaling Drives Cognition Engineering ☆27 Updated last week
- The official code repository for PRMBench. ☆72 Updated 2 months ago
- [CVPR 2025] Interleaved-Modal Chain-of-Thought ☆12 Updated 3 weeks ago
- [EMNLP 2024] mDPO: Conditional Preference Optimization for Multimodal Large Language Models. ☆71 Updated 5 months ago
- The official repository of "Whoever Started the Interference Should End It: Guiding Data-Free Model Merging via Task Vectors" ☆14 Updated last month
- ☆44 Updated 5 months ago
- ☆24 Updated 2 years ago
- Code and data for "ConflictBank: A Benchmark for Evaluating the Influence of Knowledge Conflicts in LLM" (NeurIPS 2024 Track Datasets and… ☆41 Updated 5 months ago
- A versatile toolkit for applying Logit Lens to modern large language models (LLMs). Currently supports Llama-3.1-8B and Qwen-2.5-7B, enab… ☆75 Updated 2 months ago
- Model merging is a highly efficient approach for long-to-short reasoning. ☆40 Updated 3 weeks ago
- An LLM-free Multi-dimensional Benchmark for Multi-modal Hallucination Evaluation ☆116 Updated last year
- 😎 curated list of awesome LMM hallucinations papers, methods & resources. ☆150 Updated last year
- CHAIR metric is a rule-based metric for evaluating object hallucination in caption generation. ☆28 Updated last year
- [EMNLP 2024] "ESC-Eval: Evaluating Emotion Support Conversations in Large Language Models" ☆18 Updated 9 months ago