lwaekfjlk / mmoe
MMoE: Multimodal Mixture-of-Experts (EMNLP 2024)
☆11 Updated 5 months ago
Alternatives and similar repositories for mmoe:
Users who are interested in mmoe are comparing it to the repositories listed below.
- FeatureAlignment = Alignment + Mechanistic Interpretability ☆28 Updated 2 months ago
- ☆34 Updated 2 months ago
- In-Context Sharpness as Alerts: An Inner Representation Perspective for Hallucination Mitigation (ICML 2024) ☆57 Updated last year
- ☆59 Updated 3 weeks ago
- [ICLR 25 Oral] RM-Bench: Benchmarking Reward Models of Language Models with Subtlety and Style ☆36 Updated last month
- Code associated with Tuning Language Models by Proxy (Liu et al., 2024) ☆109 Updated last year
- This repository contains a regularly updated paper list for LLMs-reasoning-in-latent-space. ☆79 Updated this week
- ☆55 Updated 6 months ago
- Latest Advances on Modality Priors in Multimodal Large Language Models ☆17 Updated this week
- The reinforcement learning codes for dataset SPA-VL ☆32 Updated 10 months ago
- A Survey on the Honesty of Large Language Models ☆57 Updated 5 months ago
- Code for Reducing Hallucinations in Vision-Language Models via Latent Space Steering ☆53 Updated 5 months ago
- [ICLR 2025] MLLM can see? Dynamic Correction Decoding for Hallucination Mitigation ☆71 Updated 5 months ago
- ☆17 Updated last year
- [EMNLP 2024] "ESC-Eval: Evaluating Emotion Support Conversations in Large Language Models" ☆19 Updated 10 months ago
- Implementation for the research paper "Enhancing LLM Reasoning via Critique Models with Test-Time and Training-Time Supervision". ☆52 Updated 5 months ago
- Model merging is a highly efficient approach for long-to-short reasoning. ☆46 Updated last month
- [ACL 2024 (Oral)] A Prospector of Long-Dependency Data for Large Language Models ☆55 Updated 9 months ago
- [ICLR'25] DataGen: Unified Synthetic Dataset Generation via Large Language Models ☆46 Updated 2 months ago
- ☆73 Updated 11 months ago
- A versatile toolkit for applying Logit Lens to modern large language models (LLMs). Currently supports Llama-3.1-8B and Qwen-2.5-7B, enab… ☆79 Updated 2 months ago
- MoCLE (First MLLM with MoE for instruction customization and generalization!) (https://arxiv.org/abs/2312.12379) ☆37 Updated last year
- ☆23 Updated 7 months ago
- The official code repository for PRMBench. ☆73 Updated 2 months ago
- ☆95 Updated 3 weeks ago
- ☆24 Updated 2 years ago
- [ICML 2025] M-STAR (Multimodal Self-Evolving TrAining for Reasoning) Project. Diving into Self-Evolving Training for Multimodal Reasoning ☆58 Updated 4 months ago
- A curated list of resources for activation engineering ☆69 Updated this week
- Language Imbalance Driven Rewarding for Multilingual Self-improving ☆17 Updated 6 months ago
- ☆32 Updated 7 months ago