LINs-lab / DynMoE
[Preprint] Dynamic Mixture of Experts: An Auto-Tuning Approach for Efficient Transformer Models
☆41Updated 3 weeks ago
Related projects: ⓘ
- ☆20Updated last month
- [ICML 2024 Oral] Official code repository for MLLM-as-a-Judge.☆47Updated last month
- Official implementation of "LOOK-M: Look-Once Optimization in KV Cache for Efficient Multimodal Long-Context Inference"☆61Updated 2 months ago
- LLaVA-PruMerge: Adaptive Token Reduction for Efficient Large Multimodal Models☆86Updated 4 months ago
- ☆20Updated 4 months ago
- ☆54Updated 2 months ago
- Survey on Data-centric Large Language Models☆58Updated 2 months ago
- code for "Strengthening Multimodal Large Language Model with Bootstrapped Preference Optimization"☆40Updated 3 weeks ago
- ☆23Updated 7 months ago
- An benchmark for evaluating the capabilities of large vision-language models (LVLMs)☆32Updated 10 months ago
- Enhancing Large Vision Language Models with Self-Training on Image Comprehension.☆51Updated 3 months ago
- [ICML 2024] Unveiling and Harnessing Hidden Attention Sinks: Enhancing Large Language Models without Training through Attention Calibrati…☆16Updated 2 months ago
- An Easy-to-use Hallucination Detection Framework for LLMs.☆48Updated 4 months ago
- [ICLR 2024 Spotlight] Code for the paper "Merge, Then Compress: Demystify Efficient SMoE with Hints from Its Routing Policy"☆63Updated 3 months ago
- Rotation and Permutation for Advanced Outlier Management and Efficient Quantization of LLMs☆24Updated last week
- ☆28Updated 7 months ago
- 📰 Must-read papers on KV Cache Compression (constantly updating 🤗).☆34Updated this week
- This repo contains the source code for: Model Tells You What to Discard: Adaptive KV Cache Compression for LLMs☆28Updated last month
- MLLM-Bench: Evaluating Multimodal LLMs with Per-sample Criteria☆49Updated last month
- This is the official repo for Debiasing Large Visual Language Models, including a Post-Hoc debias method and Visual Debias Decoding strat…☆66Updated 5 months ago
- HalluciDoctor: Mitigating Hallucinatory Toxicity in Visual Instruction Data (Accepted by CVPR 2024)☆39Updated 2 months ago
- MoCLE (First MLLM with MoE for instruction customization and generalization!) (https://arxiv.org/abs/2312.12379)☆28Updated 5 months ago
- [EMNLP'23] The official GitHub page for ''Evaluating Object Hallucination in Large Vision-Language Models''☆67Updated 5 months ago
- ICLR 2024, Towards Lossless Dataset Distillation via Difficulty-Aligned Trajectory Matching☆87Updated 3 months ago
- VideoHallucer, The first comprehensive benchmark for hallucination detection in large video-language models (LVLMs)☆21Updated 2 months ago
- This repo contains evaluation code for the paper "Are We on the Right Way for Evaluating Large Vision-Language Models"☆138Updated 5 months ago
- MATH-Vision dataset and code to measure Multimodal Mathematical Reasoning capabilities.☆53Updated 3 weeks ago
- Official repository of MMDU dataset☆61Updated last month
- HallE-Control: Controlling Object Hallucination in LMMs☆24Updated 5 months ago
- 😎 up-to-date & curated list of awesome LMM hallucinations papers, methods & resources.☆140Updated 5 months ago