Now-Join-Us / OmniEvalKit
The code repository for "OmniEvalKit: A Modular, Lightweight Toolbox for Evaluating Large Language Model and its Omni-Extensions"
☆14Updated 2 months ago
Alternatives and similar repositories for OmniEvalKit
Users that are interested in OmniEvalKit are comparing it to the libraries listed below
Sorting:
- ZhiJian: A Unifying and Rapidly Deployable Toolbox for Pre-trained Model Reuse☆51Updated last year
- ☆47Updated 5 months ago
- [ICML 2024] Official implementation for "HALC: Object Hallucination Reduction via Adaptive Focal-Contrast Decoding"☆87Updated 5 months ago
- 关于LLM和Multimodal LLM的paper list☆38Updated last week
- OOD Generalization相关文章的阅读笔记☆31Updated 5 months ago
- Latest Advances on Modality Priors in Multimodal Large Language Models☆18Updated last week
- [ICML 2024 Oral] Official code repository for MLLM-as-a-Judge.☆67Updated 3 months ago
- [ICLR 2025] Mitigating Modality Prior-Induced Hallucinations in Multimodal Large Language Models via Deciphering Attention Causality☆27Updated last week
- Papers about Hallucination in Multi-Modal Large Language Models (MLLMs)☆89Updated 5 months ago
- CVPR 2023: Language in a Bottle: Language Model Guided Concept Bottlenecks for Interpretable Image Classification☆91Updated 11 months ago
- The code repository for "MOS: Model Surgery for Pre-Trained Model-Based Class-Incremental Learning"(AAAI25) in PyTorch.☆18Updated last month
- ☆17Updated 3 months ago
- ☆34Updated 2 months ago
- Instruction Tuning in Continual Learning paradigm☆47Updated 3 months ago
- up-to-date curated list of state-of-the-art Large vision language models hallucinations research work, papers & resources☆125Updated this week
- ☆53Updated 6 months ago
- [NeurIPS 2024] "Can Language Models Perform Robust Reasoning in Chain-of-thought Prompting with Noisy Rationales?"☆35Updated 4 months ago
- ☆76Updated last month
- [ICLR 2025] MLLM can see? Dynamic Correction Decoding for Hallucination Mitigation☆74Updated 5 months ago
- Code for Reducing Hallucinations in Vision-Language Models via Latent Space Steering☆55Updated 5 months ago
- Less is More: High-value Data Selection for Visual Instruction Tuning☆12Updated 3 months ago
- [ICML 2024] Offical code repo for ICML2024 paper "Candidate Pseudolabel Learning: Enhancing Vision-Language Models by Prompt Tuning with …☆26Updated 10 months ago
- This is the official repo for Debiasing Large Visual Language Models, including a Post-Hoc debias method and Visual Debias Decoding strat…☆78Updated 2 months ago
- 🔎Official code for our paper: "VL-Uncertainty: Detecting Hallucination in Large Vision-Language Model via Uncertainty Estimation".☆35Updated last month
- [NeurIPS 2023] "Diversified Outlier Exposure for Out-of-Distribution Detection via Informative Extrapolation"☆11Updated last year
- ☆24Updated 8 months ago
- Hierarchical Decomposition of Prompt-Based Continual Learning: Rethinking Obscured Sub-optimality (NeurIPS 2023, Spotlight)☆83Updated 6 months ago
- 🎉 The code repository for "Mitigating Visual Forgetting via Take-along Visual Conditioning for Multi-modal Long CoT Reasoning" in PyTorc…☆12Updated 3 weeks ago
- [NeurIPS 2023] Generalized Logit Adjustment☆37Updated last year
- Official github repo for SafeDialBench, a comprehensive multi-turn dialogue benchmark to evaluate LLMs' safety.☆31Updated this week