drivetosouth / SafeDialBench-DatasetLinks
Official github repo for SafeDialBench, a comprehensive multi-turn dialogue benchmark to evaluate LLMs' safety.
☆37Updated 6 months ago
Alternatives and similar repositories for SafeDialBench-Dataset
Users that are interested in SafeDialBench-Dataset are comparing it to the libraries listed below
Sorting:
- A Framework of Continual Learning☆124Updated 3 months ago
- [CVPR 2025 (Oral)] Mitigating Hallucinations in Large Vision-Language Models via DPO: On-Policy Data Hold the Key☆86Updated last month
- ☆20Updated 3 months ago
- 关于LLM和Multimodal LLM的paper list☆50Updated last month
- A curated list of awesome papers on dataset reduction, including dataset distillation (dataset condensation) and dataset pruning (coreset…☆58Updated 10 months ago
- ☆149Updated 9 months ago
- Papers about Hallucination in Multi-Modal Large Language Models (MLLMs)☆98Updated last year
- Instruction Tuning in Continual Learning paradigm☆65Updated 9 months ago
- [NeurIPS 2023] Generalized Logit Adjustment☆39Updated last year
- ☆53Updated 11 months ago
- ZhiJian: A Unifying and Rapidly Deployable Toolbox for Pre-trained Model Reuse☆48Updated 2 years ago
- up-to-date curated list of state-of-the-art Large vision language models hallucinations research work, papers & resources☆208Updated last month
- A curated collection of resources focused on the Mechanistic Interpretability (MI) of Large Multimodal Models (LMMs). This repository agg…☆160Updated last month
- ☆13Updated 2 years ago
- ☆118Updated 2 years ago
- ☆284Updated 4 months ago
- Official Repository of "Learning what reinforcement learning can't"☆69Updated 2 months ago
- [ICLR 2025] MLLM can see? Dynamic Correction Decoding for Hallucination Mitigation☆118Updated 2 months ago
- [Neurips'24 Spotlight] Visual CoT: Advancing Multi-Modal Language Models with a Comprehensive Dataset and Benchmark for Chain-of-Thought …☆396Updated 10 months ago
- [ICML 2025 Oral] The official repository for the paper "Can MLLMs Reason in Multimodality? EMMA: An Enhanced MultiModal ReAsoning Benchma…☆69Updated 4 months ago
- Test-time Prompt Tuning (TPT) for zero-shot generalization in vision-language models (NeurIPS 2022))☆199Updated 3 years ago
- ☆20Updated 6 months ago
- ☆109Updated 2 months ago
- [ICLR 2025] Mitigating Modality Prior-Induced Hallucinations in Multimodal Large Language Models via Deciphering Attention Causality☆54Updated 4 months ago
- Paper List of Inference/Test Time Scaling/Computing☆322Updated 2 months ago
- The official GitHub page for ''Evaluating Object Hallucination in Large Vision-Language Models''☆226Updated 3 months ago
- MAT: Multi-modal Agent Tuning 🔥 ICLR 2025 (Spotlight)☆72Updated 5 months ago
- The code repository for "OmniEvalKit: A Modular, Lightweight Toolbox for Evaluating Large Language Model and its Omni-Extensions"☆14Updated 9 months ago
- ☆81Updated last year
- [ACM Multimedia 2025] This is the official repo for Debiasing Large Visual Language Models, including a Post-Hoc debias method and Visual…☆82Updated 8 months ago