drivetosouth / SafeDialBench-DatasetLinks
Official github repo for SafeDialBench, a comprehensive multi-turn dialogue benchmark to evaluate LLMs' safety.
☆35Updated 5 months ago
Alternatives and similar repositories for SafeDialBench-Dataset
Users that are interested in SafeDialBench-Dataset are comparing it to the libraries listed below
Sorting:
- [CVPR 2025 (Oral)] Mitigating Hallucinations in Large Vision-Language Models via DPO: On-Policy Data Hold the Key☆84Updated last month
- A Framework of Continual Learning☆124Updated 2 months ago
- ☆19Updated 2 months ago
- 关于LLM和Multimodal LLM的paper list☆49Updated last month
- Instruction Tuning in Continual Learning paradigm☆62Updated 8 months ago
- A curated list of awesome papers on dataset reduction, including dataset distillation (dataset condensation) and dataset pruning (coreset…☆58Updated 9 months ago
- ☆29Updated 2 years ago
- [NeurIPS 2023] Generalized Logit Adjustment☆39Updated last year
- ☆13Updated 2 years ago
- ☆144Updated 8 months ago
- ☆18Updated 5 months ago
- [ICLR 2025] "Noisy Test-Time Adaptation in Vision-Language Models"☆16Updated 8 months ago
- [ICML 2024] SimPro: A Simple Probabilistic Framework Towards Realistic Long-Tailed Semi-Supervised Learning☆30Updated last year
- IDEAL: Influence-Driven Selective Annotations Empower In-Context Learners in Large Language Models☆59Updated last year
- Papers about Hallucination in Multi-Modal Large Language Models (MLLMs)☆97Updated 11 months ago
- ☆51Updated 11 months ago
- [ACM Multimedia 2025] This is the official repo for Debiasing Large Visual Language Models, including a Post-Hoc debias method and Visual…☆82Updated 8 months ago
- Official implementation of ECCV 2024 paper: Take A Step Back: Rethinking the Two Stages in Visual Reasoning☆15Updated 4 months ago
- [ACL'25 Main] Official Implementation of HiDe-LLaVA: Hierarchical Decoupling for Continual Instruction Tuning of Multimodal Large Languag…☆37Updated last month
- Hierarchical Decomposition of Prompt-Based Continual Learning: Rethinking Obscured Sub-optimality (NeurIPS 2023, Spotlight)☆87Updated 11 months ago
- The code repository for "OmniEvalKit: A Modular, Lightweight Toolbox for Evaluating Large Language Model and its Omni-Extensions"☆14Updated 8 months ago
- [Neurips'24 Spotlight] Visual CoT: Advancing Multi-Modal Language Models with a Comprehensive Dataset and Benchmark for Chain-of-Thought …☆387Updated 10 months ago
- Latest open-source "Thinking with images" (O3/O4-mini) papers, covering training-free, SFT-based, and RL-enhanced methods for "fine-grain…☆95Updated 2 months ago
- [ICLR 2025] MLLM can see? Dynamic Correction Decoding for Hallucination Mitigation☆118Updated last month
- This is the first released survey paper on hallucinations of large vision-language models (LVLMs). To keep track of this field and contin…☆82Updated last year
- [ACL 2024] Logical Closed Loop: Uncovering Object Hallucinations in Large Vision-Language Models. Detect and mitigate object hallucinatio…☆24Updated 8 months ago
- Survey on Data-centric Large Language Models☆86Updated last year
- Evaluate robustness of adaptation methods on large vision-language models☆19Updated 2 years ago
- Code for our ICML'24 on multimodal dataset distillation☆40Updated last year
- This repository will continuously update the latest papers, technical reports, benchmarks about multimodal reasoning!☆54Updated 7 months ago