drivetosouth / SafeDialBench-Dataset
Official GitHub repo for SafeDialBench, a comprehensive multi-turn dialogue benchmark for evaluating the safety of LLMs.
☆34Updated 3 months ago
Alternatives and similar repositories for SafeDialBench-Dataset
Users who are interested in SafeDialBench-Dataset are comparing it to the libraries listed below
- [CVPR 2025 (Oral)] Mitigating Hallucinations in Large Vision-Language Models via DPO: On-Policy Data Hold the Key☆75Updated 3 months ago
- A Framework of Continual Learning☆121Updated 3 weeks ago
- ☆17Updated 2 weeks ago
- Paper list on LLMs and Multimodal LLMs☆44Updated 2 weeks ago
- Papers about Hallucination in Multi-Modal Large Language Models (MLLMs)☆96Updated 9 months ago
- ☆138Updated 6 months ago
- A curated list of awesome papers on dataset reduction, including dataset distillation (dataset condensation) and dataset pruning (coreset…☆58Updated 7 months ago
- ☆49Updated 9 months ago
- Hierarchical Decomposition of Prompt-Based Continual Learning: Rethinking Obscured Sub-optimality (NeurIPS 2023, Spotlight)☆87Updated 9 months ago
- Interleaving Reasoning: Next-Generation Reasoning Systems for AGI☆138Updated 2 months ago
- [NeurIPS 2023] Generalized Logit Adjustment☆38Updated last year
- ☆13Updated 2 years ago
- ☆16Updated 3 months ago
- Instruction Tuning in Continual Learning paradigm☆58Updated 7 months ago
- [ICLR 2025] "Noisy Test-Time Adaptation in Vision-Language Models"☆16Updated 6 months ago
- Evaluate robustness of adaptation methods on large vision-language models☆19Updated 2 years ago
- [NeurIPS 2024] Repos for "Visualization-of-Thought" dataset, construction code and evaluation.☆32Updated 10 months ago
- This repository will continuously update the latest papers, technical reports, benchmarks about multimodal reasoning!☆48Updated 5 months ago
- Latest open-source "Thinking with images" (O3/O4-mini) papers, covering training-free, SFT-based, and RL-enhanced methods for "fine-grain…☆87Updated 2 weeks ago
- [NeurIPS2023] Exploring Diverse In-Context Configurations for Image Captioning☆41Updated 9 months ago
- The code repository for "OmniEvalKit: A Modular, Lightweight Toolbox for Evaluating Large Language Model and its Omni-Extensions"☆14Updated 6 months ago
- Preventing Zero-Shot Transfer Degradation in Continual Learning of Vision-Language Models☆97Updated last year
- official implementation of "Interpreting CLIP's Image Representation via Text-Based Decomposition"☆221Updated 3 months ago
- [ACM Multimedia 2025] This is the official repo for Debiasing Large Visual Language Models, including a Post-Hoc debias method and Visual…☆82Updated 6 months ago
- Survey on Data-centric Large Language Models☆84Updated last year
- IDEAL: Influence-Driven Selective Annotations Empower In-Context Learners in Large Language Models☆59Updated last year
- [ICLR 2025] MLLM can see? Dynamic Correction Decoding for Hallucination Mitigation☆102Updated last week
- ZhiJian: A Unifying and Rapidly Deployable Toolbox for Pre-trained Model Reuse☆50Updated 2 years ago
- HalluciDoctor: Mitigating Hallucinatory Toxicity in Visual Instruction Data (Accepted by CVPR 2024)☆48Updated last year
- Test-time Prompt Tuning (TPT) for zero-shot generalization in vision-language models (NeurIPS 2022)☆194Updated 2 years ago