drivetosouth / SafeDialBench-DatasetLinks
Official github repo for SafeDialBench, a comprehensive multi-turn dialogue benchmark to evaluate LLMs' safety.
☆38Updated 7 months ago
Alternatives and similar repositories for SafeDialBench-Dataset
Users that are interested in SafeDialBench-Dataset are comparing it to the libraries listed below
Sorting:
- A Framework of Continual Learning☆128Updated last month
- ☆20Updated 4 months ago
- [CVPR 2025 (Oral)] Mitigating Hallucinations in Large Vision-Language Models via DPO: On-Policy Data Hold the Key☆97Updated last month
- 关于LLM和Multimodal LLM的paper list☆52Updated last week
- [NeurIPS 2023] Generalized Logit Adjustment☆39Updated last year
- Instruction Tuning in Continual Learning paradigm☆68Updated 11 months ago
- A curated list of awesome papers on dataset reduction, including dataset distillation (dataset condensation) and dataset pruning (coreset…☆59Updated 11 months ago
- Papers about Hallucination in Multi-Modal Large Language Models (MLLMs)☆99Updated last year
- ☆154Updated 10 months ago
- Latest open-source "Thinking with images" (O3/O4-mini) papers, covering training-free, SFT-based, and RL-enhanced methods for "fine-grain…☆108Updated 4 months ago
- ☆55Updated last year
- A paper list of Awesome Latent Space.☆276Updated last week
- [ACL'25 Main] Official Implementation of HiDe-LLaVA: Hierarchical Decoupling for Continual Instruction Tuning of Multimodal Large Languag…☆46Updated 4 months ago
- Test-time Prompt Tuning (TPT) for zero-shot generalization in vision-language models (NeurIPS 2022))☆203Updated 3 years ago
- Survey on Data-centric Large Language Models☆88Updated last year
- [Neurips'24 Spotlight] Visual CoT: Advancing Multi-Modal Language Models with a Comprehensive Dataset and Benchmark for Chain-of-Thought …☆415Updated last year
- The code repository for "OmniEvalKit: A Modular, Lightweight Toolbox for Evaluating Large Language Model and its Omni-Extensions"☆14Updated 10 months ago
- Hierarchical Decomposition of Prompt-Based Continual Learning: Rethinking Obscured Sub-optimality (NeurIPS 2023, Spotlight)☆90Updated last year
- up-to-date curated list of state-of-the-art Large vision language models hallucinations research work, papers & resources☆241Updated 3 months ago
- ☆21Updated 7 months ago
- Interleaving Reasoning: Next-Generation Reasoning Systems for AGI☆234Updated 2 months ago
- ☆30Updated 2 years ago
- [ICLR 2025] MLLM can see? Dynamic Correction Decoding for Hallucination Mitigation☆127Updated 4 months ago
- ☆112Updated 4 months ago
- Open-source red teaming framework for MLLMs with 37+ attack methods☆148Updated this week
- [ACM Multimedia 2025] This is the official repo for Debiasing Large Visual Language Models, including a Post-Hoc debias method and Visual…☆82Updated 10 months ago
- Preventing Zero-Shot Transfer Degradation in Continual Learning of Vision-Language Models☆106Updated last year
- This is the first released survey paper on hallucinations of large vision-language models (LVLMs). To keep track of this field and contin…☆90Updated last year
- [ICLR 2025] "Noisy Test-Time Adaptation in Vision-Language Models"☆17Updated 10 months ago
- ☆150Updated last year