PKU-YuanGroup / Hallucination-Attack
Attack to induce hallucinations in LLMs
☆99 Updated 4 months ago
Related projects:
- An Easy-to-use Hallucination Detection Framework for LLMs. ☆48 Updated 4 months ago
- Accepted by ECCV 2024 ☆59 Updated 2 months ago
- ☆143 Updated 9 months ago
- 😎 up-to-date & curated list of awesome Attacks on Large-Vision-Language-Models papers, methods & resources. ☆73 Updated this week
- [ICLR'24] RAIN: Your Language Models Can Align Themselves without Finetuning ☆79 Updated 3 months ago
- ☆28 Updated 7 months ago
- [ICML 2024 Oral] Official code repository for MLLM-as-a-Judge. ☆47 Updated last month
- [ECCV 2024] The official code for "AdaShield: Safeguarding Multimodal Large Language Models from Structure-based Attack via Adaptive Shi… ☆34 Updated 2 months ago
- ☆62 Updated 7 months ago
- ☆27 Updated 3 months ago
- 😎 up-to-date & curated list of awesome LMM hallucinations papers, methods & resources. ☆140 Updated 5 months ago
- A toolbox for benchmarking trustworthiness of multimodal large language models (MultiTrust) ☆88 Updated last week
- Accepted by IJCAI-24 Survey Track ☆117 Updated 3 weeks ago
- [ECCV 2024] Official PyTorch Implementation of "How Many Unicorns Are in This Image? A Safety Evaluation Benchmark for Vision LLMs" ☆61 Updated 9 months ago
- A curated list of LLM Interpretability related material - Tutorial, Library, Survey, Paper, Blog, etc. ☆109 Updated last week
- 【ACL 2024】 SALAD benchmark & MD-Judge ☆81 Updated this week
- ☆110 Updated last month
- [CVPR 2024 Highlight] Mitigating Object Hallucinations in Large Vision-Language Models through Visual Contrastive Decoding ☆177 Updated 2 months ago
- [arXiv:2311.03191] "DeepInception: Hypnotize Large Language Model to Be Jailbreaker" ☆109 Updated 7 months ago
- Jailbreaking Large Vision-language Models via Typographic Visual Prompts ☆76 Updated 4 months ago
- An LLM-free Multi-dimensional Benchmark for Multi-modal Hallucination Evaluation ☆85 Updated 8 months ago
- Official implementation of ICLR'24 paper, "Curiosity-driven Red Teaming for Large Language Models" (https://openreview.net/pdf?id=4KqkizX… ☆57 Updated 6 months ago
- Survey on Data-centric Large Language Models ☆58 Updated 2 months ago
- Repository for the Paper (AAAI 2024, Oral) --- Visual Adversarial Examples Jailbreak Large Language Models ☆156 Updated 4 months ago
- [CVPR'24] HallusionBench: You See What You Think? Or You Think What You See? An Image-Context Reasoning Benchmark Challenging for GPT-4V(… ☆220 Updated 6 months ago
- JailBreakV-28K: A comprehensive benchmark designed to evaluate the transferability of LLM jailbreak attacks to MLLMs, and further assess … ☆29 Updated 2 months ago
- ICLR2024 Paper. Showing properties of safety tuning and exaggerated safety. ☆61 Updated 4 months ago
- R-Judge: Benchmarking Safety Risk Awareness for LLM Agents ☆57 Updated last month
- The source code of the EMNLP 2023 main conference paper: Sparse Low-rank Adaptation of Pre-trained Language Models. ☆62 Updated 6 months ago
- [NAACL2024] Attacks, Defenses and Evaluations for LLM Conversation Safety: A Survey ☆65 Updated last month