AI4LIFE-GROUP / med-safety-bench
MedSafetyBench: Evaluating and Improving the Medical Safety of LLMs [NeurIPS 2024]
☆24 · Updated 6 months ago
Alternatives and similar repositories for med-safety-bench:
Users who are interested in med-safety-bench are comparing it to the libraries listed below.
- 🤫 Code and benchmark for our ICLR 2024 spotlight paper: "Can LLMs Keep a Secret? Testing Privacy Implications of Language Models via Con… ☆42 · Updated last year
- source code for NeurIPS'24 paper "HaloScope: Harnessing Unlabeled LLM Generations for Hallucination Detection" ☆40 · Updated 2 weeks ago
- Code for Fine-grained Uncertainty Quantification for LLMs from Semantic Similarities (NeurIPS'24) ☆21 · Updated 4 months ago
- [NeurIPS 2023] Official repository for "Distilling Out-of-Distribution Robustness from Vision-Language Foundation Models" ☆12 · Updated 10 months ago
- Official repository of "Distort, Distract, Decode: Instruction-Tuned Model Can Refine its Response from Noisy Instructions", ICLR 2024 Sp… ☆20 · Updated last year
- ☆4 · Updated 2 months ago
- ☆28 · Updated last month
- EHRXQA: A Multi-Modal Question Answering Dataset for Electronic Health Records with Chest X-ray Images, NeurIPS 2023 D&B ☆76 · Updated 9 months ago
- Source code and data for ADEPT: A DEbiasing PrompT Framework (AAAI-23). ☆14 · Updated 4 months ago
- Discover and Cure: Concept-aware Mitigation of Spurious Correlation (ICML 2023) ☆41 · Updated last year
- [ACL 2024 Findings] This is the code for our paper "Knowledge-Infused Prompting: Assessing and Advancing Clinical Text Data Generation wi… ☆38 · Updated 10 months ago
- SafeChain: Safety of Language Models with Long Chain-of-Thought Reasoning Capabilities ☆12 · Updated 3 weeks ago
- ☆29 · Updated 11 months ago
- ☆28 · Updated last year
- Source code for EMNLP2022 paper "Finding Skill Neurons in Pre-trained Transformers via Prompt Tuning". ☆18 · Updated 2 years ago
- EMNLP 2024: Model Editing Harms General Abilities of Large Language Models: Regularization to the Rescue ☆35 · Updated 5 months ago
- ☆27 · Updated last month
- ☆19 · Updated 8 months ago
- [ACL 2024] Code and data for "Machine Unlearning of Pre-trained Large Language Models" ☆58 · Updated 6 months ago
- Official Code Repository for the paper "Knowledge-Augmented Reasoning Distillation for Small Language Models in Knowledge-intensive Tasks… ☆37 · Updated 5 months ago
- ☆35 · Updated 6 months ago
- code for EMNLP 2024 paper: How do Large Language Models Learn In-Context? Query and Key Matrices of In-Context Heads are Two Towers for M… ☆11 · Updated 5 months ago
- This is the repository for "Model Merging by Uncertainty-Based Gradient Matching", ICLR 2024. ☆27 · Updated 11 months ago
- ☆42 · Updated 2 months ago
- [ACL 2023] Knowledge Unlearning for Mitigating Privacy Risks in Language Models ☆80 · Updated 7 months ago
- Representation Surgery for Multi-Task Model Merging. ICML, 2024. ☆44 · Updated 6 months ago
- Official code for ICML 2024 paper on Persona In-Context Learning (PICLe) ☆23 · Updated 9 months ago
- ☆40 · Updated last year
- Official code repository for Correct-N-Contrast ☆21 · Updated 2 years ago
- Public code repo for COLING 2025 paper "Aligning LLMs with Individual Preferences via Interaction" ☆26 · Updated 3 weeks ago