MedSafetyBench: Evaluating and Improving the Medical Safety of LLMs, NeurIPS 2024
☆48Dec 4, 2025Updated 6 months ago
Alternatives and similar repositories for med-safety-bench
Users that are interested in med-safety-bench are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆33Mar 7, 2026Updated 3 months ago
- Code for "Skill-based Chain-of-Thoughts for Domain-Adaptive Video Reasoning [EMNLP 2025 Findings]"☆18Aug 27, 2025Updated 9 months ago
- ☆18Feb 19, 2023Updated 3 years ago
- "How to Trust Your Diffusion Models: A Convex Optimization Approach to Conformal Risk Control"☆17Jan 6, 2026Updated 5 months ago
- The repository of the ACCV 2024 paper "FG-CXR: A Radiologist-Aligned Gaze Dataset for Enhancing Interpretability in Chest X-Ray Report Ge…☆11Jul 28, 2025Updated 10 months ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Automatic diagnosis of alzheimer☆11Apr 19, 2019Updated 7 years ago
- Code for paper [Explaining image classifiers by removing input features using generative models] [ACCV 2020] https://arxiv.org/abs/1910.0…☆15Nov 22, 2022Updated 3 years ago
- Official Code for "Large-scale Self-supervised Video Foundation Model for Intelligent Surgery"☆48Jun 4, 2025Updated last year
- RL Implementation☆19May 10, 2022Updated 4 years ago
- The Official Repo for Paper: Aligning Clinical Needs and AI Capabilities: A Survey on LLMs for Medical Reasoning☆23Apr 7, 2026Updated 2 months ago
- PMC-VQA is a large-scale medical visual question-answering dataset, which contains 227k VQA pairs of 149k images that cover various modal…☆233Dec 6, 2024Updated last year
- ☆12Jan 10, 2023Updated 3 years ago
- A collection of ETLs from common data formats to Medical Event Data Standard☆40Aug 5, 2025Updated 10 months ago
- Data for the NeurIPS 2021 paper [The effectiveness of feature attribution methods and its correlation with automatic evaluation scores] …☆18Jan 17, 2023Updated 3 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- DG-TTA☆14Apr 3, 2025Updated last year
- Radiology Language Evaluations☆11Nov 17, 2023Updated 2 years ago
- ☆19Jul 13, 2025Updated 10 months ago
- ☆13Aug 19, 2024Updated last year
- ☆29Aug 3, 2021Updated 4 years ago
- IJCAI-24 Tutorial on Counterfactual Explanations: https://sites.google.com/view/tut-counterfactuals-ijcai24/☆12Aug 5, 2024Updated last year
- DiReCT: Diagnostic Reasoning for Clinical Notes via Large Language Models (NeurIPS 2024 D&B Track)☆24Mar 6, 2025Updated last year
- ☆14Dec 30, 2021Updated 4 years ago
- MCP server integrating GEPA (Genetic-Evolutionary Prompt Architecture) for automatic prompt optimization with Claude Desktop☆48Nov 10, 2025Updated 7 months ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Official Codes for "Publicly Shareable Clinical Large Language Model Built on Synthetic Clinical Notes"☆119Aug 22, 2024Updated last year
- Safety-J: Evaluating Safety with Critique☆16Jul 28, 2024Updated last year
- ☆14Mar 8, 2025Updated last year
- [ICML'2024] Can AI Assistants Know What They Don't Know?☆86Feb 5, 2024Updated 2 years ago
- TransEdge: Translating Relation-contextualized Embeddings for Knowledge Graphs, ISWC 2019☆28Dec 21, 2019Updated 6 years ago
- Implementation of Concept-level Debugging of Part-Prototype Networks☆12May 9, 2023Updated 3 years ago
- Dataset for medical question summarization introduced in the ACL 2019 paper "On the Summarization of Consumer Health Questions" (A. Ben A…☆33May 13, 2026Updated 3 weeks ago
- FunnyBirds: A Synthetic Vision Dataset for a Part-Based Analysis of Explainable AI Methods (ICCV 2023)☆17Apr 8, 2024Updated 2 years ago
- ☆24Jan 11, 2025Updated last year
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Langchain_CrewAI_Gemini - An Gemini AI powered AI Agent (Multi-Agent) Project.☆14Mar 24, 2024Updated 2 years ago
- Verilog implementation of a simple riscv cpu☆19Oct 28, 2021Updated 4 years ago
- Neural Conversation Models☆10Jul 9, 2020Updated 5 years ago
- Mathematical consequences of orthogonal weights initialization and regularization in deep learning. Experiments with gain-adjusted orthog…☆17Sep 21, 2019Updated 6 years ago
- Large Language-and-Vision Assistant for BioMedicine, built towards multimodal GPT-4 level capabilities.☆10Nov 29, 2023Updated 2 years ago
- ☆11Aug 16, 2020Updated 5 years ago
- ☆11Mar 3, 2020Updated 6 years ago