A curated list of explainability-related papers, articles, and resources focused on Large Language Models (LLMs). This repository aims to provide researchers, practitioners, and enthusiasts with insights into the explainability implications, challenges, and advancements surrounding these powerful models.
☆55 · Jun 25, 2025 · Updated 9 months ago
Alternatives and similar repositories for Awesome-LLM-Explainability
Users who are interested in Awesome-LLM-Explainability are comparing it to the libraries listed below.
- A set of functions for well-known Cumulative Distribution Function (CDF)-based distance measure ☆15 · Jan 5, 2024 · Updated 2 years ago
- ☆17 · Aug 17, 2021 · Updated 4 years ago
- The official implementation of the paper "Towards Safe Self-Distillation of Internet-Scale Text-to-Image Diffusion Models" (ICML 2023 Wor… ☆22 · Mar 19, 2024 · Updated 2 years ago
- [NeurIPS 23] Characterizing OOD Error via Optimal Transport ☆13 · Nov 19, 2023 · Updated 2 years ago
- Code for paper: Are Large Language Models Post Hoc Explainers? ☆34 · Jul 22, 2024 · Updated last year
- ☆34 · Nov 26, 2024 · Updated last year
- [EMNLP 2023] Poisoning Retrieval Corpora by Injecting Adversarial Passages https://arxiv.org/abs/2310.19156 ☆50 · Dec 14, 2023 · Updated 2 years ago
- Collection of scripts for preparation of datasets for semantic segmentation of UAV images ☆15 · Jun 21, 2022 · Updated 3 years ago
- Reliability_Multirotor_Drones ☆13 · Nov 9, 2024 · Updated last year
- Exploring techniques for estimating safety of machine learning classifiers ☆78 · Feb 21, 2025 · Updated last year
- This is the available code for the paper `evidential fully convolutional network for semantic segmentation (arXiv preprint arXiv:2103.135… ☆13 · Jun 1, 2022 · Updated 3 years ago
- [ICCV 2023] HybridAugment++: Unified Frequency Spectra Perturbations for Model Robustness ☆17 · Sep 28, 2023 · Updated 2 years ago
- Deep Learning & Information Bottleneck ☆64 · Jun 30, 2023 · Updated 2 years ago
- ☆59 · Jun 5, 2024 · Updated last year
- Formal Verification of Neural Feedback Loops (NFLs) ☆83 · Sep 12, 2024 · Updated last year
- Public code repo for paper "SaySelf: Teaching LLMs to Express Confidence with Self-Reflective Rationales" ☆113 · Sep 28, 2024 · Updated last year
- ☆14 · Mar 23, 2021 · Updated 5 years ago
- Parse Symcat (http://www.symcat.com) symptoms and conditions and generate valid Synthea (https://github.com/synthetichealth/synthea) modu… ☆16 · Jan 28, 2021 · Updated 5 years ago
- ☆130 · May 31, 2024 · Updated last year
- Automated Question-Answering Over Knowledge Graphs in O&M of Wind Turbines ☆13 · Aug 16, 2022 · Updated 3 years ago
- Code for the paper, From RAG to QA-RAG: Integrating Generative AI for Pharmaceutical Regulatory Compliance Process ☆24 · Sep 9, 2024 · Updated last year
- clock_plot provides a simple way to visualize timeseries data, mapping 24 hours onto the 360 degrees of a polar plot ☆15 · Apr 5, 2022 · Updated 4 years ago
- The MobSTr dataset provides artifacts that demonstrate Model-based Safety Assurance and Traceability for a safety-critical automotive sys… ☆10 · Mar 18, 2022 · Updated 4 years ago
- Self-supervised learning for wearables using the UK-Biobank (>700,000 person-days) ☆149 · Oct 24, 2024 · Updated last year
- [Arxiv] Aligning Modalities in Vision Large Language Models via Preference Fine-tuning ☆93 · Apr 30, 2024 · Updated last year
- Building a quick conversation-based search demo with langchain. ☆10 · Apr 2, 2024 · Updated 2 years ago
- [CVPR 2025] GUI-Xplore: Empowering Generalizable GUI Agents with One Exploration ☆20 · Mar 21, 2025 · Updated last year
- ☆187 · Jul 2, 2025 · Updated 9 months ago
- ☆24 · Sep 16, 2022 · Updated 3 years ago
- ☆10 · Feb 3, 2021 · Updated 5 years ago
- The code and datasets of our ACM MM 2024 paper "Hallu-PI: Evaluating Hallucination in Multi-modal Large Language Models within Perturbed … ☆11 · Sep 27, 2024 · Updated last year
- This repository provides a benchmark for prompt injection attacks and defenses in LLMs ☆426 · Oct 29, 2025 · Updated 5 months ago
- Instruction Following Eval ☆16 · Jan 16, 2025 · Updated last year
- Code for RELAX, a framework for explaining representations. ☆12 · Jan 7, 2024 · Updated 2 years ago
- Mixture of Experts from scratch ☆13 · Apr 12, 2024 · Updated 2 years ago
- Official code for ST-RoomNet ☆21 · Nov 24, 2023 · Updated 2 years ago
- Unveiling and Mitigating Bias in Mental Health Analysis with Large Language Models ☆12 · Jun 21, 2024 · Updated last year
- [USENIX Security 2025] PoisonedRAG: Knowledge Corruption Attacks to Retrieval-Augmented Generation of Large Language Models ☆254 · Jan 27, 2026 · Updated 2 months ago
- ☆27 · May 20, 2025 · Updated 10 months ago