A curated list of explainability-related papers, articles, and resources focused on Large Language Models (LLMs). This repository aims to provide researchers, practitioners, and enthusiasts with insights into the explainability implications, challenges, and advancements surrounding these powerful models.
☆53Jun 25, 2025Updated 9 months ago
Alternatives and similar repositories for Awesome-LLM-Explainability
Users that are interested in Awesome-LLM-Explainability are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Experimental toolbox for quantum Shapley values.☆10Jan 2, 2024Updated 2 years ago
- A set of functions for well-known Cumulative Distribution Function (CDF)-based distance measure☆15Jan 5, 2024Updated 2 years ago
- Code for paper: Are Large Language Models Post Hoc Explainers?☆34Jul 22, 2024Updated last year
- ☆34Nov 26, 2024Updated last year
- [EMNLP 2023] Poisoning Retrieval Corpora by Injecting Adversarial Passages https://arxiv.org/abs/2310.19156☆49Dec 14, 2023Updated 2 years ago
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- The official implementation of Self-aware Object Detection [CVPR 2023]☆13Jun 30, 2023Updated 2 years ago
- This is the available code for the paper `evidential fully convolutional network for semantic segmentation (arXiv preprint arXiv:2103.135…☆14Jun 1, 2022Updated 3 years ago
- [ICCV 2023] HybridAugment++: Unified Frequency Spectra Perturbations for Model Robustness☆17Sep 28, 2023Updated 2 years ago
- Deep Learning & Information Bottleneck☆64Jun 30, 2023Updated 2 years ago
- ☆160Jan 15, 2024Updated 2 years ago
- ☆59Jun 5, 2024Updated last year
- [ICLR 2023, Spotlight] Indiscriminate Poisoning Attacks on Unsupervised Contrastive Learning☆31Dec 2, 2023Updated 2 years ago
- TFA project for indirect call analysis☆12Mar 13, 2025Updated last year
- Formal Verification of Neural Feedback Loops (NFLs)☆83Sep 12, 2024Updated last year
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- Public code repo for paper "SaySelf: Teaching LLMs to Express Confidence with Self-Reflective Rationales"☆113Sep 28, 2024Updated last year
- A depth-aware secure computation compiler☆17Jun 7, 2025Updated 9 months ago
- Run large AI models in TEE environment☆14Sep 10, 2024Updated last year
- 一个基于 LangGraph 和 LangSmith 构建的多智能体AI系统☆24Jun 5, 2025Updated 9 months ago
- Code for the website www.jailbreakchat.com☆119Aug 26, 2023Updated 2 years ago
- ☆14Mar 23, 2021Updated 5 years ago
- 一个可以快速开发的OA微服务系统,基于net6和AspNetCore开发,包含部门,岗位,用户,员工,角色权限,数据权限,事件总线,服务间调用,定时任务等功能,简单高效,能够用于快速开发和学习。☆17Jun 12, 2025Updated 9 months ago
- [ECCV 2020] Official code for "Comprehensive Image Captioning via Scene Graph Decomposition"☆99Aug 20, 2024Updated last year
- clock_plot provides a simple way to visualize timeseries data, mapping 24 hours onto the 360 degrees of a polar plot☆15Apr 5, 2022Updated 3 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Self-supervised learning for wearables using the UK-Biobank (>700,000 person-days)☆150Oct 24, 2024Updated last year
- sealos deck☆11Mar 30, 2024Updated last year
- Test-Time Memory Framework: Control Hallucinations in Foundation Models☆11Nov 4, 2025Updated 4 months ago
- Building a quick conversation-based search demo with langchain.☆10Apr 2, 2024Updated last year
- Code to the paper: The Geometry of Refusal in Large Language Models: Concept Cones and Representational Independence☆25Jul 31, 2025Updated 7 months ago
- [CVPR 2025] GUI-Xplore: Empowering Generalizable GUI Agents with One Exploration☆20Mar 21, 2025Updated last year
- Experiments from our work Uncertainty Quantification and Deep Ensemble☆10Nov 1, 2021Updated 4 years ago
- [ISSTA 2024] PatchFinder: A Two-Phase Approach to Security Patch Tracing for Disclosed Vulnerabilities in Open Source Software☆26Sep 13, 2025Updated 6 months ago
- ☆10Feb 3, 2021Updated 5 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- The code and datasets of our ACM MM 2024 paper "Hallu-PI: Evaluating Hallucination in Multi-modal Large Language Models within Perturbed …☆11Sep 27, 2024Updated last year
- An OpenAI-powered triage bot for a slack support channel designed to tag oncalls, prioritize issues, suggest solutions, and streamline co…☆12Jun 11, 2025Updated 9 months ago
- A cross-platform GPU monitor TUI with support for both Apple Silicon and NVIDIA GPUs.☆79Mar 5, 2026Updated 3 weeks ago
- This is the official repository of the following paper: "Achieving Fairness Through Channel Pruning for Dermatological Disease Diagnosis"…☆10Jan 4, 2025Updated last year
- ☆15Mar 13, 2023Updated 3 years ago
- Crashbench is a LLM benchmark to measure bug-finding and reporting capabilities of LLMs☆14Mar 8, 2026Updated 2 weeks ago
- Evaluating Knowledge Acquisition from Multi-Discipline Professional Videos☆69Sep 5, 2025Updated 6 months ago