reference implementation for "explanations can be manipulated and geometry is to blame"
☆37 · Jul 24, 2022 · Updated 3 years ago
Alternatives and similar repositories for adv_explanation_ref
Users that are interested in adv_explanation_ref are comparing it to the libraries listed below.
- Showing the relationship between ImageNet ID and labels and pytorch pre-trained model output ID and labels · ☆10 · Oct 11, 2020 · Updated 5 years ago
- Code-repository for the ICML 2020 paper Fairwashing explanations with off-manifold detergent · ☆12 · Dec 18, 2020 · Updated 5 years ago
- This repository provides a PyTorch implementation of "Fooling Neural Network Interpretations via Adversarial Model Manipulation". Our pap… · ☆23 · Dec 19, 2020 · Updated 5 years ago
- ☆14 · Dec 4, 2023 · Updated 2 years ago
- ☆114 · Nov 21, 2022 · Updated 3 years ago
- This repository contains the code for implementing Bidirectional Relevance scores for Digital Histopathology, which was used for the resu… · ☆16 · Mar 24, 2023 · Updated 3 years ago
- ☆52 · Aug 29, 2020 · Updated 5 years ago
- Implementation of "Robust Watermarking of Neural Network with Exponential Weighting" in TensorFlow. · ☆13 · Dec 2, 2020 · Updated 5 years ago
- Python implementation for evaluating explanations presented in "On the (In)fidelity and Sensitivity for Explanations" in NeurIPS 2019 for… · ☆25 · Feb 23, 2022 · Updated 4 years ago
- Code for the Paper 'On the Connection Between Adversarial Robustness and Saliency Map Interpretability' by C. Etmann, S. Lunz, P. Maass, … · ☆16 · May 9, 2019 · Updated 6 years ago
- 💡 Adversarial attacks on explanations and how to defend them · ☆334 · Nov 30, 2024 · Updated last year
- ViRelAy is a visualization tool for the analysis of data as generated by CoRelAy. · ☆31 · Updated this week
- PyTorch implementation of SmoothTaylor · ☆15 · Sep 5, 2021 · Updated 4 years ago
- Zennit is a high-level framework in Python using PyTorch for explaining/exploring neural networks using attribution methods like LRP. · ☆243 · Jan 30, 2026 · Updated 3 months ago
- A collection of different ways to implement accessing and modifying internal model activations for LLMs · ☆24 · Oct 18, 2024 · Updated last year
- ☆12 · May 26, 2022 · Updated 3 years ago
- Code for the paper "Adversarial Neural Pruning with Latent Vulnerability Suppression" · ☆15 · Nov 23, 2022 · Updated 3 years ago
- Imbalanced Gradients: A New Cause of Overestimated Adversarial Robustness. (MD attacks) · ☆11 · Aug 29, 2020 · Updated 5 years ago
- Pruning CNN using CNN with toy example · ☆23 · Jun 21, 2021 · Updated 4 years ago
- ☆18 · Jul 26, 2024 · Updated last year
- Framework-agnostic implementation for state-of-the-art saliency methods (XRAI, BlurIG, SmoothGrad, and more). · ☆994 · Mar 20, 2024 · Updated 2 years ago
- Code for Net2Vec: Quantifying and Explaining how Concepts are Encoded by Filters in Deep Neural Networks · ☆30 · Feb 8, 2018 · Updated 8 years ago
- Multi-dimensional analysis of orthogonal safety directions in LLM alignment · ☆22 · Mar 20, 2025 · Updated last year
- Official Code for Scaling Adversarial Training to Large Perturbation Bounds (ECCV-2022) · ☆11 · Nov 25, 2022 · Updated 3 years ago
- RAG Hallucination Detecting By LRP. · ☆11 · Mar 31, 2025 · Updated last year
- Code for paper "Poisoned classifiers are not only backdoored, they are fundamentally broken" · ☆26 · Jan 7, 2022 · Updated 4 years ago
- ☆26 · Oct 26, 2020 · Updated 5 years ago
- An eXplainable AI toolkit with Concept Relevance Propagation and Relevance Maximization · ☆142 · Jan 14, 2026 · Updated 3 months ago
- Code for Fong and Vedaldi 2017, "Interpretable Explanations of Black Boxes by Meaningful Perturbation" · ☆32 · Sep 25, 2019 · Updated 6 years ago
- code release for the paper "On Completeness-aware Concept-Based Explanations in Deep Neural Networks" · ☆54 · Mar 25, 2022 · Updated 4 years ago
- some generic (but hopefully still useful) recommendations on writing your thesis · ☆11 · Mar 24, 2023 · Updated 3 years ago
- Repo for the paper "Exploiting redundancy in large materials datasets for efficient machine learning with less data" · ☆11 · Sep 23, 2024 · Updated last year
- My code solutions to exercises of Bayesian Reasoning and Machine Learning · ☆19 · Sep 2, 2021 · Updated 4 years ago
- Explaining Image Classifiers by Counterfactual Generation · ☆28 · Apr 23, 2022 · Updated 4 years ago
- ☆14 · Sep 30, 2019 · Updated 6 years ago
- TensorFlow 2.0 + Keras guide by François Chollet for deep learning researchers. · ☆15 · Oct 6, 2019 · Updated 6 years ago
- Bayes-Adaptive Monte-Carlo Planning algorithm · ☆18 · Mar 5, 2013 · Updated 13 years ago
- ☆14 · Jun 12, 2023 · Updated 2 years ago
- Weight-Averaged Sharpness-Aware Minimization (NeurIPS 2022) · ☆28 · Jan 13, 2023 · Updated 3 years ago