[ICLR 25] A novel framework for building intrinsically interpretable LLMs with human-understandable concepts to ensure safety, reliability, transparency, and trustworthiness.
☆32Feb 5, 2026Updated 2 months ago
Alternatives and similar repositories for CB-LLMs
Users that are interested in CB-LLMs are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- This is the official implementation of the Concept Discovery Models paper.☆15Aug 27, 2023Updated 2 years ago
- [EMNLP 25] An effective and interpretable weight-editing method for mitigating overly short reasoning in LLMs, and a mechanistic study un…☆18Dec 17, 2025Updated 4 months ago
- Concept-based generative models☆12Dec 13, 2024Updated last year
- ☆21May 20, 2025Updated 11 months ago
- Official PyTorch implementation of WPS from our paper: WPS-SAM: Towards Weakly-Supervised Part Segmentation with Foundation Models☆14Jun 12, 2025Updated 10 months ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- ☆32Nov 16, 2025Updated 5 months ago
- Discover the repository for "Advancing Generalizable Tumor Segmentation with Anomaly-Aware Open-Vocabulary Attention Maps and Frozen Foun…☆21Mar 22, 2025Updated last year
- [ICLR 23] A new framework to transform any neural networks into an interpretable concept-bottleneck-model (CBM) without needing labeled c…☆140Apr 7, 2026Updated 3 weeks ago
- This is the official repository for the paper titled "Towards Interpretable Radiology Report Generation via Concept Bottlenecks using a M…☆16Apr 29, 2025Updated last year
- Concept Bottleneck Models, ICML 2020☆251Feb 24, 2023Updated 3 years ago
- Official implementation for the paper "Controlled Sparsity via Constrained Optimization"☆12Aug 10, 2022Updated 3 years ago
- Official repo for ICML25 paper: DCBM: Data-Efficient Visual Concept Bottleneck Models☆30Sep 16, 2025Updated 7 months ago
- [NeurIPS 24] A new training and evaluation framework for learning interpretable deep vision models and benchmarking different interpretab…☆32Jun 5, 2025Updated 10 months ago
- ☆10Jun 10, 2024Updated last year
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- This repository regroups learning ressources about performance estimation problems☆15Mar 18, 2026Updated last month
- Code for the paper "Post-hoc Concept Bottleneck Models". Spotlight @ ICLR 2023☆93May 20, 2024Updated last year
- The loss landscape of Large Language Models resemble basin!☆37Jul 8, 2025Updated 9 months ago
- ☆10Aug 20, 2025Updated 8 months ago
- something for paper agent☆11Dec 18, 2024Updated last year
- ☆24Aug 29, 2025Updated 8 months ago
- Code Release for the 2023 NeurIPS Paper How does GPT-2 compute greater-than?: Interpreting mathematical abilities in a pre-trained langua…☆17Dec 6, 2024Updated last year
- Repository for the Paper: Leave My Images Alone: Preventing Multi-Modal Large Language Models from Analyzing Images via Visual Prompt Inj…☆19Apr 17, 2026Updated 2 weeks ago
- ☆12Aug 8, 2023Updated 2 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- A benchmark for evaluating the efficiency of LLM-generated code☆17Apr 17, 2025Updated last year
- The official repository for Deformable ProtoPNet, as described in "Deformable ProtoPNet: An Interpretable Image Classifier Using Deformab…☆53Dec 3, 2024Updated last year
- ☆18Aug 19, 2024Updated last year
- ☆19Aug 4, 2025Updated 8 months ago
- ☆10Feb 17, 2024Updated 2 years ago
- Finetune Google's pre-trained ViT models from HuggingFace's model hub.☆19Apr 4, 2021Updated 5 years ago
- ☆27Jul 6, 2024Updated last year
- [ACL 2025] Beyond Prompt Engineering: Robust Behavior Control in LLMs via Steering Target Atoms☆41Jun 4, 2025Updated 10 months ago
- ☆10Nov 1, 2021Updated 4 years ago
- End-to-end encrypted cloud storage - Proton Drive • AdSpecial offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
- Official Repo for FoodieQA paper (EMNLP 2024)☆20Jun 26, 2025Updated 10 months ago
- Hyperparameter search and metric visualization tool for personal research.☆18Apr 19, 2026Updated last week
- 在监控画质下实现对校园自行车的重识别,包含REID模型识别,向量数据库检索,UI展示☆11Feb 13, 2024Updated 2 years ago
- Multimodal Classification and Out-of-distribution Detection☆18Apr 4, 2025Updated last year
- Implementation of paper 'Reversing the Forget-Retain Objectives: An Efficient LLM Unlearning Framework from Logit Difference' [NeurIPS'24…☆26Jun 14, 2024Updated last year
- Official code for "Bridging the Gap between Classification and Localization for Weakly Supervised Object Localization (CVPR 2022)"☆29Aug 14, 2023Updated 2 years ago
- EMNLP 2022: Analyzing and Evaluating Faithfulness in Dialogue Summarization☆13Mar 20, 2025Updated last year