[ICLR 25] A novel framework for building intrinsically interpretable LLMs with human-understandable concepts to ensure safety, reliability, transparency, and trustworthiness.
☆33Feb 5, 2026Updated 4 months ago
Alternatives and similar repositories for CB-LLMs
Users that are interested in CB-LLMs are comparing it to the libraries listed below.
- This is the official implementation of the Concept Discovery Models paper.☆15Aug 27, 2023Updated 2 years ago
- [EMNLP 25] An effective and interpretable weight-editing method for mitigating overly short reasoning in LLMs, and a mechanistic study un…☆19Dec 17, 2025Updated 6 months ago
- Hierarchical Universal Modular ANotator☆12May 9, 2026Updated last month
- Official PyTorch implementation for "Adaptive Multi-scale Online Likelihood Network for AI-assisted Interactive Segmentation" (MONet)☆12Mar 28, 2023Updated 3 years ago
- Concept-based generative models☆12Dec 13, 2024Updated last year
- ICLR 2024: Energy-Based Concept Bottleneck Models: Unifying Prediction, Concept Intervention, and Probabilistic Interpretations☆23May 1, 2025Updated last year
- LCA-on-the-line (ICML 2024 Oral)☆14Feb 13, 2025Updated last year
- ☆24May 20, 2025Updated last year
- The core library of the DFKI multisensor pipeline framework.☆11May 23, 2022Updated 4 years ago
- Official codebase for FACMIC: Federated Adaptative CLIP Model for Medical Image Classification (Accepted at MICCAI 2024)☆14Jun 21, 2024Updated 2 years ago
- ☆34Nov 16, 2025Updated 7 months ago
- [ICLR 23] A new framework to transform any neural networks into an interpretable concept-bottleneck-model (CBM) without needing labeled c…☆143Apr 7, 2026Updated 2 months ago
- This is the official repository for the paper titled "Towards Interpretable Radiology Report Generation via Concept Bottlenecks using a M…☆17Apr 29, 2025Updated last year
- Concept Bottleneck Models, ICML 2020☆255Feb 24, 2023Updated 3 years ago
- MultiPriv offers multilingual, multimodal PII entities and prompts for studying privacy risks in LLMs/VLMs. It also supports broader PII-…☆32Updated this week
- Official repo for ICML25 paper: DCBM: Data-Efficient Visual Concept Bottleneck Models☆31Sep 16, 2025Updated 9 months ago
- V2C-CBM: Building Concept Bottlenecks with Vision-to-Concept Tokenizer (AAAI 2025)☆52Jul 5, 2025Updated 11 months ago
- [NeurIPS 24] A new training and evaluation framework for learning interpretable deep vision models and benchmarking different interpretab…☆34Jun 5, 2025Updated last year
- Official implementation for the paper "Controlled Sparsity via Constrained Optimization"☆12Aug 10, 2022Updated 3 years ago
- ☆13Jun 13, 2023Updated 3 years ago
- ☆10Jun 10, 2024Updated 2 years ago
- 【ICLR 2025 🔥】MMKE-Bench, a challenging benchmark for evaluating diverse semantic editing in real-world scenarios.☆23Apr 19, 2025Updated last year
- ☆12Feb 14, 2026Updated 4 months ago
- A Signal Propagation Perspective for Pruning Neural Networks at Initialization☆14Jun 23, 2020Updated 6 years ago
- Code for the paper "Post-hoc Concept Bottleneck Models". Spotlight @ ICLR 2023☆94May 20, 2024Updated 2 years ago
- [ICLR 2025] FLAT: LLM Unlearning via Loss Adjustment with Only Forget Data☆14Feb 26, 2025Updated last year
- The loss landscape of Large Language Models resemble basin!☆41Jul 8, 2025Updated 11 months ago
- ☆11Aug 20, 2025Updated 10 months ago
- something for paper agent☆11Dec 18, 2024Updated last year
- An Agent built on the InternLm chat 7B large-model base that can call MMYOLO tools to complete visual tasks within images☆11Oct 30, 2024Updated last year
- Repository for the Paper: Leave My Images Alone: Preventing Multi-Modal Large Language Models from Analyzing Images via Visual Prompt Inj…☆19Apr 17, 2026Updated 2 months ago
- ☆12Aug 8, 2023Updated 2 years ago
- A benchmark for evaluating the efficiency of LLM-generated code☆17Apr 17, 2025Updated last year
- ☆18Aug 19, 2024Updated last year
- The first comprehensive multimodal language analysis benchmark for evaluating foundation models☆31Sep 22, 2025Updated 9 months ago
- Resa: Transparent Reasoning Models via SAEs☆49Sep 23, 2025Updated 9 months ago
- Coverage-Guided Testing of Long Short-Term Memory (LSTM) Networks☆18Dec 15, 2020Updated 5 years ago
- Node-weighted Graph Convolutional Network for Depression Detection in Transcribed Clinical Interviews☆21Jul 1, 2025Updated last year
- [ACL 2025] Beyond Prompt Engineering: Robust Behavior Control in LLMs via Steering Target Atoms☆42Jun 4, 2025Updated last year