[ICLR 25] A novel framework for building intrinsically interpretable LLMs with human-understandable concepts to ensure safety, reliability, transparency, and trustworthiness.
☆33Feb 5, 2026Updated 4 months ago
Alternatives and similar repositories for CB-LLMs
Users that are interested in CB-LLMs are comparing it to the libraries listed below.
- This is the official implementation of the Concept Discovery Models paper.☆15Aug 27, 2023Updated 2 years ago
- [EMNLP 25] An effective and interpretable weight-editing method for mitigating overly short reasoning in LLMs, and a mechanistic study un…☆19Dec 17, 2025Updated 5 months ago
- Hierarchical Universal Modular ANotator☆12May 9, 2026Updated last month
- ICLR 2024: Energy-Based Concept Bottleneck Models: Unifying Prediction, Concept Intervention, and Probabilistic Interpretations☆23May 1, 2025Updated last year
- LCA-on-the-line (ICML 2024 Oral)☆14Feb 13, 2025Updated last year
- Official codebase for FACMIC: Federated Adaptative CLIP Model for Medical Image Classification (Accepted at MICCAI 2024)☆14Jun 21, 2024Updated last year
- ☆33Nov 16, 2025Updated 6 months ago
- Code for the paper: Discover-then-Name: Task-Agnostic Concept Bottlenecks via Automated Concept Discovery. ECCV 2024.☆60Nov 3, 2024Updated last year
- Concept Bottleneck Models, ICML 2020☆255Feb 24, 2023Updated 3 years ago
- Official repo for ICML25 paper: DCBM: Data-Efficient Visual Concept Bottleneck Models☆30Sep 16, 2025Updated 8 months ago
- V2C-CBM: Building Concept Bottlenecks with Vision-to-Concept Tokenizer (AAAI 2025)☆55Jul 5, 2025Updated 11 months ago
- This repository gathers learning resources about performance estimation problems☆15Mar 18, 2026Updated 2 months ago
- A Signal Propagation Perspective for Pruning Neural Networks at Initialization☆14Jun 23, 2020Updated 5 years ago
- [ICLR 2025] FLAT: LLM Unlearning via Loss Adjustment with Only Forget Data☆14Feb 26, 2025Updated last year
- The loss landscape of Large Language Models resembles a basin!☆39Jul 8, 2025Updated 11 months ago
- ☆11Aug 20, 2025Updated 9 months ago
- Self-supervised learning course at CS HSE☆12May 24, 2023Updated 3 years ago
- Code Release for the 2023 NeurIPS Paper How does GPT-2 compute greater-than?: Interpreting mathematical abilities in a pre-trained langua…☆17Dec 6, 2024Updated last year
- An Agent built on the InternLm chat 7B base model that can call MMYOLO tools to complete visual tasks on images☆11Oct 30, 2024Updated last year
- Diabetic Retinopathy Two-field image Dataset (DRTiD) & source code of Cross-Field Transformer for Diabetic Retinopathy Grading on Two-fie…☆28May 28, 2024Updated 2 years ago
- Repository for the Paper: Leave My Images Alone: Preventing Multi-Modal Large Language Models from Analyzing Images via Visual Prompt Inj…☆19Apr 17, 2026Updated last month
- ☆12Aug 8, 2023Updated 2 years ago
- Official code for ''RAG Meets Temporal Graphs: Time-Sensitive Modeling and Retrieval for Evolving Knowledge''.☆34Feb 25, 2026Updated 3 months ago
- A benchmark for evaluating the efficiency of LLM-generated code☆17Apr 17, 2025Updated last year
- The official repository for Deformable ProtoPNet, as described in "Deformable ProtoPNet: An Interpretable Image Classifier Using Deformab…☆54Dec 3, 2024Updated last year
- ☆18Aug 19, 2024Updated last year
- ☆19Aug 4, 2025Updated 10 months ago
- Resa: Transparent Reasoning Models via SAEs☆49Sep 23, 2025Updated 8 months ago
- Code repository for "RL Grokking Recipe: How RL Unlocks and Transfers New Algorithms in LLMs"☆35Oct 12, 2025Updated 8 months ago
- ☆10Feb 17, 2024Updated 2 years ago
- Node-weighted Graph Convolutional Network for Depression Detection in Transcribed Clinical Interviews☆21Jul 1, 2025Updated 11 months ago
- [ACL 2025] Beyond Prompt Engineering: Robust Behavior Control in LLMs via Steering Target Atoms☆41Jun 4, 2025Updated last year
- ☆10Nov 1, 2021Updated 4 years ago
- Official Repo for FoodieQA paper (EMNLP 2024)☆20Jun 26, 2025Updated 11 months ago
- ICCV25 highlight☆58Jan 7, 2026Updated 5 months ago
- Hyperparameter search and metric visualization tool for personal research.☆18Apr 19, 2026Updated last month
- Implementation of paper 'Reversing the Forget-Retain Objectives: An Efficient LLM Unlearning Framework from Logit Difference' [NeurIPS'24…☆26Jun 14, 2024Updated last year
- [ICLR 2025] "Rethinking LLM Unlearning Objectives: A Gradient Perspective and Go Beyond"☆16Feb 27, 2025Updated last year
- The contrastive token loss function for reducing generative repetition of autoregressive neural language models.☆13May 11, 2022Updated 4 years ago