Trustworthy-ML-Lab / CB-LLMsView on GitHub
[ICLR 25] A novel framework for building intrinsically interpretable LLMs with human-understandable concepts to ensure safety, reliability, transparency, and trustworthiness.
30Feb 5, 2026Updated 3 weeks ago

Alternatives and similar repositories for CB-LLMs

Users that are interested in CB-LLMs are comparing it to the libraries listed below

Sorting:

Are these results useful?