Trustworthy-ML-Lab / CB-LLMs

[ICLR 25] A novel framework for building intrinsically interpretable LLMs with human-understandable concepts to ensure safety, reliability, transparency, and trustworthiness.
21 · Updated last month

Alternatives and similar repositories for CB-LLMs

Users who are interested in CB-LLMs are comparing it to the libraries listed below.
