IBM / ICX360Links
In-Context Explainability 360 toolkit
☆65Updated 2 weeks ago
Alternatives and similar repositories for ICX360
Users that are interested in ICX360 are comparing it to the libraries listed below
Sorting:
- The Granite Guardian models are designed to detect risks in prompts and responses.☆130Updated 3 months ago
- Mellea is a library for writing generative programs.☆303Updated last week
- The Agent Lifecycle Toolkit (ALTK) is a library of components to help agent builders improve their agent with minimal integration effort …☆109Updated last week
- 🦄 Unitxt is a Python library for enterprise-grade evaluation of AI performance, offering the world's largest catalog of tools and data …☆212Updated 2 weeks ago
- The AI Steerability 360 toolkit is an extensible library for general purpose steering of LLMs.☆76Updated 2 weeks ago
- EvalAssist is an open-source project that simplifies using large language models as evaluators (LLM-as-a-Judge) of the output of other la…☆93Updated 2 months ago
- Steering vectors for transformer language models in Pytorch / Huggingface☆140Updated 11 months ago
- Code for "Utility Engineering: Analyzing and Controlling Emergent Value Systems in AIs"☆88Updated 11 months ago
- Python framework which enables you to transform how a user calls or infers an IBM Granite model and how the output from the model is retu…☆54Updated last week
- Persona Vectors: Monitoring and Controlling Character Traits in Language Models☆344Updated 6 months ago
- ☆43Updated last year
- ☆91Updated last month
- A framework for standardizing evaluations of large foundation models, beyond single-score reporting and rankings.☆174Updated 2 weeks ago
- Collection of evals for Inspect AI☆357Updated this week
- ☆257Updated 3 weeks ago
- ☆112Updated 11 months ago
- Governance of the Commons Simulation (GovSim)☆64Updated last year
- ☆261Updated 10 months ago
- AI Atlas Nexus: tooling to bring together resources related to governance of foundation models.☆114Updated last week
- An attribution library for LLMs☆46Updated last year
- A toolkit for describing model features and intervening on those features to steer behavior.☆227Updated last month
- Code for reproducing our paper "Not All Language Model Features Are Linear"☆83Updated last year
- ☆152Updated 4 months ago
- ☆50Updated last year
- Steer LLM outputs towards a certain topic/subject and enhance response capabilities using activation engineering by adding steering vecto…☆268Updated 3 weeks ago
- Official code for NeurIPS 2025 paper "AutoDiscovery: Open-ended Scientific Discovery via Bayesian Surprise"☆126Updated 2 weeks ago
- An open-source compliance-centered evaluation framework for Generative AI models☆179Updated this week
- ☆52Updated 10 months ago
- Sphynx Hallucination Induction☆52Updated last year
- Top papers related to LLM-based agent evaluation☆89Updated 3 months ago