IBM / ICX360Links
In-Context Explainability 360 toolkit
☆28Updated this week
Alternatives and similar repositories for ICX360
Users that are interested in ICX360 are comparing it to the libraries listed below
Sorting:
- EvalAssist is an open-source project that simplifies using large language models as evaluators (LLM-as-a-Judge) of the output of other la…☆77Updated this week
- Risk Atlas Nexus: tooling to bring together resources related to governance of foundation models.☆98Updated this week
- Mellea is a library for writing generative programs.☆142Updated this week
- AI Verify☆32Updated this week
- Open source project for data preparation for GenAI applications☆800Updated this week
- Synthetic Data Generation for Foundation Models☆21Updated 7 months ago
- Granite Snack Cookbook -- easily consumable recipes (python notebooks) that showcase the capabilities of the Granite models☆260Updated last week
- The Granite Guardian models are designed to detect risks in prompts and responses.☆115Updated this week
- ☆45Updated last year
- Collection of evals for Inspect AI☆233Updated this week
- ☆26Updated 4 months ago
- A curated list of awesome synthetic data tools (open source and commercial).☆206Updated last year
- Interpretability and explainability of data and machine learning models☆1,734Updated 6 months ago
- Data Privacy Toolkit☆39Updated last month
- Inspect: A framework for large language model evaluations☆1,336Updated this week
- Run safety benchmarks against AI models and view detailed reports showing how well they performed.☆104Updated this week
- Prompt Declaration Language (PDL) is a declarative prompt programming language.☆229Updated this week
- 🦄 Unitxt is a Python library for enterprise-grade evaluation of AI performance, offering the world's largest catalog of tools and data …☆208Updated this week
- LLM Comparator is an interactive data visualization tool for evaluating and analyzing LLM responses side-by-side, developed by the PAIR t…☆481Updated 7 months ago
- LangFair is a Python library for conducting use-case level LLM bias and fairness assessments☆232Updated last week
- Run the entire bee application stack using docker-compose☆155Updated 6 months ago
- Discover, run, and compose AI agents from any framework.☆765Updated this week
- Examples and guides for building Gen AI applications on the watsonx platform.☆36Updated last week
- AI Agents, LLM Fine-tuning, Developer Productivity, Governance, IBM watsonx☆40Updated this week
- IBM-Generative-AI is a Python library built on IBM's large language model REST interface to seamlessly integrate and extend this service …☆258Updated 9 months ago
- Software for evaluating the quality of synthetic data compared with real data.☆30Updated 5 months ago
- Moonshot - A simple and modular tool to evaluate and red-team any LLM application.☆270Updated 2 weeks ago
- Benchmarks for the Evaluation of LLM Supervision☆32Updated 2 months ago
- A Comprehensive Assessment of Trustworthiness in GPT Models☆303Updated last year
- InstructLab Training Library - Efficient Fine-Tuning with Message-Format Data☆42Updated this week