IBM / AssetOpsBench
AssetOpsBench - Industry 4.0
☆900 Updated last week
Alternatives and similar repositories for AssetOpsBench
Users who are interested in AssetOpsBench are comparing it to the repositories listed below.
- In-Situ Evaluator: Real-Time Subsample Analysis ☆15 Updated last week
- EvalAssist is an open-source project that simplifies using large language models as evaluators (LLM-as-a-Judge) of the output of other la… ☆93 Updated 2 months ago
- ☆328 Updated 6 months ago
- ☆59 Updated 2 months ago
- LangFair is a Python library for conducting use-case level LLM bias and fairness assessments ☆251 Updated 3 weeks ago
- The AI Steerability 360 toolkit is an extensible library for general purpose steering of LLMs. ☆76 Updated 2 weeks ago
- The code for the paper ROUTERBENCH: A Benchmark for Multi-LLM Routing System ☆152 Updated last year
- GSM-Symbolic templates and generated data ☆80 Updated last year
- The Granite Guardian models are designed to detect risks in prompts and responses. ☆130 Updated 3 months ago
- In-Context Explainability 360 toolkit ☆65 Updated 2 weeks ago
- A framework for interpreting modern AI systems using Monte Carlo Shapley value estimation. Model-agnostic explainability across language … ☆71 Updated 3 weeks ago
- (ACL 2025 Main) Code for MultiAgentBench: Evaluating the Collaboration and Competition of LLM agents https://www.arxiv.org/pdf/2503.019… ☆213 Updated 3 months ago
- ☆75 Updated 3 weeks ago
- [EMNLP 2025 Demo] TinyScientist: A Lightweight Framework for Building Research Agents ☆126 Updated 3 months ago
- Code for "LayerSkip: Enabling Early Exit Inference and Self-Speculative Decoding", ACL 2024 ☆356 Updated 2 weeks ago
- ☆237 Updated last month
- Persona Vectors: Monitoring and Controlling Character Traits in Language Models ☆344 Updated 6 months ago
- ☆52 Updated 10 months ago
- Run safety benchmarks against AI models and view detailed reports showing how well they performed. ☆117 Updated this week
- Official codebase for "Quantile Reward Policy Optimization: Alignment with Pointwise Regression and Exact Partition Functions" (Matrenok … ☆30 Updated last month
- This repository collects all relevant resources about interpretability in LLMs ☆390 Updated last year
- ☆59 Updated 4 months ago
- Discovering Data-driven Hypotheses in the Wild ☆128 Updated 7 months ago
- ⏰ AI conference deadline countdowns ☆320 Updated last week
- ☆270 Updated 7 months ago
- [NeurIPS 2024] Knowledge Circuits in Pretrained Transformers ☆163 Updated 2 months ago
- ☆519 Updated 6 months ago
- This is an open-source tool to assess and improve the trustworthiness of AI systems. ☆103 Updated last week
- large population models ☆567 Updated this week
- ☆53 Updated last year