IBM / AssetOpsBenchLinks

AssetOpsBench - Industry 4.0

☆121

Alternatives and similar repositories for AssetOpsBench

Users that are interested in AssetOpsBench are comparing it to the libraries listed below

Sorting:

foundation-model-stack / fms-dgt
Synthetic Data Generation for Foundation Models
☆21Updated 5 months ago
AI4LIFE-GROUP / LLM_Explainer
Code for paper: Are Large Language Models Post Hoc Explainers?
☆33Updated 11 months ago
IBM / unitxt
🦄 Unitxt is a Python library for enterprise-grade evaluation of AI performance, offering the world's largest catalog of tools and data …
☆206Updated this week
noahho / CAAFE
Semi-automatic feature engineering process using Language Models and your dataset descriptions. Based on the paper "LLMs for Semi-Automat…
☆164Updated 6 months ago
IBM / eval-assist
EvalAssist is an open-source project that simplifies using large language models as evaluators (LLM-as-a-Judge) of the output of other la…
☆59Updated this week
giorgiopiatti / GovSim
Governance of the Commons Simulation (GovSim)
☆55Updated 5 months ago
ibm-granite / granite-guardian
The Granite Guardian models are designed to detect risks in prompts and responses.
☆91Updated 3 weeks ago
rushrukh / awesome-explainable-ai
A repository for summaries of recent explainable AI/Interpretable ML approaches
☆80Updated 9 months ago
ulab-uiuc / MARBLE
(ACL 2025 Main) Code for MultiAgentBench : Evaluating the Collaboration and Competition of LLM agents https://www.arxiv.org/pdf/2503.019…
☆132Updated this week
AI-secure / aug-pe
[ICML 2024 Spotlight] Differentially Private Synthetic Data via Foundation Model APIs 2: Text
☆41Updated 6 months ago
koo-ec / Awesome-LLM-Explainability
A curated list of explainability-related papers, articles, and resources focused on Large Language Models (LLMs). This repository aims to…
☆36Updated 3 weeks ago
ronigold / TokenSHAP
TokenSHAP: Explain individual token importance in large language model prompts with SHAP values. Gain insights, debug models, detect bias…
☆46Updated 3 months ago
interpretml / LLM-Tabular-Memorization-Checker
Testing Language Models for Memorization of Tabular Datasets.
☆34Updated 5 months ago
MiaoXiong2320 / llm-uncertainty
code repo for ICLR 2024 paper "Can LLMs Express Their Uncertainty? An Empirical Evaluation of Confidence Elicitation in LLMs"
☆122Updated last year
songqiaohu / THU-Concept-Drift-Datasets-v1.0
📖These are the concept drift datasets we made, and we open-source the data and corresponding interfaces. Welcome to use them for free if…
☆31Updated last year
causalNLP / cladder
We develop benchmarks and analysis tools to evaluate the causal reasoning abilities of LLMs.
☆118Updated last year
causalNLP / corr2cause
Data and code for the Corr2Cause paper (ICLR 2024)
☆106Updated last year
py-why / pywhyllm
Experimental library integrating LLM capabilities to support causal analyses
☆224Updated 2 weeks ago
johnnyhwu / Awesome-LLM-Tabular
Awesome-LLM-Tabular: a curated list of Large Language Model applied to Tabular Data
☆404Updated 6 months ago
fangru-lin / graph-llm-asynchow-plan
Code and dataset repo for ICML-2024 paper Graph-enhanced Large Language Models in Asynchronous Plan Reasoning.
☆63Updated 3 months ago
dylan-slack / Tablet
The TABLET benchmark for evaluating instruction learning with LLMs for tabular prediction.
☆21Updated 2 years ago
Awenbocc / LLM-OOD
☆12Updated 11 months ago
microsoft / ConstrainedReasoner
☆11Updated 10 months ago
microsoft / DPSDA
Private Evolution: Generating DP Synthetic Data without Training [ICLR 2024, ICML 2024 Spotlight]
☆99Updated this week
ServiceNow / context-is-key-forecasting
Context is Key: A Benchmark for Forecasting with Essential Textual Information
☆66Updated this week
vinid / NegotiationArena
☆72Updated last year
IBM / ACPBench
ACPBench: Reasoning about Action, Change, and Planning
☆24Updated 2 months ago
tanfiona / LLM-on-Tabular-Data-Prediction-Table-Understanding-Data-Generation
Repository for collecting and categorizing papers outlined in our survey paper: "Large Language Models on Tabular Data -- A Survey".
☆161Updated 9 months ago
opendataval / opendataval
OpenDataVal: a Unified Benchmark for Data Valuation in Python (NeurIPS 2023)
☆99Updated 5 months ago
AI4LIFE-GROUP / OpenXAI
OpenXAI : Towards a Transparent Evaluation of Model Explanations
☆247Updated 11 months ago