IBM / AssetOpsBenchLinks
AssetOpsBench - Industry 4.0
☆571Updated last week
Alternatives and similar repositories for AssetOpsBench
Users that are interested in AssetOpsBench are comparing it to the libraries listed below
Sorting:
- ☆304Updated 4 months ago
- EvalAssist is an open-source project that simplifies using large language models as evaluators (LLM-as-a-Judge) of the output of other la…☆92Updated 3 weeks ago
- (ACL 2025 Main) Code for MultiAgentBench : Evaluating the Collaboration and Competition of LLM agents https://www.arxiv.org/pdf/2503.019…☆196Updated last month
- Semi-automatic feature engineering process using Language Models and your dataset descriptions. Based on the paper "LLMs for Semi-Automat…☆177Updated last year
- ☆57Updated 2 months ago
- ☆71Updated this week
- Context is Key: A Benchmark for Forecasting with Essential Textual Information☆83Updated 4 months ago
- Efficient multi-prompt evaluation of LLMs☆25Updated last year
- LangFair is a Python library for conducting use-case level LLM bias and fairness assessments☆243Updated this week
- ☆227Updated last month
- The Granite Guardian models are designed to detect risks in prompts and responses.☆123Updated 2 months ago
- Repo for "Adaptation of Agentic AI"☆223Updated this week
- Public code repo for paper "SaySelf: Teaching LLMs to Express Confidence with Self-Reflective Rationales"☆111Updated last year
- [NeurIPS 2024] Knowledge Circuits in Pretrained Transformers☆159Updated last month
- Graph-R1: Towards Agentic GraphRAG Framework via End-to-end Reinforcement Learning☆450Updated 2 months ago
- Library for text-to-text regression, applicable to any input string representation and allows pretraining and fine-tuning over multiple r…☆301Updated this week
- The AI Steerability 360 toolkit is an extensible library for general purpose steering of LLMs.☆54Updated last month
- A framework for standardizing evaluations of large foundation models, beyond single-score reporting and rankings.☆173Updated 2 weeks ago
- In-Context Explainability 360 toolkit☆51Updated this week
- Optimize Any User-defined Compound AI Systems☆63Updated 4 months ago
- ☆52Updated 9 months ago
- [EMNLP 2025 Demo] TinyScientist: A Lightweight Framework for Building Research Agents☆121Updated last month
- A method for steering llms to better follow instructions☆66Updated 4 months ago
- A comprehensive guide to LLM evaluation methods designed to assist in identifying the most suitable evaluation techniques for various use…☆162Updated 2 weeks ago
- ☆148Updated last year
- Latent Collaboration in Multi-Agent Systems☆615Updated this week
- ☆267Updated 5 months ago
- The code for the paper ROUTERBENCH: A Benchmark for Multi-LLM Routing System☆153Updated last year
- A multi-agent framework to fully automate anomaly detection in different modalities, tabular, graph, time series, and more (work in progr…☆83Updated 6 months ago
- ☆162Updated last year